MATHEMATICAL ANALYSIS 
OF PHYSICAL PROBLEMS 


Philip R. Wallace 




MATHEMATICAL 
ANALYSIS OF 
PHYSICAL PROBLEMS 


Philip R. Wallace 
McGill University 


DOVER PUBLICATIONS, INC. 
New York


Copyright © 1972, 1984 by Philip R. Wallace. 
All rights reserved under Pan American and International Copyright 
Conventions. 


Published in Canada by General Publishing Company, Ltd., 30 Lesmill 
Road, Don Mills, Toronto, Ontario. 
Published in the United Kingdom by Constable and Company, Ltd. 


This Dover edition, first published in 1984, is an unabridged and corrected 
republication of the work first published by Holt, Rinehart and Winston, 
Inc., N.Y., in 1972. 


Manufactured in the United States of America 
Dover Publications, Inc., 31 East 2nd Street, Mineola, N.Y. 11501 


Library of Congress Cataloging in Publication Data 


Wallace, Philip R. (Philip Russell) 1915- 
Mathematical analysis of physical problems. 


Reprint. Originally published: New York : Holt, Rinehart and Winston, 
1972. 

Bibliography: p. 

Includes index. 

1. Mathematical physics. I. Title. 
QC20.W27 1984 530.1'55 83-20640 
ISBN 0-486-64676-9 


To my wife Jean, for her patience and 
understanding. 

And to the memory of two men who brought 
both dignity and excitement to the study of 
physics: Leopold Infeld and Georges Placzek. 


PREFACE 


“After having spent years trying to be accurate, we
must spend as many more in discovering when and
how to be inaccurate.”

Ambrose Bierce 


In writing a book on mathematical physics, the author is from the beginning 
faced with the fact that he has not chosen a uniquely defined subject. The 
task would have been easier in the nineteenth century, when physics was 
somewhat more stable and mathematics and physics lived in closer 
association with each other. Today, the words “mathematical physics”
have, by fairly general consent, in North America at least, taken on a 
special and rather conventional meaning, though there is still some 
fuzziness at the boundaries with other areas. As exemplified by the 
American Physical Society’s journal of that name, the phrase signifies 
the investigation of mathematical techniques current in physics. The 
emphasis is on methodology rather than on physical content. In other 
places and other times, however, the separation between form and content 
is, and has been, less marked. 

It is important, first, to understand the difference in purpose of a book 
such as this and a book on mathematics per se. Modern mathematics is 
primarily concerned with axiomatic systems and formal deductions made 
within these systems. In this sense, it consists of games: The axiomatic 
system defines the rules of the game, and one then plays according to
these rules. The rules need not have any obvious relation to the world in
which we live.




The physicist, on the other hand, is concerned with understanding 
and describing the physical aspects of the world in which he lives. It is 
this which determines and governs his “mathematics.” For him mathematics
is a language, a shorthand, for describing and coordinating his
comprehension of that world. The mathematician is concerned with rigor
and with the completeness and logical consistency of his systems. The
“rules of the game” must be fully defined and adhered to. The criterion
which the physicist applies to his “mathematics” is its conformity to
nature. Mathematical symbols represent the magnitudes of physical
quantities, and “mathematical” formulas express the relations between
these quantities. His systems are, and can be, only as complete as his 
comprehension. As for consistency, it is assured by the correspondence 
between his mathematical models and physical reality; he is prepared to 
assume the internal consistency of nature! Thus, as has been pointed out 
by Landau (and many others), mathematical rigor has no relevance to 
physics. 

This must be the major feature distinguishing a book on mathematics 
from one on mathematical physics. The one proceeds axiomatically, 
governed by the requirements of rigor. The other tries to construct 
workable (and necessarily approximate and incomplete) models of 
aspects of physical reality. To attempt to judge either by the standards of 
the other is therefore inappropriate and mistaken. 

When, in the present book, mathematical structures are described (as
in chapter 2 on Linear Vector Spaces), they should be considered as the
description of a common framework within which theories can be
constructed of a variety of different sorts of physical phenomena. The
important questions are whether the mathematical framework serves as
an effective description of the physics, and whether it provides us with
answers to physical problems which can be tested by physical observation.
No greater mathematical generality is relevant than is required to
describe the phenomena with which we are concerned.

Given this general definition of our aims, a few words should be said 
about the particular material which we have chosen to include in this 
book. Clearly, any book of this sort could not be “complete” without
being encyclopaedic; it must represent an arbitrary selection of topics for 
discussion. Our selection has been chosen to link classical and modern 
physics through common techniques and concepts. The first chapter is on 
vibrating strings, in other words, on problems of one-dimensional wave 
propagation. Aside from recognizing the wide importance of wave 
phenomena in physics, this subject provides us with a testing ground for a
wide range of concepts and methods with wider relevance, and does so in 
a simple and familiar physical context. Thus, when similar problems are 
met in newer and less familiar physical contexts, it will not be necessary 
to cope with technical difficulties as well as conceptual ones. 




Chapter 2, on linear vector spaces, provides the thread which ties
together most of the rest of the book. This is, of course, the basic
conceptual framework of quantum mechanics; it also unifies our treatment of
the problems of classical physics and provides us with some essential
mathematical tools.

In chapter 3 we introduce the problems of potentials and the Laplace 
and Poisson differential equations. Since these are three-dimensional 
problems, we are led to introduce the method of separation of variables. 
An important feature is the introduction of spherical harmonics, which 
reappear later in other problems, including that of angular momentum in 
quantum mechanics. We have also tried, in this chapter, to illustrate how 
physical considerations can be used to provide the motivation for the 
development of mathematical techniques. 

The fourth chapter is again primarily a ‘mathematical’ one. It is 
concerned with the methods of Laplace and Fourier transforms and the 
relation between them. The methods are illustrated with a selection of 
useful examples. 

Chapters 5 and 6 deal with important classes of problems in classical 
physics. Chapter 5 is concerned with problems of wave propagation in 
three dimensions, and chapter 6 with problems primarily of a diffusive 
character. Problems of the propagation of electromagnetic waves are 
considered at some length. Our intention is not to replace a more extensive 
and detailed treatment such as given in Jackson’s book; chapter 5 does (along 
with chapter 3) provide a somewhat more streamlined account of some of 
the basic problems of the field. 

Chapter 7 is concerned with probabilistic methods in physics. Some 
basic methods and concepts are introduced, and the basis is provided of a 
more fundamental description of some of the diffusive processes dealt 
with in chapter 6. 

The final three chapters are devoted to a discussion of the fundamentals
of quantum mechanics. Any treatment of methods of mathematical
physics which was arbitrarily confined to “classical” physics would be
artificial, as well as forgoing the pedagogical advantage of exploiting the
technical similarities of many classical and quantum problems. Our
emphasis is on useful techniques in quantum mechanics. In chapter 8 we 
deal with problems of a quite general nature, including a fairly extensive 
treatment of the time evolution of quantum systems, and of perturbation 
theory. In chapter 9 we deal with the mathematical theory of some standard
problems: the hydrogen atom, the harmonic oscillator in one, two,
and three dimensions, and angular momentum theory. Extensive use is
made of ladder (or creation and annihilation) operators, which provide 
the framework for the introduction of elementary field theoretical 
methods in the last chapter. Chapter 10 provides an introduction to
many-body problems using the occupation number representation, and
introduces a number of fundamental problems: the Hartree-Fock method,
the electron gas, and the theory of density matrices and linear response. 

As for omissions, perhaps the most striking is that of group theory. 
This is partly arbitrary, since any such book must be selective in its 
coverage; an additional justification is, however, that it is a difficult 
subject in which to give an adequate and self-contained treatment of 
modest length, while for longer and more extensive treatments, there have 
appeared in recent years a very large number of excellent books on 
group-theoretical methods in physics. 

It is evident that, given the rapid evolution of physics curricula at the 
present time, and the great variety of ways in which the sort of mathematical
physics incorporated in this book is distributed among course
units in different universities, it is almost impossible to write a “textbook”
which will correspond simultaneously to the patterns of a very large 
number of universities. It seems to the author, however, that the very 
notion of a course textbook at this level is unrealistic, and that the 
advanced undergraduate or beginning graduate student should use a 
number of books in any course. My purpose is, therefore, to produce a
generally useful book; one which will be a valuable addition to the 
personal library of students of physics. If it fits the needs of particular 
courses, so much the better. 

A final word about problems. We have departed from the usual 
practice of providing collections of problems at the ends of chapters, but 
have instead interspersed them throughout the text. Some of these, in 
fact, form an integral part of the text, and will serve to test the student’s 
grasp of what he has read, and his ability to extend it. Others are exercises 
in the techniques dealt with in the text. An effort has been made to avoid 
“problems for problems’ sake.” On the other hand, it is impossible to
overemphasize the importance of the student “doing” for himself, and not
merely learning theory by memory. I have tried, in many of the problems, 
to raise the sort of question which a good student might ask himself. If he 
is encouraged to ask further questions, and to seek their answers, their 
purpose will have been served. 

I should like to express my thanks to Dr. Robert Heck for carefully 
checking the manuscript. 

P. R. Wallace 


CONTENTS


Preface
References


PRELUDE TO CHAPTER 1
THE VIBRATING STRING

1. Introduction
2. Derivation of the Equation of Motion
3. Solution of the Equation
4. Energy of the String
5. Energy in the Harmonics
6. The “Loaded” String
7. Reflection and Transmission at a Fixed Mass
8. Propagation on a String with Regularly Spaced Masses Attached
9. Reflection by and Transmission through a Section of Different Density
10. Inhomogeneous String and the Method of Separation of Variables
11. Boundary Conditions and the Eigenvalue Problem
12. Orthogonality of Eigenfunctions
13. Rayleigh-Ritz Variational Principle
14. Approximate Calculation of Eigenvalues from the Variational Principle
15. Expansion in Eigenfunctions
16. The Inhomogeneous Problem for the Vibrating String
17. Green’s Function
18. Effect of a Perturbation of Density
19. The JWKB Method
20. An Example
21. Lagrangian and Hamiltonian Formulations of the Vibrating String Problem


PRELUDE TO CHAPTER 2
LINEAR VECTOR SPACES

1. Introduction
2. Vector Spaces
3. Linear Independence, Dimensionality, and Bases
4. Scalar Products
5. Schmidt Inequality and Orthogonalization
6. An Example of the Schmidt Procedure
7. Matrix Representation of Vectors and Transformation of Basis
8. Linear Operators and Their Matrix Representations
9. The Eigenvalue Problem for Hermitian Operators
10. Another Example
11. Sturm-Liouville Problem and Linear Vector Spaces


PRELUDE TO CHAPTER 3
THE POTENTIAL EQUATION

1. Introduction: Electrostatic Potential
2. Solution of Laplace’s Equation in Spherical Coordinates
3. The θ Equation and the Factorization Method
4. Spherical Harmonics
5. Radial Solution and the General Solution of Laplace’s Equation
6. Legendre Polynomials
7. An Alternative Derivation of Legendre Polynomials: Multipoles
8. Multipole without Axial Symmetry and Associated Legendre Functions
9. An Addition Theorem for Spherical Harmonics
10. Potential of a Given Charge Distribution
11. Potential of an Axially Symmetric Charge Distribution
12. Potentials of Charge Distributions under Various Boundary Conditions: Green’s Functions
13. Further Problems Involving the Potential Equation
Appendix 3A: A Recursion Relation for P_l^m(cos θ)
Appendix 3B: Review of Theory of Linear Differential Equations of the Second Order


PRELUDE TO CHAPTER 4
FOURIER AND LAPLACE TRANSFORMS AND THEIR APPLICATIONS

1. Introduction: Fourier Transform in One Dimension
2. The Convolution Theorem
3. Causality and Dispersion Relations
4. Linear Response Functions
5. Cross-Correlation and Autocorrelation Functions
6. Fourier Transform in Three Dimensions
7. Solution of Poisson’s Equation by Fourier Transforms
8. Poisson’s Summation Formula in One and Three Dimensions
9. A Note on Delta Functions and Three-Dimensional Transforms
10. Two- and Three-Center Integrals
11. Laplace Transforms
12. Transforms of Derivatives
13. “Shifting” Theorem
14. Convolution Theorem
15. Some Simple Transforms
16. Laplace Transform of a Periodic Function
17. Resonance
18. Use of Laplace and Fourier Transforms to Solve the Vibrating String Problem
19. The Gamma Function
20. Stirling’s Formula
21. The Beta Function
22. Use of Transforms for Equations with Linear Coefficients
23. The Confluent Hypergeometric Function
24. Laguerre Functions
25. Hermite Functions
26. Equations Reducible to the Confluent Hypergeometric: The Bessel Equation
27. General Properties of Bessel Functions
28. Second Solution of the Bessel Equation
29. Zeros of Bessel Functions
30. Hankel Functions
31. Further Formulas Involving Bessel Functions


PRELUDE TO CHAPTER 5
PROPAGATION AND SCATTERING OF WAVES

1. Introduction
2. Sound Waves. Derivation of the Equations
3. Dynamics of Sound Waves
4. Lagrangian and Hamiltonian Formulation
5. Guided Waves
6. Wave Equation with Sources
7. Spherical Waves
8. Expansion of a Plane Wave in Spherical Waves
9. Radiation from a Periodic Source
10. Time-Varying Source
11. Radiation from a Moving Source
12. Solution with Initial and Boundary Conditions
13. Waves in Guides and Enclosures
14. Spherical Enclosure
15. Propagation Down a Cylindrical Tube
16. Scattering of Sound Waves
17. Inequalities Satisfied by the Cross Sections
18. Elastic Scattering by Small Sphere
19. Propagation of Electromagnetic Waves
20. “Spin” of Scalar and Vector Fields
21. Interaction of Fields and Particles and the Dynamics of the Electromagnetic Field
22. Lagrangian and Hamiltonian of the Field
23. Normal Modes of the Electromagnetic Field
24. Coulomb Gauge and Pure Radiation Field
25. Plane Waves
26. Plane Waves as Normal Modes
27. Radiation from Given Sources: Multipole Fields
28. Radiation from a Uniformly Charged Vibrating Liquid Drop
29. Radiation from an Accelerating Point Charge
30. Motion in a Straight Line: Cerenkov Radiation
31. Plasmas in a Magnetic Field
32. The Dielectric Tensor
33. Excitation of Helicons by a Plane Wave Normally Incident on an Interface
Appendix 5A: Method of Steepest Descents
    (i). Approximate Formulas for Spherical Bessel Functions of Large Order
    (ii). Example: Asymptotic Expansion of the Gamma Function
    (iii). The Spherical Hankel Functions
Appendix 5B: Units in Electromagnetic Theory


PRELUDE TO CHAPTER 6
PROBLEMS OF DIFFUSION AND ATTENUATION

1. Introduction
2. Diffusion in a Gas or a Solid
3. Conduction of Heat in a Solid
4. Green’s Functions for Diffusion and Heat Conduction Problems
5. Green’s Function in One Dimension: Semi-Infinite and Finite Media
6. Green’s Functions: Spherical Geometry
7. General Problem with Initial and Boundary Conditions
8. Green’s Functions and the Free-Particle Schrödinger Equation
9. Diffusion of Neutrons
10. Diffusion of “Thermal” Neutrons
11. Moderation of Fast Neutrons
12. Model of a Neutron Chain Reactor
13. The Problem of Criticality
14. Solution of the Steady-State Equations
15. Diffusion Length, Slowing-Down Length, and Migration Length
16. Electromagnetic Waves in a Dissipative Medium
17. Propagation into a Conducting Medium
18. Skin Effect: General Considerations
19. The Equations of Superconducting Electrodynamics
20. Meson Equation


PRELUDE TO CHAPTER 7
PROBABILITY AND STOCHASTIC PROCESSES

1. Introduction
2. Probability Generating Functions
3. Density Fluctuations in a Gas
4. The Poisson Distribution
5. A Problem in Counting of Radioactive Decays
6. Probability Distributions for Continuous Variables
7. Distribution of a Sum of Independent Continuous Random Variables
8. Relation between Random Walk Problem and Diffusion
9. Problems Involving Multiplication
10. The Markoff Method
11. Random Distribution of Energy among N Particles
12. Stochastic Processes


PRELUDE TO CHAPTER 8
FUNDAMENTAL PRINCIPLES OF QUANTUM MECHANICS

1. Introduction
2. Hypotheses of Quantum Mechanics
3. Uncertainty Principle
4. Unitary Operators
5. Representation of Vectors by Eigenvectors of Continuous Coordinates
6. Schrödinger Wave Equation
7. Momentum Representation
8. Time Evolution of a System
9. Interaction Representation
10. Time Evolution Operator
11. Case of a Time-Independent Perturbation
12. Some Useful Relations
13. Calculation of Total Transition Probability
14. Transition to a Definite Final State
15. Case of a Time-Dependent Perturbation
16. First-Order Perturbation
17. Perturbation Theory for Stationary States
18. Rayleigh-Schrödinger Perturbation Theory
19. Degenerate Case
20. Brillouin-Wigner Perturbation Theory


PRELUDE TO CHAPTER 9
SOME SOLUBLE PROBLEMS OF QUANTUM MECHANICS

1. Introduction
2. Harmonic Oscillator in One Dimension
3. Schrödinger Wave Functions for the Harmonic Oscillator
4. Elements of the Theory of Angular Momentum
5. Matrix Representation of Angular Momentum Operators
6. Schwinger Formulation
7. Addition of Angular Momenta: Clebsch-Gordan Coefficients
8. Techniques of Calculation
9. Two-Dimensional Harmonic Oscillator
10. Three-Dimensional Oscillator
11. Three-Dimensional Oscillator; Direct Solution
12. Hydrogen Atom
13. Positive Energy States and the Problem of Scattering
14. Partial-Wave Theory of Scattering by a Potential
15. Low-Energy Scattering


PRELUDE TO CHAPTER 10
QUANTUM MECHANICS OF MANY-BODY PROBLEMS

1. Introduction
2. Commutation Rules for the ψ Operators
3. Representation of Two-Particle Operators
4. Hartree-Fock Method
5. Hartree-Fock for Free Interacting Particles
6. Kinetic and Exchange Energies of a “Fermi Gas” of Electrons
7. Condition of Instability of the Zero-Spin State
8. Total Kinetic and Exchange Energies of the Electron Gas
9. Excited States of Many-Body Systems
10. Density Matrix
11. Alternative Method (Fano)
12. Density Matrix of the Canonical Ensemble
13. Grand Canonical Ensemble
14. Bose-Einstein and Fermi-Dirac Distributions
15. Linear Response Theory
16. An Example: Transverse Conductivity of an Electron Gas
17. Propagation of Electromagnetic Waves in an Electron Gas


Index

REFERENCES 


We list first a number of general references, largely though not entirely of a 
mathematical character. 


Courant, R. and D. Hilbert, Methods of Mathematical Physics. New York: 
Wiley-Interscience, 1961. 

Goertzel, G. H. and N. Tralli, Some Mathematical Methods of Physics. New 
York: McGraw-Hill, 1960. 

Irving, J. and N. Mullineux, Mathematics in Physics and Engineering. New 
York: Academic, 1959. 

Jeffreys, H. and B. S. Jeffreys, Methods of Mathematical Physics. London: 
Cambridge Press, 1956. 

Margenau, H. and G. M. Murphy, The Mathematics of Physics and Chemistry. 
Princeton, New Jersey, Van Nostrand, 1961. (This book is more oriented 
toward physics than most, and, although old, is somewhat more in the spirit 
of the present book.) 

Morse, P. M. and H. Feshbach, Methods of Theoretical Physics. New York: 
McGraw-Hill, 1953, 2 vols. 

Sommerfeld, A., Partial Differential Equations in Physics. New York: Academic, 
1949. 

Webster, A. G., Partial Differential Equations of Mathematical Physics. New 
York: Dover, 1955. 




Whittaker, E. T. and G. N. Watson, A Course of Modern Analysis. London: 
Cambridge Press, 1958. 


There are also several useful references on topics some knowledge of which 
is assumed in the present book, for example, ordinary differential equations and 
complex variable theory, (the knowledge assumed is, however, far less than that 
covered by these books). 


Byerly, W. E., Introduction to the Calculus of Variations. Cambridge, Mass.: 
Harvard University Press, 1933. 

Heins, M., Selected Topics in the Classical Theory of Functions of a Complex 
Variable. New York: Holt, Rinehart and Winston, 1962. 

Ince, E. L., Integration of Ordinary Differential Equations. New York: Wiley- 
Interscience, 1956. 

MacRobert, T. M., Functions of a Complex Variable. London: Macmillan, 
1950, 3rd ed. 

Pennisi, L. L., Elements of Complex Variables. New York: Holt, Rinehart and 
Winston, 1963. 

Smith, L. P., Mathematical Methods for Engineers and Scientists. Englewood 
Cliffs, N.J.: Prentice-Hall, 1953. 


For the student wishing problems supplementary to those given in this book, 
there exist collections of problems, mostly with solutions: 


Cronin, J. A., D. F. Greenberg, and V. Telegdi, University of Chicago Graduate 
Problems in Physics. Reading, Mass.: Addison-Wesley, 1967. 

Goldman, J. J. and V. D. Krivchenkov, Problems in Quantum Mechanics. 
Reading, Mass.: Addison-Wesley, 1961. 

Lebedev, N. N., I. P. Stalskaya and Y. S. Uflyand, Worked Problems in Applied 
Mathematics. New York: Dover, 1979. 

Misyurkeyev, I. V., Problems in Mathematical Physics. New York: McGraw-
Hill, 1966.

Smirnov, M. M., Problems on the Equations of Mathematical Physics. Groningen, 
Netherlands: P. Noordhoff, 1966. 

ter Haar, D., Selected Problems in Quantum Mechanics. Infosearch, 1964.


Finally, there are various extremely useful books of numerical tables and tables 
of formulas: 


Abramowitz, M. and I. A. Stegun, Handbook of Mathematical Functions. New 
York: Dover, 1965. 

Dwight, H. B., Tables of Integrals and Other Mathematical Data. London: 
Macmillan, 1957. 

Erdelyi, A. W., W. Magnus, F. Oberhettinger, and F. G. Tricomi, Higher 
Transcendental Functions. New York: McGraw-Hill, 1953, 3 vols. 

Gradshteyn, I. S. and I. M. Ryzhik, Tables of Integrals, Series, and Products.
New York and London: Academic, 1965. 

Gröbner, W. and N. Hofreiter, Integraltafeln. Berlin: Springer, 1957, 2 vols.




Jahnke, E., F. Emde, and F. Losch, Tables of Higher Functions. New York: 
McGraw-Hill, 1960. 


Jolley, L. B. W., Summation of Series. New York: Dover, 1961. 

Madelung, E., Mathematical Tools for the Physicist. New York: Dover, 1943. 

Magnus, W. and F. Oberhettinger, Formulae and Theorems for the Special 
Functions of Mathematical Physics. New York: Chelsea, 1949. 


Sneddon, I. N., Special Functions of Mathematical Physics and Chemistry. New 
York: Wiley-Interscience, 1956.




PRELUDE TO CHAPTER 1 


In this chapter as in subsequent ones, we shall provide
an outline of the topics with which the chapter deals.
In addition to providing a guide to the organization of
its contents, we indicate the mathematical background
assumed, and call attention to those methods and
concepts which have an application in different physical
problems.

The main theme of Chapter 1 is waves. A useful
reference is the small book of Coulson (Waves¹). We
deal here only with one-dimensional problems; the
theme is picked up again, in the context of three
dimensions, in Chapter 5. Many of the concepts and methods
developed here are capable of generalization to the
three-dimensional case.

Because the physical context of our discussion is
that of the vibrating string, we first provide a derivation
of the equation of motion of a stretched elastic string.
We then discuss very general solutions of this equation
for an “infinitely long” string. We first obtain solutions
in terms of arbitrary functions, which are determined
by initial conditions. This leads to rather simple
physical pictures of the motion.
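
As a minimal sketch of a solution in terms of arbitrary functions, the following evaluates the displacement of an infinite string released from rest with a given initial shape. The names `dalembert` and `pulse`, the wave speed `c`, and the Gaussian test shape are illustrative assumptions, not notation taken from the chapter. Half of the initial shape travels to the right and half to the left, each without change of form.

```python
import numpy as np

def dalembert(shape, x, t, c=1.0):
    """Displacement y(x, t) of an infinite string released from rest.

    shape: the initial displacement y(x, 0), given as a function of x.
    c: the wave speed (an illustrative parameter; c = 1 by default).
    """
    # Half of the initial shape moves to the right, half to the left.
    return 0.5 * (shape(x - c * t) + shape(x + c * t))

def pulse(x):
    """A Gaussian bump, chosen only as a convenient test shape."""
    return np.exp(-x**2)

x = np.linspace(-10.0, 10.0, 2001)
y_initial = dalembert(pulse, x, 0.0)   # the original bump, centered at x = 0
y_later = dalembert(pulse, x, 3.0)     # two half-height bumps near x = -3 and x = +3
```

A nonzero initial velocity would add a second term, an integral of the initial velocity over the interval from x − ct to x + ct, which this sketch omits.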

It is then shown that these solutions may be
adapted to describe the “reflection” of a wave at an
end which is fixed. If there are two such fixed ends, it
is then shown that the solution is periodic, and is thus
expansible in a Fourier series (an elementary
knowledge of Fourier series is assumed).

The string problem is next discussed from the 
viewpoint of energy. Energy density (energy per unit 
length) and energy flux are defined. Two important 
results are obtained: 

(1) The energies of waves traveling in opposite
directions are distinct in an infinite string; two such
waves can pass through each other and emerge
unaltered. That is, the waves do not scatter each other.
This is a consequence of the linear approximation. If
nonlinear terms were kept, they would in fact give rise
to mutual scattering of waves.

(2) The velocity of energy transport (group
velocity) is the same as the phase velocity.

¹Publishing information may be found in the list of references concluding the prelude to
each chapter for books not specifically footnoted throughout.
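
Result (2) can be checked numerically for a single traveling wave. The sketch below assumes the standard expressions for a string of linear density `rho` under tension `tension`: an energy density ½ρ(∂y/∂t)² + ½T(∂y/∂x)² and an energy flux −T(∂y/∂t)(∂y/∂x). These symbols, the numerical values, and the Gaussian wave form are assumptions made for illustration; the chapter's own notation may differ.

```python
import numpy as np

rho, tension = 2.0, 8.0          # illustrative values, not taken from the text
c = np.sqrt(tension / rho)       # phase velocity of waves on the string

def wave_slope(u):
    """Derivative f'(u) of the wave form f(u) = exp(-u^2)."""
    return -2.0 * u * np.exp(-u**2)

x = np.linspace(-5.0, 5.0, 1001)
t = 0.7

# For a right-moving wave y = f(x - c t):
y_x = wave_slope(x - c * t)          # dy/dx = f'
y_t = -c * wave_slope(x - c * t)     # dy/dt = -c f'

kinetic = 0.5 * rho * y_t**2
potential = 0.5 * tension * y_x**2
energy_density = kinetic + potential
energy_flux = -tension * y_t * y_x   # power carried past the point x

# The energy moves with the phase velocity: flux = c * density.
transport_velocity_matches = np.allclose(energy_flux, c * energy_density)
```

For such a wave the kinetic and potential energy densities are also equal at every point.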

Turning to harmonics (Fourier components), it is 
verified that each harmonic also has its own energy 
content, determined by initial conditions; there is no 
transfer of energy between harmonics (again, this is a 
consequence of linearity). 

The next problem treated is that of scattering,
which, in one dimension, is that of determining
reflection and transmission at some point of irregularity. An
example is that of reflection at a point mass fixed to the
string. We have introduced into the problem the
concept of the scattering matrix (S matrix), which plays an
essential role in modern quantum scattering theory,
but which here takes a particularly simple form and
interpretation.
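
A minimal sketch of the point-mass problem, under stated assumptions: an incident wave exp[i(kx − ωt)] arrives from the left, a mass `mass` is attached at x = 0, and the string has linear density `rho`. Continuity of the displacement and Newton's law for the mass then give the amplitudes used below, with the dimensionless parameter `alpha` = Mk/2ρ. The function names and the ordering of the matrix elements are our own choices; the chapter may define its S matrix with a different convention.

```python
import numpy as np

def point_mass_amplitudes(k, mass, rho):
    """Reflection and transmission amplitudes for a point mass on a string.

    Assumes an incident wave exp[i(kx - wt)] arriving from the left, a mass
    `mass` attached at x = 0, and a string of linear density `rho`.
    """
    alpha = mass * k / (2.0 * rho)      # dimensionless strength of the scatterer
    reflected = -alpha / (alpha + 1j)
    transmitted = 1j / (alpha + 1j)     # equals 1 + reflected (continuity at x = 0)
    return reflected, transmitted

def scattering_matrix(k, mass, rho):
    """2x2 matrix taking incoming amplitudes (from left, from right) to outgoing ones."""
    r, t = point_mass_amplitudes(k, mass, rho)
    return np.array([[r, t], [t, r]])

S = scattering_matrix(k=2.0, mass=0.5, rho=1.0)
# Conservation of energy appears as unitarity of S.
unitarity_error = np.abs(S.conj().T @ S - np.eye(2)).max()
```

The reflected fraction |r|² = α²/(1 + α²) grows with frequency, so in this model a heavy mass or a short wavelength reflects almost completely.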

We then consider the problem of regularly spaced
scatterers, a prototype of the “periodic lattice” type
of problem which arises in other areas, notably
electrical networks and lattice vibrations (“phonons”) in
solids [see Brillouin, Wave Propagation in Periodic
Structures²]. The existence of “forbidden bands” of
frequency is demonstrated.
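
For a string loaded with equal masses M at equal spacing a, one standard way to exhibit the forbidden bands is through a Bloch-type condition, cos(qa) = cos(ka) − (Mk/2ρ) sin(ka). A propagating wave (real q) exists only where the right-hand side lies between −1 and 1. The sketch below assumes this form, with `spacing`, `mass`, and `rho` as illustrative names, rather than following the chapter's own derivation.

```python
import numpy as np

def bloch_rhs(k, spacing, mass, rho):
    """Right-hand side of cos(q a) = cos(k a) - (M k / 2 rho) sin(k a)."""
    return np.cos(k * spacing) - (mass * k / (2.0 * rho)) * np.sin(k * spacing)

def is_forbidden(k, spacing, mass, rho):
    """True where no real Bloch wave number q exists, that is, |rhs| > 1."""
    return np.abs(bloch_rhs(k, spacing, mass, rho)) > 1.0

# Scan wave numbers for unit spacing, unit mass, and unit density.
k = np.linspace(0.01, 10.0, 1000)
forbidden = is_forbidden(k, spacing=1.0, mass=1.0, rho=1.0)
forbidden_fraction = forbidden.mean()   # share of the scanned range that cannot propagate
```

With these parameters ka = 1 lies in an allowed band, while ka = 3, just below π, lies in a forbidden one. Setting `mass` to zero removes the forbidden bands altogether.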

The problem of scattering by a continuous
scatterer (e.g., a portion of string of different density) is
also dealt with. This problem is analogous to that of
“potential barriers” in quantum mechanics, and is
mathematically almost identical for the
one-dimensional problem.

The treatment, up to this point, has been confined
to the uniform string. Turning now to the
inhomogeneous one, a different approach is adopted: that of
separation of variables. This is the almost universal
method of dealing with partial differential equations
in mathematical physics. It leads to the so-called
Sturm-Liouville eigenvalue problem. In the case of the
homogeneous string, it leads again to the Fourier
(harmonic) expansion. Solutions are sought such that every
point of the string vibrates with the same frequency.

²New York: Dover, 1946.

It is found that the boundary conditions can in
general only be satisfied for specific discrete
frequencies, which determine the spectrum of excitations.
Each such solution has its characteristic pattern
(normal mode). The orthogonality of the normal
modes is proven; from this it follows (again, as in the
homogeneous case) that each normal mode has its
characteristic energy, and there is no mutual scattering.

Eigenvalue problems of the type dealt with here 
are the commonest form of problems met in quantum 
mechanics, as is seen in Chapters 8 and 9. 

The next topic introduced is that of the variational 
principle of Rayleigh and Ritz. It is shown that the 
normal modes are those solutions which minimize a 
certain quantity, which is related to the frequency of 
the mode. This principle, which also has wide applica- 
tion in quantum mechanics (especially where exact 
solutions cannot conveniently be found), permits us to 
determine the normal frequencies (particularly the 
lowest ones) approximately. Examples are given. 

It is shown that general solutions may be ex- 
pressed as linear combinations of “normal modes.” 
The variational principle is used in this demonstration. 

The problem of a string acted upon by external 
forces is used to introduce the “Green’s function” of
the problem. The Green’s function is the response of
the string to a “unit impulse” or fundamental disturbance,
spatially localized and instantaneous in time. The
response to a general disturbance is then obtained by
adding those of the constituent “unit impulses.” This
approach is used in all areas of mathematical physics, 
and is a very fruitful one. The particular case of a 
periodic disturbance, leading to the phenomenon of 
“resonance,” is fully discussed. 

Two useful approximate methods are then intro- 
duced, each of which is best known for its use in quan- 
tum mechanics. These methods are used when exact 
analytical solutions are unattainable. The first is the 
method of “perturbations,” applicable when the prob- 
lem differs only by a small amount from a soluble one. 
The other is the JWKB (Jeffreys–Wentzel–Kramers–Brillouin)
method, applicable when the properties of
the string are slowly varying. Examples are given of
both methods. 

Finally, the string problem is treated by the 
Lagrangian and Hamiltonian methods of classical 
mechanics. This is a simple prototype of the problem 
of the dynamics of continuous fields. Since Hamiltonian 
mechanics can be used as the basis of the quantum 
mechanics of a problem, methods and concepts simi- 
lar to those developed here (though of course more 
complicated in detail) are fundamental to such prob- 
lems as quantum electrodynamics and quantum field 
theory. 


REFERENCES 
Coulson, C. A., Waves. Edinburgh: Oliver and Boyd, (5th ed.) 1948.
Goldstein, H., Classical Mechanics. Reading, Mass.: Addison-Wesley, 1957.
Lord Rayleigh, Theory of Sound. New York: Dover, 1945, 2 Vols.
Morse, P. M., Vibration and Sound. New York: McGraw-Hill, 1948, Chapter 3.
Schwartz, L., Mathematics for the Physical Sciences. Reading, Mass.: Addison-Wesley, 1966.


THE VIBRATING 
STRING 


“Harp not on that string”
Shakespeare, Richard III


1. Introduction 


We begin with a discussion of the stretched vibrating string for a 
number of reasons. In the first place, it is a simple one-dimensional sys- 
tem, and so does not lead to too many mathematical complications. 
Secondly, it gives us an opportunity, in this fairly simple context, to dis- 
cuss the main features of wave propagation. Furthermore, it is a problem 
which may be linked rather directly with the methods of classical mechan- 
ics, which are usually first met for discrete systems but may here be ex- 
tended to a continuous one. Finally, it provides a testing ground for many 
of the concepts and methods which are met later in more complicated 
systems. 


2. Derivation of the Equation of 
Motion 


We concentrate our attention on the purely transverse small vibra- 
tions of an elastic string, such as a violin string, under constant tension. 
Transverse vibrations are defined as those in which each point of the 
string is displaced perpendicular to its length. We deal, later, with a purely 
longitudinal motion, in which each point is displaced along the string. 




Let x be a coordinate measured along the length of the string, and 
y(x,t), (t = time), be the transverse displacement at point x and time t.
Thinking in terms of conventional dynamics, the coordinate x enumerates 
the degrees of freedom of the system, which are here continuous. The 
dynamical variables are the y’s for each value of x and they are, of course, 
functions of the time. The problem is idealized by taking the cross section 
of the string to be negligibly small. 

Because the dynamical variables form a continuous set, a little care 
must be taken to set up the equations of motion. The variables are, of 
course, coupled to each other. The most convenient device to deal with 
this problem is to consider the motion of the center of mass of a very small 
portion of the string. The length of this portion may then be allowed ulti- 
mately to approach zero. 


Figure 1.1 


Let us imagine that we look at a very small portion of the string, between
points x and x + Δx, under a magnifying glass (Figs. 1.1 and 1.2).




Figure 1.2 


The only significant forces acting on this portion of string are assumed to
be the forces of tension on the two ends. Since the slope of the string at
any point is ∂y/∂x, the component of tension in the y direction at the
right-hand end is

$$ T\left[\frac{\partial y}{\partial x}\Big/\sqrt{1+\left(\frac{\partial y}{\partial x}\right)^{2}}\,\right]_{x+\Delta x} $$

where the subscript x + Δx indicates that the quantity in the bracket is
evaluated at this point, and T is the force of tension in the string. The
corresponding component at the left-hand end is

$$ T\left[\frac{\partial y}{\partial x}\Big/\sqrt{1+\left(\frac{\partial y}{\partial x}\right)^{2}}\,\right]_{x} $$


The net force in this direction is the difference of the above quantities. 
By the mean value theorem this difference is 


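$$ T\,\frac{\partial}{\partial x}\left[\frac{\partial y}{\partial x}\Big/\sqrt{1+\left(\frac{\partial y}{\partial x}\right)^{2}}\,\right]\Delta x \qquad (1.1) $$

the whole expression being evaluated at some point between x and x + Δx.
The mass times y component of acceleration for the segment of string is

$$ \rho\,\Delta x\left[\frac{\partial^{2} y}{\partial t^{2}}\right] \qquad (1.2) $$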


ρ being the density (mass per unit length) of the string. The quantity in the
square bracket is a mean value on (x, x + Δx); this, once again, is the value
at some intermediate point.

If the quantities (1.1) and (1.2) are now equated to express the
Newtonian law of motion for the element of string, and Δx is allowed to
approach zero, the equation of motion is

$$ \rho\,\frac{\partial^{2} y}{\partial t^{2}} = T\,\frac{\partial}{\partial x}\left[\frac{\partial y}{\partial x}\Big/\sqrt{1+\left(\frac{\partial y}{\partial x}\right)^{2}}\,\right] \qquad (1.3) $$


If, for some reason, the tension were dependent on position, represented
by a function T(x), it would appear under the derivative.

Equation (1.3) is nonlinear and is difficult to solve. The situation is
simplified, however, if ∂y/∂x ≪ 1, in which case the square root may be
replaced by unity. The equation then becomes

$$ \rho\,\frac{\partial^{2} y}{\partial t^{2}} = \frac{\partial}{\partial x}\left(T\,\frac{\partial y}{\partial x}\right) \qquad (1.4) $$

This is the equation for small vibrations, by which we mean vibrations for
which the slope ∂y/∂x is very small. It is this equation which we study in
some detail.


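Assuming T and ρ constant, and putting

$$ T/\rho = c^{2} \qquad (1.5) $$

we obtain the one-dimensional wave equation

$$ \frac{\partial^{2} y}{\partial x^{2}} = \frac{1}{c^{2}}\,\frac{\partial^{2} y}{\partial t^{2}} \qquad (1.6) $$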

A similar equation holds for purely longitudinal oscillations. We let y 
represent the displacement of a point x on the string from its equilibrium 
position. We define the strain in the string as the extension per unit length;
this quantity varies from point to point. The portion of the string initially
between x and x + Δx is stretched by an amount

$$ y(x+\Delta x) - y(x) $$

so that the strain is ∂y/∂x. y is now the longitudinal displacement.

Let the “stress” be defined as the excess over its equilibrium value 
of the tension of the string at a point. It is then assumed that the stress is 
proportional to the strain, being equal to 


$$ k\,\frac{\partial y}{\partial x} $$

where k is the Young’s Modulus.

As in the case of transverse displacement, the net force acting on the 
element Δx is the excess of the stress at x + Δx over that at x; thus, the
force per unit length is

$$ \frac{\partial}{\partial x}\left(k\,\frac{\partial y}{\partial x}\right) $$


k may, of course, be variable if the wire is not homogeneous. Since the 
mass per unit length times the acceleration is equal to the force per unit 
length, the equation of motion is 


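$$ \rho\,\frac{\partial^{2} y}{\partial t^{2}} = \frac{\partial}{\partial x}\left(k\,\frac{\partial y}{\partial x}\right) \qquad (1.7) $$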


which is formally the same as (1.4). 


3. Solution of the Equation 


It may be immediately verified that 


$$ y = f_{1}(x - ct) \qquad (1.8) $$

and

$$ y = f_{2}(x + ct) \qquad (1.9) $$

are solutions of the equation, where f₁ and f₂ are arbitrary functions. Each
of these solutions has a simple physical interpretation. In the case of (1.8),
the displacement y at point x and time t is the same as that at point x + cΔt
at time t + Δt. Thus, the whole pattern of disturbance of the string moves
in the direction of increasing x with a velocity c. c is therefore the
“velocity of propagation” of the disturbance.

In the case of (1.9), on the other hand, the displacement at point x at
time t is found, at time t + Δt, at the point with coordinate x − cΔt. This
disturbance, therefore, travels in the direction of decreasing x with 
velocity c. 
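
The two statements above can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes a Gaussian profile and a propagation speed c = 2 (both chosen only for illustration), estimates the two second derivatives in (1.6) by central differences, and confirms that a right-moving pattern found at (x, t) reappears at (x + cΔt, t + Δt).

```python
import math

def wave_residual(y, x, t, c, h=1e-3):
    """Central-difference estimate of y_xx - y_tt / c**2 at the point (x, t)."""
    y_xx = (y(x + h, t) - 2.0 * y(x, t) + y(x - h, t)) / h**2
    y_tt = (y(x, t + h) - 2.0 * y(x, t) + y(x, t - h)) / h**2
    return y_xx - y_tt / c**2

c = 2.0  # assumed propagation speed (illustrative value)

def profile(u):
    """Assumed pulse shape; any twice-differentiable function would do."""
    return math.exp(-u * u)

def right_moving(x, t):
    return profile(x - c * t)   # form (1.8)

def left_moving(x, t):
    return profile(x + c * t)   # form (1.9)

# Both forms should leave only a small discretization residual in (1.6).
residuals = [abs(wave_residual(w, 0.3, 0.7, c)) for w in (right_moving, left_moving)]

# The right-moving pattern at (x, t) is found again at (x + c*dt, t + dt).
dt = 0.25
shift_error = abs(right_moving(0.3, 0.7) - right_moving(0.3 + c * dt, 0.7 + dt))
```

The residuals are not exactly zero because the finite-difference stencil has an error of order h²; they shrink as h is reduced.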

The general solution of (1.6) is the sum of solutions such as (1.8) 
and (1.9): 


$$ y = f_{1}(x - ct) + f_{2}(x + ct) \qquad (1.10) $$

The functions f₁ and f₂ are determined by the initial conditions of the
motion. We know from general dynamical principles that if the initial 
values of the coordinates and their time rates of change (velocities) are 
known, the motion is completely determined. Let these initial conditions 
be that 


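$$ y = y_{0}(x) \quad \text{at } t = 0 \qquad (1.11) $$

and

$$ \frac{\partial y}{\partial t} = v_{0}(x) \quad \text{at } t = 0 \qquad (1.12) $$

If they are imposed on the solution (1.10), f₁ and f₂ are expressed in terms
of y₀(x) and v₀(x) by the equations

$$ y_{0} = f_{1} + f_{2} \qquad (1.13) $$

and

$$ v_{0} = -c f_{1}' + c f_{2}' \qquad (1.14) $$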


The primes represent, in the first instance, derivatives of the functions f₁
and f₂ with respect to their arguments; however, at t = 0 the argument
is simply x.

Equations (1.13) and (1.14) may be solved for f₁ and f₂:

$$ f_{1}(x) = \frac{1}{2}\left[y_{0}(x) - \frac{1}{c}\int_{x_{0}}^{x} v_{0}(x')\,dx'\right] \qquad (1.15) $$

and

$$ f_{2}(x) = \frac{1}{2}\left[y_{0}(x) + \frac{1}{c}\int_{x_{0}}^{x} v_{0}(x')\,dx'\right] \qquad (1.16) $$

The lower limit x₀ on the integrals is arbitrary; in any case, it does not
appear in the complete solution, which is

$$ y(x,t) = \frac{1}{2}\left[y_{0}(x-ct) + y_{0}(x+ct) + \frac{1}{c}\int_{x-ct}^{x+ct} v_{0}(x')\,dx'\right] \qquad (1.17) $$

This solution has a simple physical interpretation, which is most
easily understood if we consider first two particular cases, in which
v₀(x) = 0 and y₀(x) = 0, respectively. The two particular solutions may
subsequently be combined to give (1.17).

When v₀ = 0 the motion is given by

$$ y = \frac{1}{2}\left[y_{0}(x-ct) + y_{0}(x+ct)\right] \qquad (1.18) $$


This says that half of the initial disturbance travels in the positive direc- 
tion and half in the negative direction. This situation is illustrated in 
Fig. 1.3. 




Figure 1.3(a) shows the initial state. Figure 1.3(b) shows the dis- 
placement a little later, while the two displacements are still partially 
overlapping. Figure 1.3(c) shows the situation at a later time, when they 
are completely separated and traveling away from each other. 
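
The splitting of an initial displacement into two half-height pulses can be reproduced directly from the complete solution (1.17). The sketch below is illustrative only: the function name `dalembert`, the Gaussian bump, the value c = 1, and the midpoint-rule integration are all assumptions introduced here, not part of the original text.

```python
import math

def dalembert(y0, v0, x, t, c, steps=2000):
    """Evaluate (1.17), integrating v0 over [x - ct, x + ct] by the midpoint rule."""
    a, b = x - c * t, x + c * t
    width = (b - a) / steps
    integral = sum(v0(a + (i + 0.5) * width) for i in range(steps)) * width
    return 0.5 * (y0(a) + y0(b) + integral / c)

c = 1.0  # assumed propagation speed (illustrative value)

def bump(x):
    """Assumed initial displacement: a unit-height Gaussian centered at x = 0."""
    return math.exp(-x * x)

def at_rest(x):
    """Zero initial velocity, i.e. the string is released from rest."""
    return 0.0

start = dalembert(bump, at_rest, 0.0, 0.0, c)        # full height at t = 0
peak_right = dalembert(bump, at_rest, 5.0, 5.0, c)   # half-height pulse at x = +ct
peak_left = dalembert(bump, at_rest, -5.0, 5.0, c)   # half-height pulse at x = -ct
middle = dalembert(bump, at_rest, 0.0, 5.0, c)       # the pulses have left the origin
```

Passing a non-zero `v0` with `y0 = at_rest` gives the second particular case, in which only the integral term contributes.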


Figure 1.3 


The case just discussed is that in which the string is given an initial 
displacement, and released from rest. Consider now the case in which 
there is no initial displacement, but an impulse represented by a velocity 
distribution v₀(x). The solution in this case, which in general must be
added to the previous one, is

$$ y = \frac{1}{2}\left[y_{1}(x+ct) - y_{1}(x-ct)\right] \qquad (1.19) $$

where

$$ y_{1}(x) = \frac{1}{c}\int_{x_{0}}^{x} v_{0}(x')\,dx' \qquad (1.20) $$

This solution again consists of two parts, representing disturbances which 
travel in opposite directions, but also in this instance have opposite signs. 
If v₀ > 0, the disturbance traveling to the left is positive, while that traveling
to the right is negative. At t = 0, of course, they completely cancel.
The situation is portrayed in Fig. 1.4. Figure 1.4(a) shows an initial velocity
distribution of a simple sort. Figure 1.4(b) displays the functions y₁,
−y₁. In Fig. 1.4(c), the disturbances have moved a small distance relative
to each other. In Fig. 1.4(d), they have moved much further.


The preceding discussion deals with the propagation of disturbances 
on a string of indefinite length, that is, one for which the effect of end 
conditions may be neglected. Suppose, however, the string is finite, the 
ends being at x = 0 and x = L. Then, the functions y₀(x) and v₀(x) are
defined only on this range. Clearly, however, (1.17) requires them to be 
defined for all values. The problem is, then, how to extend the definition 
of these functions to arbitrary values of the argument. 


Figure 1.4 


Again, for simplicity, consider first the case in which v₀ = 0, and in
which there is a single fixed end at x = 0. If the solution (1.18) is to satisfy
this condition it is necessary that, for all values of t,

$$ y_{0}(-ct) = -y_{0}(ct) \qquad (1.21) $$


The boundary condition then tells us how to extend the function into the 
nonphysical region; it is simply to be defined as an odd function with 
respect to the end point. Once this condition has been applied, equation
(1.18) may be used to describe the reflection of a
wave from the end. This is illustrated in Fig. 1.5. In (a) there is shown an
initial disturbance, along with its “reflection” in the fixed end. Half of it
travels in each direction, and in (b) the half which is traveling to the left
is shown at the moment at which it first reaches the end. Thereafter, the
part of the “reflection” which is traveling to the right begins to overlap
it, and must be added to it. Successive stages of this interference are shown 
in (c) and (d). Finally, the original disturbance has passed completely into 
the unphysical region to the left, while that which was originally in that 
region travels on the string toward the right, as shown in (e). 



For clarity, (c), (d) and (e) are drawn on a larger scale than (a) and (b). 


Figure 1.5 


Physically, the process of reflection at the fixed end of the string is 
then seen as a reversal in direction of propagation of the disturbance which
impinges on the end, as well as an inversion (reversal in the direction of 
displacement). 




In the case of a motion resulting from an initial velocity distribution
v₀ it is evident from Eq. (1.19) that the function y₁(x) must be an even
function; that is to say, that

$$ \int_{x_{0}}^{x} v_{0}(x')\,dx' = \int_{x_{0}}^{-x} v_{0}(x')\,dx' $$

where we have again taken the fixed end at x = 0. Differentiation of both
sides of the equation with respect to x gives

$$ v_{0}(x) = -v_{0}(-x) \qquad (1.22) $$

so that the definition of v₀, too, must be extended to make it an odd function.
The lower limit in the integral defining y₁ can conveniently be taken
to be zero. Although y₁ in the nonphysical region is of the same sign as in
the physical region, it appears with a minus sign in the solution. Thus,
again, one adds to the “real” disturbance its inverted reflection in the end
of the string.

Suppose now that there are two fixed ends, at x = 0 and at x = L. In
physical terms, we can envisage the motion as a propagation of the initial 
disturbance followed by successive reflections of its parts at the two ends; 
the reflected disturbances will then interfere with, that is to say, be added 
to, each other. Although the resulting disturbance may at first sight appear 
to be quite complicated, it may in fact be visualized in a rather simple way. 
Again, the most convenient device is to extend the definition of the displacement
function y(x) outside its original range of definition (0 ≤ x ≤ L).
As already seen, it must be an odd function with respect to each end. In
mathematical terms, this is expressed by the equations

$$ y(-x) = -y(x) \qquad (1.23) $$

and

$$ y(-x+L) = -y(x+L) \qquad (1.24) $$

Combining these two equations yields the interesting result that

$$ y(x+L) = y(x-L) \qquad (1.25) $$

so that the function must be extended in such a way as to be periodic with
a period equal to twice the length of the string. The complete function can
then be simply constructed as follows: The physical displacement on the
range (0, L) is inverted and reflected in the region (0, −L), and then
extended over successive regions of length 2L by periodicity. To state it
differently, the function may be constructed from its value in (0, L) by
inverted reflections in its two ends, followed by their inverted reflections
in the two ends, and so forth, thus producing an infinite sequence of
images.




To describe the motion, the resulting function must be considered as 
consisting of two equal parts, which move with velocity c in opposite 
directions, and interfere with each other. 
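
The construction by images lends itself to a short computation. The sketch below is an illustration under stated assumptions: the helper names, the parabolic initial shape x(L − x), and the values L = 1 and c = 1 are all hypothetical choices made here. It builds the odd extension of period 2L and uses it in (1.18) for a string released from rest.

```python
import math

def extend_odd_periodic(y0, L):
    """Return the odd, 2L-periodic extension of y0, which is given on [0, L]."""
    def extended(x):
        s = math.fmod(x, 2.0 * L)    # fold x into (-2L, 2L)
        if s < 0.0:
            s += 2.0 * L             # now 0 <= s <= 2L
        # On (L, 2L] use the inverted reflection in the end x = L.
        return y0(s) if s <= L else -y0(2.0 * L - s)
    return extended

def string_released_from_rest(y0, L, c):
    """Displacement (1.18) for a string fixed at x = 0 and x = L."""
    ext = extend_odd_periodic(y0, L)
    return lambda x, t: 0.5 * (ext(x - c * t) + ext(x + c * t))

L, c = 1.0, 1.0  # assumed length and propagation speed (illustrative values)
y = string_released_from_rest(lambda x: x * (L - x), L, c)

end_values = [y(0.0, 0.37), y(L, 0.37)]                          # both ends stay fixed
period_error = abs(y(0.4, 0.37 + 2.0 * L / c) - y(0.4, 0.37))    # motion repeats after 2L/c
```

Because the extension has period 2L and the two halves move with velocity c, the whole motion repeats after a time 2L/c.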


Problem 1-1: Describe in detail the reflection at an
end at which the boundary condition is ∂y/∂x = 0. (This is an end which is
free to move transversely, since the tension can do no work on the 
support.) 


The fact of periodicity may be used to express the solution in a rather 
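different form. For, if y₀ and v₀ are odd functions with period 2L, they may
be expressed in Fourier sine series on that interval:

$$ y_{0}(x) = \sum_{n=1}^{\infty} a_{n}\sin\frac{n\pi x}{L} \qquad (1.26) $$

$$ v_{0}(x) = \sum_{n=1}^{\infty} b_{n}\sin\frac{n\pi x}{L} \qquad (1.27) $$

where the coefficients aₙ and bₙ may be obtained from the standard
formulas in terms of the given initial conditions:

$$ a_{n} = \frac{2}{L}\int_{0}^{L} y_{0}(x')\sin\frac{n\pi x'}{L}\,dx' \qquad (1.28) $$

and

$$ b_{n} = \frac{2}{L}\int_{0}^{L} v_{0}(x')\sin\frac{n\pi x'}{L}\,dx' \qquad (1.29) $$

Substituting the series (1.26) and (1.27) into the general solution (1.17)
yields the solution in the form

$$ \begin{aligned}
y &= \frac{1}{2}\sum_{n} a_{n}\left[\sin\frac{n\pi(x-ct)}{L} + \sin\frac{n\pi(x+ct)}{L}\right] \\
  &\quad + \frac{1}{2c}\sum_{n} b_{n}\,\frac{L}{n\pi}\left[\cos\frac{n\pi(x-ct)}{L} - \cos\frac{n\pi(x+ct)}{L}\right] \\
  &= \sum_{n}\sin\frac{n\pi x}{L}\left[a_{n}\cos\frac{n\pi ct}{L} + \frac{L}{n\pi c}\,b_{n}\sin\frac{n\pi ct}{L}\right]
\end{aligned} \qquad (1.30) $$

We see later that this solution is the one obtained by a quite different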
approach — that of separation of variables. This latter method is, however, 
more flexible, since it can be extended to the case of variable p and T. The 
present approach runs into difficulties in that case, in that it could only be 
pursued if one could write a general solution of the differential equation 
analogous to (1.10). 
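
As an illustration of the Fourier form of the solution, the following sketch computes the coefficients (1.28) numerically and sums the series (1.30). Everything specific in it is an assumption made for the example: the function names, the parabolic initial shape x(L − x), release from rest (all bₙ = 0), the values L = 1 and c = 1, the truncation at 40 terms, and the midpoint-rule quadrature.

```python
import math

def sine_coefficients(func, L, n_max, steps=4000):
    """Approximate (2/L) * integral of func(x) sin(n pi x / L) over [0, L] (midpoint rule)."""
    h = L / steps
    xs = [(i + 0.5) * h for i in range(steps)]
    return [
        (2.0 / L) * h * sum(func(x) * math.sin(n * math.pi * x / L) for x in xs)
        for n in range(1, n_max + 1)
    ]

def series_solution(a, b, L, c):
    """Build y(x, t) from (1.30); a[n-1] and b[n-1] hold a_n and b_n."""
    def y(x, t):
        total = 0.0
        for n, (a_n, b_n) in enumerate(zip(a, b), start=1):
            w = n * math.pi * c / L   # circular frequency of the n-th term
            total += math.sin(n * math.pi * x / L) * (
                a_n * math.cos(w * t) + (b_n / w) * math.sin(w * t)
            )
        return total
    return y

L, c = 1.0, 1.0                                   # assumed length and speed
a = sine_coefficients(lambda x: x * (L - x), L, 40)
b = [0.0] * len(a)                                # released from rest
y = series_solution(a, b, L, c)
```

For this particular shape the coefficients can also be found by hand: aₙ = 8L²/(n³π³) for odd n and zero for even n, which gives a convenient check on the quadrature.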




4. Energy of the String 


Consider now the energy associated with the motion of the string. 
This is most naturally expressed in terms of an energy density, that is,
energy per unit length. The calculation of kinetic and potential energies is
essential to the formulation of the problem in terms of classical Lagrangian
or Hamiltonian mechanics. (The problem is complicated, of course, by the
fact that the degrees of freedom form a continuum. Thus, the usual sums
over discrete coordinates will be replaced by integrals over the variable x
which enumerates the present continuum of coordinates.) 

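The kinetic energy per unit length of string is clearly seen to be

$$ \mathcal{K} = \frac{1}{2}\,\rho\left(\frac{\partial y}{\partial t}\right)^{2} \qquad (1.31) $$

and the total kinetic energy

$$ K = \frac{1}{2}\int \rho\left(\frac{\partial y}{\partial t}\right)^{2} dx \qquad (1.32) $$

the integral being taken over the whole length of the string, which we take
to be (0 ≤ x ≤ L).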

The determination of the potential energy is only slightly more 
difficult. If a portion of the string of initial length dx is, when displaced, 
stretched to a length ds, the increase in length is 


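$$ ds - dx = \left[\sqrt{1+\left(\frac{\partial y}{\partial x}\right)^{2}} - 1\right]dx $$

In the approximation of “small vibrations” (∂y/∂x ≪ 1) this becomes

$$ \frac{1}{2}\left(\frac{\partial y}{\partial x}\right)^{2} dx $$

Since this stretching takes place against a force of tension T, the potential
energy gain, which is the work done against tension, is

$$ \frac{1}{2}\,T\left(\frac{\partial y}{\partial x}\right)^{2} dx $$

so that the potential energy density is

$$ \mathcal{V} = \frac{1}{2}\,T\left(\frac{\partial y}{\partial x}\right)^{2} \qquad (1.33) $$

and the total potential energy is

$$ V = \frac{1}{2}\int T\left(\frac{\partial y}{\partial x}\right)^{2} dx \qquad (1.34) $$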


Problem 1-2: Derive the potential and kinetic 
energies for the small longitudinal vibrations of a wire. 


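The total energy is therefore

$$ H = \int_{0}^{L} \mathcal{H}\,dx \qquad (1.35) $$

where the energy density ℋ is

$$ \mathcal{H} = \frac{1}{2}\,\rho\left(\frac{\partial y}{\partial t}\right)^{2} + \frac{1}{2}\,T\left(\frac{\partial y}{\partial x}\right)^{2} \qquad (1.36) $$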

It is also possible to calculate the flux of energy past a given point x. 
This is the negative of the rate at which the y component of tension is 
doing work on the portion of the string to the left of x (thus, the flux is 
taken to be positive if energy is being transmitted from left to right, and 
vice versa). Since, as shown above, the y component of tension is T(∂y/∂x),
its rate of working is (T(∂y/∂x))(∂y/∂t). Designating the rate of energy flux
as J, it follows that

$$ J = -T\,\frac{\partial y}{\partial x}\,\frac{\partial y}{\partial t} \qquad (1.37) $$


From (1.36) and (1.37), along with the equation of motion, it is 
possible to derive an equation of continuity for the energy. This equation 
expresses the fact that the rate of change of energy on a portion of the 
string 

$$ \frac{\partial}{\partial t}\int_{x_{1}}^{x_{2}} \mathcal{H}\,dx \qquad (1.38) $$


is equal to the rate at which energy flows into the section over the ends: 


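$$ T\,\frac{\partial y}{\partial x}\frac{\partial y}{\partial t}\bigg|_{x_{2}} - T\,\frac{\partial y}{\partial x}\frac{\partial y}{\partial t}\bigg|_{x_{1}} = -J(x_{2}) + J(x_{1}) \qquad (1.39) $$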


For, calculating (1.38), 


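$$ \frac{\partial}{\partial t}\int_{x_{1}}^{x_{2}}\left[\frac{1}{2}\,\rho\left(\frac{\partial y}{\partial t}\right)^{2} + \frac{1}{2}\,T\left(\frac{\partial y}{\partial x}\right)^{2}\right]dx = \int_{x_{1}}^{x_{2}}\left[\rho\,\frac{\partial y}{\partial t}\frac{\partial^{2} y}{\partial t^{2}} + T\,\frac{\partial y}{\partial x}\frac{\partial^{2} y}{\partial x\,\partial t}\right]dx $$

Substituting for ρ(∂²y/∂t²) from (1.4), this becomes

$$ \begin{aligned}
\frac{\partial}{\partial t}\int_{x_{1}}^{x_{2}} \mathcal{H}\,dx
  &= \int_{x_{1}}^{x_{2}}\left[\frac{\partial y}{\partial t}\,\frac{\partial}{\partial x}\left(T\,\frac{\partial y}{\partial x}\right) + T\,\frac{\partial y}{\partial x}\,\frac{\partial}{\partial x}\left(\frac{\partial y}{\partial t}\right)\right]dx \\
  &= \left[\frac{\partial y}{\partial t}\,T\,\frac{\partial y}{\partial x}\right]_{x_{1}}^{x_{2}} \\
  &= -J(x_{2}) + J(x_{1})
\end{aligned} $$

as stated.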


Alternatively, given the expression for the energy density, the 
physically obvious condition of continuity could be used to define the 
energy flux J. 

It is instructive to calculate the energy density and energy flux for the 
general solution (1.10) 


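$$ y = f_{1}(x - ct) + f_{2}(x + ct) $$

The kinetic energy density is

$$ \mathcal{K} = \frac{1}{2}\,\rho c^{2}\,(f_{1}' - f_{2}')^{2} \qquad (1.40) $$

where the primes indicate derivatives of the functions with respect to
their arguments. The potential energy density is

$$ \mathcal{V} = \frac{1}{2}\,T\,(f_{1}' + f_{2}')^{2} = \frac{1}{2}\,\rho c^{2}\,(f_{1}' + f_{2}')^{2} \qquad (1.41) $$

Thus the total energy density is

$$ \mathcal{H} = \rho c^{2}\,(f_{1}'^{2} + f_{2}'^{2}) \qquad (1.42) $$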


The important thing to note is that the waves moving in opposite 
directions each have their own energies. While there are interference 
terms in the kinetic and potential energy densities individually, there are 
no such terms in the total energy. It is of course true also that the energy 
in each of the waves is constant, since each is an integral over all values of 
x, and therefore of the argument. This is true for a finite string because of 
the periodicity condition and the condition of oddness in a fixed end, 
which ensures that the integral of the square of the function is the same 
over every interval of length L. Thus, constancy of the integral over all x 
implies constancy of that on (0, L). 
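
The cancellation of the interference terms can be written out in one line. The step below is a worked sketch that uses only the kinetic and potential energy densities of the general solution, ½ρc²(f₁′ − f₂′)² and ½ρc²(f₁′ + f₂′)², together with T = ρc²:

```latex
\begin{aligned}
\mathcal{K} + \mathcal{V}
  &= \tfrac{1}{2}\rho c^{2}\left[(f_1' - f_2')^{2} + (f_1' + f_2')^{2}\right] \\
  &= \tfrac{1}{2}\rho c^{2}\left[\,2 f_1'^{2} + 2 f_2'^{2} - 2 f_1' f_2' + 2 f_1' f_2'\,\right] \\
  &= \rho c^{2}\left(f_1'^{2} + f_2'^{2}\right)
\end{aligned}
```

The cross term −2f₁′f₂′ in the kinetic part is exactly offset by the term +2f₁′f₂′ in the potential part, which is why each wave can be assigned its own energy.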

Consider next the energy flux 


$$ J = -T\,(f_{1}' + f_{2}')(-c f_{1}' + c f_{2}') = Tc\,(f_{1}'^{2} - f_{2}'^{2}) \qquad (1.43) $$

Thus, the two waves flow independently. In the case of the wave moving
in the direction of positive x, the flux J₁ bears to the energy density ℋ₁ the
ratio

$$ \frac{J_{1}}{\mathcal{H}_{1}} = c \qquad (1.44) $$

so that c is the rate of propagation of energy as well as the phase velocity
of the wave. Designating the energy flux and density of the wave traveling
in the direction of negative x as J₂ and ℋ₂, respectively,

$$ \frac{J_{2}}{\mathcal{H}_{2}} = -c \qquad (1.45) $$


which says that energy in this component is transferred in the negative 
direction with a velocity c. 


5. Energy in the Harmonics 


In the preceding section we have an example of waves which inter- 
fere with each other so far as point by point displacement is concerned, 
but do not exchange energy with each other. This result is striking when 
one considers that two disturbances of finite extent and of the same shape 
but opposite in sign, and traveling in opposite directions, can momentarily 
extinguish each other in passing, and yet will, when they cease to inter- 
fere, each regain its original shape and propagate its characteristic energy. 

An even more striking result, which we demonstrate in this section, 
is that the individual terms in the Fourier series solution (1.30) are 
similarly independent, in that each carries a fixed amount of energy, and 
this energy cannot be exchanged with the energies associated with the 
other terms. The terms in the Fourier series are designated as “normal 
modes” or harmonics, each of which represents a possible mode of motion 
of the string with a definite frequency. We subsequently discuss the 
question of normal modes in a more fundamental way and in a more 
general context, but it is interesting to demonstrate at this point the 
constancy of the energy of each normal mode. 

The Fourier series solution is, as shown in (1.30), 


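$$ y = \sum_{n} C_{n}\,\sin\frac{n\pi x}{L}\,\sin\left(\frac{n\pi ct}{L} - \alpha_{n}\right) \qquad (1.46) $$

where

$$ C_{n}\cos\alpha_{n} = \frac{L}{n\pi c}\,b_{n} \qquad (1.47) $$

and

$$ C_{n}\sin\alpha_{n} = -a_{n} $$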


In calculating the energy we make use of the “orthogonality” of the 
expansion functions, that is to say, of the fact that 


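$$ \int_{0}^{L} \sin\frac{n\pi x}{L}\,\sin\frac{n'\pi x}{L}\,dx = \frac{L}{2}\,\delta_{nn'} \qquad (1.48) $$

where

$$ \delta_{nn'} = \begin{cases} 1, & n = n' \\ 0, & n \neq n' \end{cases} \qquad (1.49) $$

and of a similar equation for the integral of two cosines of the same
arguments. The kinetic energy in this case is

$$ \begin{aligned}
K &= \frac{1}{2}\,\rho\int_{0}^{L}\left[\sum_{n}\frac{n\pi c}{L}\,C_{n}\cos\left(\frac{n\pi ct}{L} - \alpha_{n}\right)\sin\frac{n\pi x}{L}\right]^{2}dx \\
  &= \frac{1}{4}\,\rho L\sum_{n}\left(\frac{n\pi c}{L}\right)^{2} C_{n}^{2}\cos^{2}\left(\frac{n\pi ct}{L} - \alpha_{n}\right)
\end{aligned} \qquad (1.50) $$

the cross terms dropping out because of (1.48). Similarly the potential
energy is

$$ \begin{aligned}
V &= \frac{1}{2}\,T\int_{0}^{L}\left[\sum_{n}\frac{n\pi}{L}\,C_{n}\cos\frac{n\pi x}{L}\,\sin\left(\frac{n\pi ct}{L} - \alpha_{n}\right)\right]^{2}dx \\
  &= \frac{1}{4}\,\rho L\sum_{n}\left(\frac{n\pi c}{L}\right)^{2} C_{n}^{2}\sin^{2}\left(\frac{n\pi ct}{L} - \alpha_{n}\right)
\end{aligned} \qquad (1.51) $$

where we have used the fact that T = ρc².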

It is evident that there are no terms representing interactions of the 
normal modes in either the kinetic or potential energy. The potential and 
kinetic energies of a normal mode taken together give a constant; the 
total energy is 


$$ H = \frac{1}{4}\,\rho L\sum_{n}\left(\frac{n\pi c}{L}\right)^{2} C_{n}^{2} \qquad (1.52) $$

In each mode, the energy oscillates between kinetic and potential forms,
as the string itself oscillates. The periods of the oscillations of the string
are

$$ T_{n} = \frac{2L}{nc} \qquad (1.53) $$

while those of the energy are half that great, corresponding to successive
realizations of a given phase of the motion.

The decomposition of the energy of the string into contributions from 
distinct normal modes appears in this problem as a mathematical tour de 
force which is not at all physically evident. Only when we come to the 
development of quantum mechanics will this sort of phenomenon be 
given a specifically physical connotation, and thus take on a very funda- 
mental significance. 
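
The independence of the normal modes can be tested numerically. The sketch below is an illustration under stated assumptions: it takes each mode in the form Cₙ sin(nπx/L) sin(nπct/L − αₙ), uses three modes with arbitrarily chosen amplitudes and phases, and assumes values for L, c and ρ. It integrates the energy density ½ρ(∂y/∂t)² + ½T(∂y/∂x)² over the string at several times and compares the result with the sum of modal energies ¼ρL Σ(nπc/L)²Cₙ².

```python
import math

def energy_by_quadrature(C, alpha, L, c, rho, t, steps=2000):
    """Integrate the energy density over 0 <= x <= L with the midpoint rule."""
    T = rho * c * c  # tension, from T = rho * c**2
    h = L / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        y_t = y_x = 0.0
        for n, (C_n, a_n) in enumerate(zip(C, alpha), start=1):
            k = n * math.pi / L
            w = k * c
            y_t += w * C_n * math.sin(k * x) * math.cos(w * t - a_n)  # velocity
            y_x += k * C_n * math.cos(k * x) * math.sin(w * t - a_n)  # slope
        total += 0.5 * rho * y_t ** 2 + 0.5 * T * y_x ** 2
    return total * h

def energy_from_modes(C, L, c, rho):
    """Sum of the separate modal energies, (1/4) rho L (n pi c / L)^2 C_n^2."""
    return 0.25 * rho * L * sum(
        (n * math.pi * c / L * C_n) ** 2 for n, C_n in enumerate(C, start=1)
    )

C = [1.0, 0.5, 0.25]          # assumed mode amplitudes
alpha = [0.3, 1.1, -0.4]      # assumed phases
L, c, rho = 1.0, 2.0, 0.01    # assumed length, speed and density
expected = energy_from_modes(C, L, c, rho)
measured = [energy_by_quadrature(C, alpha, L, c, rho, t) for t in (0.0, 0.13, 0.8)]
```

The total found by direct integration is the same at every time and equals the sum over modes, with no contribution from products of different modes.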


6. The “Loaded” String 


In the following sections we consider three special problems. First, 
we treat the problem of the reflection and transmission of a wave on the
string by a mass M attached at a fixed point, which for convenience we take
to be x = 0. The results of this calculation are then applied to propagation 
of waves on a string to which point masses are attached at regular intervals. 
Finally, we deal with the problem in which a mass is distributed uniformly 
over a finite length a, that is, in which such a portion of the string has a 
density different from that of the rest of the string. In each instance we 
consider a periodic wave of circular frequency ω.


7. Reflection and Transmission 
at a Fixed Mass 


The effect of the mass may be formulated as a boundary condition to 
be applied at its position. The motion of the mass is determined by the 
condition that M(∂²y/∂t²)ₓ₌₀ = net y component of tensional forces on the
two sides of the mass. In the small-vibration approximation this is
T[(∂y/∂x)₀₊ − (∂y/∂x)₀₋], where by 0+ we mean a point just to the right
of x = 0 and by 0− a point just to the left of it. In other words, the effect
of the mass is to produce a discontinuity in the slope of the string of
amount

$$ \left(\frac{\partial y}{\partial x}\right)_{0+} - \left(\frac{\partial y}{\partial x}\right)_{0-} = \frac{M}{T}\left(\frac{\partial^{2} y}{\partial t^{2}}\right)_{0} \qquad (1.54) $$


Problem 1-3: A disturbance of arbitrary form y =
f(ct − x), (x < 0), travels along an infinite string in the direction of increasing
x and impinges on a discrete mass M attached to the string at x = 0.

Show that the wave transmitted to the region of positive x is

$$ y = \frac{2\rho}{M}\int_{0}^{ct-x} f(x')\,e^{(2\rho/M)(x'+x)}\,dx'\;e^{-(2\rho/M)ct} = f_{1}(ct-x) $$

and that the reflected wave is

$$ y = \frac{2\rho}{M}\int_{0}^{ct+x} f(x')\,e^{-(2\rho/M)(x-x')}\,dx'\;e^{-(2\rho/M)ct} - f(ct+x) $$

Show that the fractions of the energy of the incident wave which are
transmitted and reflected are

$$ \eta = \frac{\int [f_{1}'(ct-x)]^{2}\,dx}{\int [f'(ct-x)]^{2}\,dx} \quad \text{and} \quad (1-\eta) $$

respectively, f(x) and f₁(x) = 0 for x < 0.


Problem 1-4: Do the calculations explicitly for the 
case of an incident wave: 


$$ f(x) = e^{ikx} $$


What special problems arise from taking a complex function f (x)? 


If the disturbance being propagated is periodic, its time behavior may
be described by $e^{-i\omega t}$; it is, therefore,

$$ y = A\,e^{ik(x-ct)} \qquad (1.55) $$

where

$$ k = \omega/c \qquad (1.56) $$


The use of the complex exponential must of course be properly under- 
stood, since the displacement y is obviously a real quantity. An equation 
like (1.55) is to be understood to mean that the physical quantity is to be 
taken to be either the real or imaginary part of the expression on the 
right-hand side. If A is taken as complex 


$$ A = A_{0}\,e^{i\alpha} \qquad (1.57) $$

it does not matter whether the real or imaginary part is chosen; the
imaginary part for a given α is the same as the real part for an α which is
smaller by π/2. Therefore, in general, we adopt the convention that the
physical quantity is the real part of the complex function. 

Consider a wave on the left-hand side of the string (x < 0), traveling 
toward the right, and therefore represented by (1.55). The reflected wave 
will be a wave of the same frequency on the range x < 0 and traveling to 
the left. Therefore the displacement on x < 0 is given by

$$ y = A\,e^{ik(x-ct)} + B\,e^{-ik(x+ct)} \qquad (1.58) $$

On the right there will be only a transmitted wave traveling to the right:

$$ y = C\,e^{ik(x-ct)} \qquad (1.59) $$

An additional term in $e^{-ik(x+ct)}$ on the right would give rise to a valid
solution on that side, but would not correspond to the given conditions of 
the problem, since it would represent a wave incident from the right. 

There are two boundary conditions, which serve to determine B and 
C in terms of the given incident amplitude A. Aside from (1.54), there is 
the obvious condition that y must have the same value whether calculated 
from (1.58) or (1.59). The two boundary equations are, then 


$$ A + B = C \qquad (1.60) $$

and

$$ ikC - ik(A - B) = -\frac{M k^{2}}{\rho}\,C \qquad (1.61) $$

These may be solved to give

$$ B = A\,\frac{iMk/2\rho}{1 - iMk/2\rho} \qquad (1.62) $$

$$ C = A\,\frac{1}{1 - iMk/2\rho} \qquad (1.63) $$

If we introduce

$$ \tan\beta = Mk/2\rho \qquad (1.64) $$

the equations for B and C may be written as

$$ B = A\,\sin\beta\;e^{i(\beta+\pi/2)} \qquad (1.65) $$

and

$$ C = A\,\cos\beta\;e^{i\beta} \qquad (1.66) $$

As M → 0, β → 0 and B → 0, so that the reflected wave ceases to exist;
since C → A there is complete transmission. In general the transmitted
wave suffers a phase change of β, while the reflected wave is further
changed in phase by π/2.

The physical displacements are, on the left 


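$$ y = A_{0}\cos[k(x-ct)+\alpha] + A_{0}\sin\beta\,\sin[k(x+ct)-\alpha-\beta] \qquad (1.67) $$

and on the right

$$ y = A_{0}\cos\beta\,\cos[k(x-ct)+\alpha+\beta] \qquad (1.68) $$

Since, as we have seen, the energies of waves traveling in opposite
directions are distinct, we may use (1.42) to write the energy densities in
each of the waves:

$$ \begin{aligned}
\text{incident:}\quad & \rho\omega^{2}A_{0}^{2}\,\sin^{2}[k(x-ct)+\alpha] && (1.69) \\
\text{reflected:}\quad & \rho\omega^{2}A_{0}^{2}\,\sin^{2}\beta\,\cos^{2}[k(x+ct)-\alpha-\beta] && (1.70) \\
\text{transmitted:}\quad & \rho\omega^{2}A_{0}^{2}\,\cos^{2}\beta\,\sin^{2}[k(x-ct)+\alpha+\beta] && (1.71)
\end{aligned} $$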


The situation is somewhat simpler if we consider time averages, in which
case each squared trigonometric function of time is replaced by 1/2. It is
evident then that the fraction of energy transmitted is

$$ f_{T} = \cos^{2}\beta \qquad (1.72) $$

and the fraction reflected is

$$ f_{R} = \sin^{2}\beta \qquad (1.73) $$


the sum being, as it should be, equal to unity. 
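
These results are easy to confirm with complex arithmetic. The sketch below is illustrative only: the function name and the numerical values of A, M, k and ρ are assumptions (chosen so that tan β = 3). It solves the two boundary equations, A + B = C and ikC − ik(A − B) = −(Mk²/ρ)C, and compares the amplitudes and energy fractions with the expressions in terms of β.

```python
import cmath
import math

def point_mass_amplitudes(A, M, k, rho):
    """Solve the two boundary equations for the reflected (B) and transmitted (C) amplitudes."""
    g = 1j * M * k / (2.0 * rho)
    C = A / (1.0 - g)   # transmitted amplitude
    B = C - A           # reflected amplitude, from continuity A + B = C
    return B, C

A = 1.0 + 0.5j                   # assumed complex incident amplitude
M, k, rho = 0.02, 3.0, 0.01      # assumed mass, wave number and density
B, C = point_mass_amplitudes(A, M, k, rho)

beta = math.atan(M * k / (2.0 * rho))
reflected_fraction = abs(B) ** 2 / abs(A) ** 2
transmitted_fraction = abs(C) ** 2 / abs(A) ** 2

# Residual of the slope condition at the mass; it should vanish.
slope_residual = abs(1j * k * C - 1j * k * (A - B) + (M * k * k / rho) * C)
```

With tan β = 3 the transmitted fraction is cos²β = 1/10 and the reflected fraction is sin²β = 9/10. A heavier mass or a larger k (higher frequency) pushes β toward π/2 and the reflection toward completeness.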

If the mass were attached, not at x = 0 but at x = x₀, the above
calculation would be altered only in that B would be multiplied by an extra
factor $e^{2ikx_{0}}$.

It may be verified in a similar manner that if a wave $D\,e^{-ik(x+ct)}$
impinges from the right on a mass M at x₀, we have a transmitted wave

$$ B = D\,\cos\beta\;e^{i\beta} \qquad (1.74) $$

and a reflected wave

$$ C = D\,\sin\beta\;e^{i(\beta+\pi/2)}\,e^{-2ikx_{0}} \qquad (1.75) $$


In (1.65), (1.66), (1.74), and (1.75) we have four equations relating
transmitted and reflected amplitudes to incident ones. These may be
given a matrix formulation, which relates the matrix of the transmitted
amplitudes, $\binom{C}{B}$, to that for the incident amplitudes on the two sides, $\binom{A}{D}$.
This matrix is a simple example of a scattering matrix, and in this instance
takes the form

$$ S = e^{i\beta}\begin{pmatrix} \cos\beta & i\sin\beta\;e^{-2ikx_{0}} \\ i\sin\beta\;e^{2ikx_{0}} & \cos\beta \end{pmatrix} \qquad (1.76) $$

If we write the transpose conjugate of this matrix

$$ S^{\dagger} = e^{-i\beta}\begin{pmatrix} \cos\beta & -i\sin\beta\;e^{-2ikx_{0}} \\ -i\sin\beta\;e^{2ikx_{0}} & \cos\beta \end{pmatrix} \qquad (1.77) $$

it is easily verified that

$$ SS^{\dagger} = S^{\dagger}S = 1 \qquad (1.78) $$


Such a matrix is known as a unitary matrix, and is discussed in the chap- 
ter on linear vector spaces. 
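
The unitarity of the scattering matrix can be confirmed numerically. The
sketch below assumes the form of S in which the off-diagonal elements are
i sin(beta) times exp(-2ik x0) and exp(+2ik x0); the helper names
`s_matrix`, `dagger`, and `matmul` and the sample parameter values are
illustrative only.

```python
import cmath
import math

def s_matrix(beta, k, x0):
    """Scattering matrix for a mass at x0, as nested lists [[S11, S12], [S21, S22]]."""
    pref = cmath.exp(1j * beta)
    off = 1j * math.sin(beta)
    return [
        [pref * math.cos(beta), pref * off * cmath.exp(-2j * k * x0)],
        [pref * off * cmath.exp(2j * k * x0), pref * math.cos(beta)],
    ]

def dagger(S):
    """Transpose conjugate of a 2x2 matrix."""
    return [[S[j][i].conjugate() for j in range(2)] for i in range(2)]

def matmul(X, Y):
    """Product of two 2x2 matrices."""
    return [[sum(X[i][m] * Y[m][j] for m in range(2)) for j in range(2)]
            for i in range(2)]

# Illustrative values only (assumed)
S = s_matrix(beta=0.7, k=1.3, x0=0.4)
P = matmul(S, dagger(S))   # should be the unit matrix
Q = matmul(dagger(S), S)   # likewise
for row in P:
    print([round(abs(z), 12) for z in row])
```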

The utility of the scattering matrix lies in the fact that it gives, for 
any given wave incident on the scatterer, the amplitudes of the scattered 
waves, that is, of the waves emerging from the scatterer. 

The unitary property of the matrix expresses conservation of energy:
the total flux of energy in the scattered waves is equal to that in the
incident wave. For the incident wave is completely specified by a matrix
$M$, and the scattered one by a matrix $M'$ related to the incident by
$$M' = SM \tag{1.79}$$
But by virtue of (1.37), the time average of energy flux in a wave is
proportional to the magnitude squared of its amplitude (the
proportionality factor being the same for all partial waves). Thus the
incident energy is
$M^\dagger M = (A^* \; D^*)\begin{pmatrix} A \\ D \end{pmatrix} = |A|^2 + |D|^2$,
while the scattered energy is
$$M'^\dagger M' = (M^\dagger S^\dagger)(SM) = M^\dagger M \tag{1.80}$$
by virtue of the unitarity of $S$.
It may be asked why the time average of the energy flux is used. The
reason lies in the fact that in each wave the energy flux oscillates with
time; what flows in flows out, but with a time or phase delay. Averaging
over time eliminates this complication.

In the corresponding quantum problem, which is treated in a later
chapter, this complication is not present, since the flux in that case is
constant in time even though the "wave function" is oscillatory.


8. Propagation on a String with 
Regularly Spaced Masses Attached 


Let the attached masses be located at points $x = na$. In the section to
the left of $na$, let the amplitudes of the waves moving to the right and to
the left be designated $r_n$ and $l_n$, respectively, while those to the right
of $na$ are $r_{n+1}$ and $l_{n+1}$. These correspond to the quantities $A$,
$B$, $C$, and $D$, respectively, of the previous problem. Thus they are
related by the matrix equation
$$\begin{pmatrix} r_{n+1} \\ l_n \end{pmatrix} = S_n \begin{pmatrix} r_n \\ l_{n+1} \end{pmatrix} \tag{1.81}$$


the subscript on the scattering matrix $S$ indicating that the scattering
takes place at $x_0 = na$. Relations like (1.81) connect the amplitudes in
successive sections. A formal solution of these equations may be obtained
by putting
$$r_n = e^{in(\theta - ka)} R \tag{1.82}$$
and
$$l_n = e^{in(\theta + ka)} L \tag{1.83}$$
in (1.81), using $S$ from (1.76). Two simultaneous equations are obtained
for $R$ and $L$:
$$\left(e^{i(\theta - ka)} - \cos\beta\, e^{i\beta}\right) R = i e^{i\beta} e^{i(\theta + ka)} \sin\beta \cdot L \tag{1.84}$$
and
$$\left(1 - e^{i\beta} e^{i(\theta + ka)} \cos\beta\right) L = i e^{i\beta} \sin\beta \cdot R \tag{1.85}$$
The condition for a nonzero solution is obtained by eliminating $R$ and $L$.
The resulting equation determines $\theta$ in terms of $k = \omega/c$ (or
conversely).


After some simplification, it is found that
$$\cos\theta = \frac{\cos(\beta + ka)}{\cos\beta} \tag{1.86}$$

It is evident from this equation that the disturbance is propagated
without attenuation along the string only if the right side is of magnitude
less than unity. If it is greater than 1, $\theta$ must be an imaginary
quantity (apart from a possible real part $\pi$ when the right side is
negative). If $\theta = i\theta'$, where $\theta' > 0$, $r_n$ and $l_n$ will be
$$r_n = e^{-n\theta'} e^{-inka} R \tag{1.87}$$
$$l_n = e^{-n\theta'} e^{inka} L \tag{1.88}$$
Thus the disturbance will be attenuated as it propagates to the right. If,
on the other hand, $\theta' < 0$, these equations will contain positive rather
than negative exponentials, and it will be attenuated toward the left.


Consider, then, the conditions under which the magnitude of the
right-hand side of (1.86) will be less than unity. Using (1.64) and writing
$$x = ka = \frac{\omega}{c}\,a \tag{1.89}$$
and
$$\lambda = \frac{M}{2\rho a} \tag{1.90}$$
it may be written as
$$f(x) = \cos x - \lambda x \sin x \tag{1.91}$$


The general features of this function may be simply described. For small
frequency ($\omega \ll c/a$) its magnitude will be less than 1. The second
term in (1.91) will, however, as $x$ becomes progressively larger, grow in
amplitude, so that $|f(x)|$ will be less than 1 only in intervals adjoining
the points $x = n\pi$, where $n$ is an integer, which decrease in size as $n$
becomes larger. Calling the intervals of $\omega$ for which $|f(x)| < 1$
"allowed regions" (representing the ranges of frequencies which can be
propagated without attenuation), it is evident that sufficiently low
frequencies are allowed, that "allowed" and "forbidden" ranges alternate,
and that the allowed regions, which in any case become successively
narrower as the frequency increases, do so more rapidly the larger the
value of $\lambda$; that is, the larger the attached masses in relation to the
mass of a section of the string. In Figs. 1.6(a), 1.6(b), and 1.6(c) are
graphs of the function $f(x)$ for $\lambda = 0.2$, $\lambda = 1$, and
$\lambda = 5$, respectively, the "allowed" regions being indicated by the
marked sections of the $x$ axis.
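
The band structure described above can be located numerically. The sketch
below scans the function cos x - lambda x sin x on a uniform grid; the grid
scan, the function names `f`, `allowed_bands`, and `attenuation`, and the
choice lambda = 1 are assumptions made for illustration, not the author's
procedure.

```python
import math

def f(x, lam):
    """Right-hand side of the propagation condition: cos x - lam * x * sin x."""
    return math.cos(x) - lam * x * math.sin(x)

def allowed_bands(lam, x_max, n=100000):
    """Scan 0 < x <= x_max and return the intervals on which |f(x)| <= 1."""
    bands, start = [], None
    dx = x_max / n
    for i in range(1, n + 1):
        x = i * dx
        inside = abs(f(x, lam)) <= 1.0
        if inside and start is None:
            start = x - dx                # band opens within this grid step
        elif not inside and start is not None:
            bands.append((start, x - dx))  # band closed within this grid step
            start = None
    if start is not None:
        bands.append((start, x_max))
    return bands

def attenuation(x, lam):
    """Attenuation exponent per section in a forbidden region: cosh(theta') = |f|."""
    v = abs(f(x, lam))
    return math.acosh(v) if v > 1.0 else 0.0

for lo, hi in allowed_bands(1.0, 10.0):
    print(f"allowed: {lo:.4f} <= x <= {hi:.4f}   width {hi - lo:.4f}")
```

Each allowed interval begins at a multiple of pi, and the printed widths
shrink with increasing x, in line with the description in the text.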


Figure 1.6. The function $f(x)$ of $x = (\omega/c)a$ for (a) $\lambda = M/(2\rho a) = 0.2$, (b) $\lambda = 1$, and (c) $\lambda = 5$. Lines on the axis correspond to regions of transmission.


For very large $\lambda$, the allowed frequencies all lie close to the values
$$\omega_n = \frac{n\pi c}{a} \tag{1.92}$$
In this case the portions of string between the masses vibrate effectively
independently.

The greater the excess of $|f(x)|$ over 1, the larger will have to be the
imaginary value of $\theta$ to satisfy (1.86), and the greater will be the
rate of attenuation of the disturbances corresponding to the forbidden
frequencies. Thus, the attenuation will be greater in the middle of the
forbidden region than near its boundaries, and will in general increase
with frequency.

The tendency of periodic systems to exhibit alternately allowed and
forbidden frequency ranges is well known in various branches of physics,
for example, in filter networks [Brillouin, Wave Propagation in Periodic
Structures¹], in lattice vibrations, and in electronic spectra of solids
[Ziman, Theory of Solids²].

¹New York: Dover, 1946.
²London: Cambridge Press, 1964, Chapters 2 and 3.




9. Reflection by and Transmission 
through a Section of Different 
Density 


A problem similar to the previous one is that in which a section of
string of finite length (stretching, let us say, from $x = 0$ to $x = a$) and
of density $\rho_2$ is inserted into a string of density $\rho_1$. In fact, the
solution of the former can be considered as a special case in which
$a \to 0$, $\rho_2 \to \infty$ in such a way that $\rho_2 a = M$.

Since it is not essentially more difficult, we consider the more general
case in which the left- and right-hand sections have different densities
$\rho_1$ and $\rho_3$, respectively.

Consider an incident wave of a definite frequency, so that the time
dependence is $e^{-i\omega t}$ throughout. In writing down the solutions in the
different sections, therefore, this factor may, for simplicity, be omitted.

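Designating the left, center, and right sections as I, II, and III,
respectively, the solution for the case of a wave incident from the left
only is, then
$$\text{in region I:}\quad e^{ik_1 x} + A e^{-ik_1 x} \tag{1.93}$$
$$\text{in region II:}\quad B e^{ik_2 x} + C e^{-ik_2 x} \tag{1.94}$$
$$\text{in region III:}\quad D e^{ik_3 x} \tag{1.95}$$
where
$$k_i = \frac{\omega}{c_i} = \omega\sqrt{\frac{\rho_i}{T}}, \qquad i = 1, 2, 3 \tag{1.96}$$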


The amplitude of the incident wave may be taken to be unity since all
other amplitudes are proportional to it. One boundary condition is that the
displacement $y$ should be continuous at the boundaries of the regions. The
other is that the slope $\partial y/\partial x$ should be continuous; this
follows from (1.54) since there is no discrete mass at the boundaries.

The application of these boundary conditions at $x = 0$ and $x = a$ to
the solutions (1.93)-(1.95) yields the equations
$$1 + A = B + C \tag{1.97}$$
$$ik_1(1 - A) = ik_2(B - C) \tag{1.98}$$
$$B e^{ik_2 a} + C e^{-ik_2 a} = D e^{ik_3 a} \tag{1.99}$$
and
$$ik_2\left(B e^{ik_2 a} - C e^{-ik_2 a}\right) = ik_3 D e^{ik_3 a} \tag{1.100}$$


The simplest method of solution is to solve (1.97) and (1.98) in terms of
$A$, and (1.99) and (1.100) in terms of $D$. From the resulting equations we
can solve for $A$ (reflected amplitude) and $D$ (transmitted amplitude). The
results are
$$D = \frac{2 k_1 k_2\, e^{-ik_3 a}}{k_2(k_1 + k_3)\cos k_2 a - i(k_2^2 + k_1 k_3)\sin k_2 a} \tag{1.101}$$
and
$$A = \frac{k_2(k_1 - k_3)\cos k_2 a + i(k_2^2 - k_1 k_3)\sin k_2 a}{k_2(k_1 + k_3)\cos k_2 a - i(k_2^2 + k_1 k_3)\sin k_2 a} \tag{1.102}$$
Let these complex quantities be represented in terms of amplitude and
phase:
$$A = A_0 e^{i\alpha} \tag{1.103}$$
and
$$D = D_0 e^{i\delta} \tag{1.104}$$


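Then the real $y$'s associated with the incident, reflected, and transmitted
waves are
$$\text{incident:}\quad y = \cos(k_1 x - \omega t) \tag{1.105}$$
$$\text{reflected:}\quad y = A_0 \cos(k_1 x + \omega t - \alpha) \tag{1.106}$$
$$\text{transmitted:}\quad y = D_0 \cos(k_3 x - \omega t + \delta) \tag{1.107}$$
Calculating the time averages of the energy flux we get
$\tfrac12 T\omega k_1$, $\tfrac12 T\omega k_1 A_0^2$, and
$\tfrac12 T\omega k_3 D_0^2$, respectively. Therefore the fraction of the
energy which is reflected (reflection coefficient) is
$$f_R = A_0^2 = \frac{k_2^2(k_1 - k_3)^2\cos^2 k_2 a + (k_2^2 - k_1 k_3)^2\sin^2 k_2 a}{k_2^2(k_1 + k_3)^2\cos^2 k_2 a + (k_2^2 + k_1 k_3)^2\sin^2 k_2 a} \tag{1.108}$$
and that which is transmitted (transmission coefficient) is
$$f_T = \frac{k_3}{k_1} D_0^2 = \frac{4 k_1 k_3 k_2^2}{k_2^2(k_1 + k_3)^2\cos^2 k_2 a + (k_2^2 + k_1 k_3)^2\sin^2 k_2 a} \tag{1.109}$$
It is easily verified that $f_R + f_T = 1$.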
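
A numerical check of the reflected and transmitted amplitudes and of the
sum rule for the two coefficients is sketched below. The formulas coded are
those for A and D given above (reflected amplitude with numerator
k2(k1 - k3) cos k2a + i(k2^2 - k1 k3) sin k2a, both sharing the denominator
k2(k1 + k3) cos k2a - i(k2^2 + k1 k3) sin k2a); the function names and the
sample wave numbers are illustrative assumptions.

```python
import cmath
import math

def slab_amplitudes(k1, k2, k3, a):
    """Reflected amplitude A and transmitted amplitude D for a section of length a."""
    c, s = math.cos(k2 * a), math.sin(k2 * a)
    denom = k2 * (k1 + k3) * c - 1j * (k2**2 + k1 * k3) * s
    A = (k2 * (k1 - k3) * c + 1j * (k2**2 - k1 * k3) * s) / denom
    D = 2 * k1 * k2 * cmath.exp(-1j * k3 * a) / denom
    return A, D

def coefficients(k1, k2, k3, a):
    """Reflection and transmission coefficients (fractions of incident energy flux)."""
    A, D = slab_amplitudes(k1, k2, k3, a)
    f_R = abs(A) ** 2
    f_T = (k3 / k1) * abs(D) ** 2
    return f_R, f_T

# Illustrative wave numbers (assumed); k_i is proportional to sqrt(rho_i)
f_R, f_T = coefficients(k1=1.0, k2=2.0, k3=3.0, a=0.5)
print(f_R, f_T, f_R + f_T)
```

When all three wave numbers are equal the section is invisible, and the
same functions return A = 0 and D = 1.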


Problem 1-5: When $\rho_1 = \rho_3$, calculate the $S$ matrix
for scattering by the central section and verify that it is unitary.


Problem 1-6: Put $\rho_1 = \rho_3$ and verify that as $\rho_2 \to \infty$
and $a \to 0$, such that $\rho_2 a = M$, the results reduce to those of
section 7.




Problem 1-7: An infinitely long string consists of
two sections which have densities $\rho_1$ and $\rho_2$, respectively. A wave
of unit amplitude incident on the junction of the sections from one side has
reflected and transmitted amplitudes $r$ and $t$, respectively. When the wave
is incident from the other side the amplitudes are $r'$ and $t'$,
respectively. Derive the relations
$$r^2 + tt' = 1, \qquad r' = -r$$
An interesting special result is the following: If the density in the
central portion is intermediate between the densities of the end portions,
the existence of that portion increases the transmission coefficient. On the
other hand, if its density is either greater than or less than both of the
end densities, it decreases the transmission coefficient. For the
transmission coefficient when $a = 0$ is
$$f_T(0) = \frac{4 k_1 k_3 k_2^2}{k_2^2 (k_1 + k_3)^2} \tag{1.110}$$
If this is subtracted from (1.109) the excess transmission due to the
central portion is found to be
$$f_T - f_T(0) = \frac{4 k_1 k_3 k_2^2}{[\ \ ][\ \ ]}\left\{k_2^2(k_1 + k_3)^2 - (k_2^2 + k_1 k_3)^2\right\}\sin^2 k_2 a \tag{1.111}$$
where the empty brackets in the denominator represent the denominators
of (1.109) and (1.110), respectively, both of which are positive. But the
quantity in the curly brackets may be written as
$$-(k_2^2 - k_1^2)(k_2^2 - k_3^2) = -\frac{\omega^4}{T^2}(\rho_2 - \rho_1)(\rho_2 - \rho_3)$$
which is positive if $\rho_2$ is intermediate between $\rho_1$ and $\rho_3$ and
negative otherwise.
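
The sign of the excess transmission can be illustrated with a few numbers.
This is a sketch under stated assumptions: the transmission coefficient is
coded as 4 k1 k3 k2^2 divided by k2^2 (k1 + k3)^2 cos^2(k2 a) +
(k2^2 + k1 k3)^2 sin^2(k2 a), its value without the central section as
4 k1 k3 / (k1 + k3)^2, and the wave numbers below are arbitrary sample
values (each k is proportional to the square root of the density).

```python
import math

def f_T(k1, k2, k3, a):
    """Transmission coefficient with a central section of wave number k2, length a."""
    c2 = math.cos(k2 * a) ** 2
    s2 = math.sin(k2 * a) ** 2
    return 4 * k1 * k3 * k2**2 / (k2**2 * (k1 + k3)**2 * c2
                                  + (k2**2 + k1 * k3)**2 * s2)

def f_T0(k1, k3):
    """Transmission coefficient when the central section is absent (a = 0)."""
    return 4 * k1 * k3 / (k1 + k3) ** 2

k1, k3, a = 1.0, 3.0, 0.5
for k2 in (0.5, 2.0, 5.0):   # below, between, and above the end values
    excess = f_T(k1, k2, k3, a) - f_T0(k1, k3)
    print(f"k2 = {k2}: excess transmission = {excess:+.4f}")
```

Only the intermediate value of k2 gives a positive excess.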


10. Inhomogeneous String and 
the Method of Separation of 
Variables 


The methods used up to this point do not lend themselves to obvious
extension if more general or complicated problems are considered; if, for
instance, the density or tension of the string varies with position $x$. The
problem of variable density is a natural one to consider. The tension also
may be variable, however, if, for instance, the string has weight which is
not negligible, and vibrates about a vertical position (in this case, it is
perhaps best to think of a flexible chain rather than a string), or is
whirled with very high angular velocity about a vertical axis to which it is
fixed. These problems are admittedly rather esoteric, but are of interest in
that they lead to a type of equation which will appear repeatedly in other
physical problems, and which is generally known as a Sturm-Liouville
equation.

Consider the problem of solving the general equation (1.4)
$$\rho(x)\frac{\partial^2 y}{\partial t^2} = \frac{\partial}{\partial x}\left[T(x)\frac{\partial y}{\partial x}\right]$$


We approach this problem, as we do most other partial differential
equations, by the "method of separation of variables." This method involves
seeking special solutions which are products of functions of the different
variables of the problem; in this case, solutions of the form
$$y = f(x)\,g(t) \tag{1.112}$$
If this is substituted into the equation, which is then divided through by
$y = fg$ and by $\rho(x)$, the problem becomes that of solving
$$\frac{1}{g}\frac{d^2 g}{dt^2} = \frac{1}{\rho f}\frac{d}{dx}\left[T(x)\frac{df}{dx}\right] \tag{1.113}$$
Now it is observed that the left-hand side of this equation is a function of
the variable $t$ only, while the right-hand side is a function of $x$ only.
Yet the equation must be valid for all values of $x$ and $t$. This is clearly
only possible if the functions appearing on both sides are equal to the same
constant. Thus we are led to the "separated" equations
$$\frac{1}{g}\frac{d^2 g}{dt^2} = \text{a constant } (-\lambda) \tag{1.114}$$
and
$$\frac{1}{\rho}\frac{d}{dx}\left(T\frac{df}{dx}\right) + \lambda f = 0 \tag{1.115}$$
both of which are ordinary differential equations.

Up to this point, there is nothing to prevent $\lambda$ from having an
arbitrary value. This situation is, however, changed by the application of
boundary conditions. In the case of constant $\rho$ and $T$, (1.115) becomes
$$\frac{d^2 f}{dx^2} + \frac{\lambda}{c^2} f = 0 \tag{1.116}$$
where $c^2 = T/\rho$. If, as an example, the ends of the string at $x = 0$
and $x = L$ are fixed, $y$ and therefore $f$ must take the value zero at each
of these points. If $-\lambda$ were a positive constant, the solutions of
(1.116) would be positive and negative exponentials (or alternatively
hyperbolic sines and cosines), no combination of which can be made equal to
zero at two points. Thus no negative value of $\lambda$ is permissible if the
boundary conditions are to be satisfied. [This is fortunate, of course,
since the solutions of the time equation (1.114) would then also be
exponentials, and we would obviously not be describing a vibration!]

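If, on the other hand, $\lambda$ is positive, a general solution of (1.116) is
$$f = A \sin\left(\frac{1}{c}\sqrt{\lambda}\,x + \phi\right) \tag{1.117}$$
where $A$ and $\phi$ are constants. But this is zero at $x = 0$ only if
$\phi = 0$, and at $x = L$ if
$$\frac{1}{c}\sqrt{\lambda}\,L = n\pi \tag{1.118}$$
where $n$ is an integer. Thus
$$\lambda_n = \left(\frac{n\pi c}{L}\right)^2 \tag{1.119}$$
$$f = A \sin\frac{n\pi x}{L} \tag{1.120}$$
and
$$g = B \sin\left(\frac{n\pi c t}{L} + \alpha\right) \tag{1.121}$$
giving a solution which is one of the terms of (1.46).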


Problem 1-8: A string is stretched between points
$x = 0$ and $x = L$. At time $t = 0$ it is pulled sideways a distance $y_0$ at
$x = a$ and released from rest. Show that the displacement at arbitrary time
is
$$y = \frac{2 y_0 L^2}{\pi^2 a (L - a)} \sum_{n=1}^{\infty} \frac{1}{n^2} \sin\frac{n\pi a}{L} \sin\frac{n\pi x}{L} \cos\frac{n\pi c t}{L}$$
$c$ being the velocity of propagation of a wave on the string. Calculate the
initial energy and verify directly conservation of energy.


The boundary conditions have, therefore, in this case, limited the
permissible values of $\lambda$ to a discrete set of values. These values are
a particular case of what are in general designated "eigenvalues"; the
corresponding solutions are known as "eigenfunctions."

Equation (1.115) is a slightly special case of an equation known as the
Sturm-Liouville equation. The Sturm-Liouville equation is
$$\frac{d}{dx}\left[p(x)\frac{dy}{dx}\right] - q(x)\,y + \lambda r(x)\,y = 0 \tag{1.122}$$
In (1.115) the function corresponding to $q$ is equal to zero. The function
$r(x)$, $\rho(x)$ in the case of the string, is known as the "weighting
function," and is taken throughout to be a function that is positive for the
range of $x$ on which the problem is defined.

It is not difficult to show that any second-order linear differential 
equation containing a y term with an eigenvalue multiplier can be trans- 
formed into the Sturm—Liouville form. For consider 


$$f_2(x)\frac{d^2 y}{dx^2} + f_1(x)\frac{dy}{dx} + f_0(x)\,y = 0 \tag{1.123}$$


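Let us multiply this equation by $g(x)$, and choose $g$ so that the
coefficient of $dy/dx$ is the derivative of that of $d^2y/dx^2$ (here $f_2$
and $f_1$ denote the coefficients of $d^2y/dx^2$ and $dy/dx$ in (1.123)):
$$\frac{d}{dx}(g f_2) = g f_1 \tag{1.124}$$
Putting $g f_2 = F$,
$$\frac{dF}{dx} = F\,\frac{f_1}{f_2} \tag{1.125}$$
from which it follows that
$$F = A \exp\int^x \frac{f_1(x')}{f_2(x')}\,dx' \tag{1.126}$$
$A$ being an arbitrary constant. Therefore if we choose the "integrating
factor"
$$g = \frac{A}{f_2(x)} \exp\int^x \frac{f_1(x')}{f_2(x')}\,dx' \tag{1.127}$$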


the differential equation may be written
$$F y'' + F' y' + f_0 g\,y = 0$$
(where $y' = dy/dx$, $y'' = d^2y/dx^2$, etc.), or
$$\frac{d}{dx}\left(F\frac{dy}{dx}\right) + f_0 g\,y = 0 \tag{1.128}$$

Of course, it need not always be true that Kg > 0; we, however, confine 
our attention to this case, the significance of which becomes evident 
when we discuss the question of orthogonality of eigenfunctions. 


Problem 1-9: Write in Sturm-Liouville form the
Bessel equation
$$\frac{d^2 y}{dx^2} + \frac{1}{x}\frac{dy}{dx} + \left(k^2 - \frac{n^2}{x^2}\right) y = 0$$


11. Boundary Conditions and the 
Eigenvalue Problem 


For each value of the eigenvalue quantity $\lambda$ the Sturm-Liouville
equation (1.122) has two independent solutions. One formal way to see
that this is the case is to think of expanding the solution in a Taylor's
series about any point $x_0$ at which the coefficients in the equation are
all finite:
$$y = \sum_n \frac{(x - x_0)^n}{n!}\, y^{(n)}(x_0) \tag{1.129}$$
where by $y^{(n)}(x_0)$ we mean the $n$th derivative of $y(x)$ calculated at
$x = x_0$. If we choose arbitrary values for $y(x_0)$ and $y'(x_0)$ (the first
derivative), the differential equation makes it possible to calculate the
second derivative $y''(x_0)$. Then, successive differentiations of the
equation will determine the higher derivatives. In this way, a formal
solution is obtained in Taylor's series form. Furthermore, because of the
linearity of the equation, the series has the form
$$y = y(x_0)\,y_1(x) + y'(x_0)\,y_2(x) \tag{1.130}$$
where $y_1$ and $y_2$ are separately solutions.

In a physical problem in which the Sturm-Liouville equation arises
(such as the problem of the vibrating string) the solution is normally
subjected to boundary conditions. Consider as an example the case in
which $y$ must be zero at $x = 0$ and $x = L$, and suppose that we try to
obtain the solution as a Taylor's series about $x = 0$. Then the first
solution in (1.130) will not appear. The condition that $y_2(x)$ be zero at
$x = L$ will then yield an equation for $\lambda$. Clearly, the two boundary
conditions will only be satisfied for particular values of $\lambda$. In
short, the boundary conditions determine the eigenvalues.
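
The way a condition at the far boundary selects discrete values of the
eigenvalue parameter can be illustrated numerically. The sketch below
replaces the Taylor series by a step-by-step numerical integration
(fourth-order Runge-Kutta) of the solution that starts from y(0) = 0, and
then adjusts the parameter by bisection until y(L) = 0; this "shooting"
procedure, the function names, and the bracketing intervals are assumptions
made for illustration and are not taken from the text.

```python
import math

def y_at_L(lam, p, q, r, L=1.0, steps=1000):
    """Integrate (p y')' - q y + lam r y = 0 from x = 0 with y(0) = 0, p y'(0) = 1.

    Uses classical RK4 on the pair (y, u) with u = p y'. Returns y(L).
    """
    h = L / steps

    def rhs(x, y, u):
        return u / p(x), (q(x) - lam * r(x)) * y

    x, y, u = 0.0, 0.0, 1.0
    for _ in range(steps):
        k1y, k1u = rhs(x, y, u)
        k2y, k2u = rhs(x + h / 2, y + h / 2 * k1y, u + h / 2 * k1u)
        k3y, k3u = rhs(x + h / 2, y + h / 2 * k2y, u + h / 2 * k2u)
        k4y, k4u = rhs(x + h, y + h * k3y, u + h * k3u)
        y += h / 6 * (k1y + 2 * k2y + 2 * k3y + k4y)
        u += h / 6 * (k1u + 2 * k2u + 2 * k3u + k4u)
        x += h
    return y

def eigenvalue(lo, hi, p, q, r, L=1.0, tol=1e-10):
    """Bisect on lam in [lo, hi], assuming y(L) changes sign exactly once there."""
    f_lo = y_at_L(lo, p, q, r, L)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = y_at_L(mid, p, q, r, L)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)

one = lambda x: 1.0    # p = r = 1: uniform string with c = 1
zero = lambda x: 0.0   # q = 0
lam1 = eigenvalue(5.0, 15.0, one, zero, one)
lam2 = eigenvalue(30.0, 50.0, one, zero, one)
print(lam1, math.pi ** 2)
print(lam2, 4 * math.pi ** 2)
```

For the uniform string on the unit interval the two values found agree
with the first two members of the sequence (n pi c / L)^2.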

We shall see, however, that the boundary conditions are not always so
clearly specified; one or more may simply require that the solution remain
finite for a physically significant value of the variable. Again, however,
this will only be the case for suitable values of $\lambda$.

If the boundary conditions are applied at finite points, $x = a$ and
$x = b$, and the solutions are both finite at these points, boundary
conditions
$$\alpha y' + \beta y = 0 \tag{1.131}$$
which prescribe a linear relationship between the function and its
derivative at each boundary lead in a similar manner to the determination of
the eigenvalue. For if such a relation exists at $x = a$, and we expand the
solution about this point, one of the constants in (1.130) may either be
eliminated or expressed in terms of the other. In this case, again, only a
suitable choice of $\lambda$ will permit the satisfaction of the condition at
the other boundary.


12. Orthogonality of 
Eigenfunctions 


This section is concerned with a very important theorem, which
says that, under conditions to be specified, the eigenfunctions of the
Sturm-Liouville equation are orthogonal. This orthogonality is defined
by the equation
$$\int_a^b y_m(x)\,y_n(x)\,r(x)\,dx = 0, \qquad m \neq n \tag{1.132}$$
where $y_m$ and $y_n$ are the eigenfunctions belonging to different
eigenvalues $\lambda_m$ and $\lambda_n$, and $r(x)$ is the weighting function.

The most general statement of the condition for the validity of the
orthogonality equation is that
$$p\,(y_m y_n' - y_n y_m')\Big|_a^b = 0 \tag{1.133}$$
the left-hand side being the difference of $p\,(y_m y_n' - y_n y_m')$ at the
points $b$ and $a$. This condition may be satisfied in various ways, the most
important of which are

(a) that boundary conditions (1.131) hold at each end, or

(b) that $y$ and $y'$ remain finite while $p = 0$ at one or both of the ends,
in place of the linear boundary condition.

A further possible condition would of course be that $p$, $y_m$, and $y_n$ be
periodic on the interval $(a, b)$.

The orthogonality theorem is proven as follows: $y_m$ and $y_n$ satisfy
$$(p y_m')' - q y_m + \lambda_m r y_m = 0 \tag{1.134}$$
and
$$(p y_n')' - q y_n + \lambda_n r y_n = 0 \tag{1.135}$$
where the primes denote differentiation. Let us multiply (1.134) by $y_n$,
(1.135) by $y_m$, subtract, and integrate from $a$ to $b$. Now
$$\int_a^b \left[y_n (p y_m')' - y_m (p y_n')'\right] dx = p\,(y_n y_m' - y_m y_n')\Big|_a^b$$
on integration. Thus, by (1.133), it is zero. The $q$ terms vanish, and one
has left only
$$(\lambda_m - \lambda_n)\int_a^b y_m(x)\,y_n(x)\,r(x)\,dx = 0 \tag{1.136}$$
The orthogonality theorem follows, since $\lambda_m \neq \lambda_n$.


Since the $y$'s are determined only to within a multiplicative constant,
we may choose them to obey the "normalization condition"
$$\int_a^b r\,y_n^2(x)\,dx = 1 \tag{1.137}$$
This relation, along with (1.132), may be written
$$\int_a^b r\,y_n y_m\,dx = \begin{cases} 1, & n = m \\ 0, & n \neq m \end{cases} \tag{1.138}$$
The functions $y_n$ are known as an "orthonormal" (mutually orthogonal
and normalized) set.
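
As a concrete illustration, the orthonormality integrals can be evaluated
numerically for the uniform string with fixed ends, whose eigenfunctions
are proportional to sin(n pi x / L) and whose weighting function is
constant. The Simpson-rule helper, the function names, and the choice
L = 1 with unit weight are assumptions made for this sketch.

```python
import math

def simpson(g, a, b, n=1000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = g(a) + g(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * g(a + i * h)
    return total * h / 3

L = 1.0

def y(n):
    """Normalized eigenfunction of the uniform string with fixed ends."""
    return lambda x: math.sqrt(2.0 / L) * math.sin(n * math.pi * x / L)

def overlap(m, n, r=lambda x: 1.0):
    """Weighted integral of y_m y_n over (0, L) with weighting function r."""
    return simpson(lambda x: r(x) * y(m)(x) * y(n)(x), 0.0, L)

for m in range(1, 4):
    print([round(overlap(m, n), 10) for n in range(1, 4)])
```

The printed array approximates the unit matrix: diagonal entries equal to 1
(normalization) and off-diagonal entries equal to 0 (orthogonality).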


13. Rayleigh-Ritz Variational 
Principle 


This section is concerned with a variational principle for the
eigenfunctions of the Sturm-Liouville equation. Consider the quantity
$$I(y) = \int_a^b (p y'^2 + q y^2)\,dx \bigg/ \int_a^b r y^2\,dx \tag{1.139}$$
We can show that the functions $y$ for which $I$ has a stationary value are
the eigenfunctions of the Sturm-Liouville equation. This statement has the
following meaning. Suppose $I$ is calculated for $y$ equal to an eigenfunction
$y_n$. Let us then calculate it for the function
$$y = y_n + \epsilon\phi \tag{1.140}$$
where $\epsilon$ is a parameter and $\phi$ is an arbitrary bounded function
which is zero at both ends of the interval of integration, $x = a$ and
$x = b$. Then $I$ is stationary for $y = y_n$ if
$$\left(\frac{dI}{d\epsilon}\right)_{\epsilon = 0} = 0 \tag{1.141}$$


Suppose (as will generally be the case in the problems we shall consider)
that $I$ is a local minimum in $\epsilon$ at $\epsilon = 0$. Then, our principle
states that $I$ will be lower when calculated for an eigenfunction than for
any function slightly displaced from it.
It is convenient to introduce the following notation: Let us expand $I$
as a series in $\epsilon$
$$I(y) = I(y_n) + \epsilon\,\delta I + O(\epsilon^2) \tag{1.142}$$
$\delta I$ then stands for the coefficient in the term in the expansion which
is linear in the parameter $\epsilon$. Equation (1.141) then may be written
$$\delta I = 0 \tag{1.143}$$


Before proving the variational principle, let us express it in another
way. If we write
$$I(y) = N/D \tag{1.144}$$
$N$ being the numerator and $D$ the denominator of (1.139), the principle
may equally well be written: For an eigenfunction
$$\delta N = 0 \tag{1.145}$$
subject to the auxiliary condition
$$D = \int_a^b r y^2\,dx = 1 \tag{1.146}$$
Thus, of all normalized functions, the ones for which the numerator $N$ is
stationary are the eigenfunctions.

In either case, of course, the principle is stated in such a way as to
ensure that the quantity being varied cannot be changed in value simply
by multiplying the function by an arbitrary constant.

That (1.145) and (1.146) are equivalent to (1.139) can be shown as
follows:
$$\delta I = \delta(N/D) = \frac{D\,\delta N - N\,\delta D}{D^2} \tag{1.147}$$
If $y$ is normalized, $D = 1$, $\delta D = 0$, and $\delta I = 0$ implies that
$\delta N = 0$ and vice versa.

If $y$ is not necessarily normalized, it may be seen from (1.147) that the
variational principle may be expressed in the form
$$\delta N - I\,\delta D = 0 \tag{1.148}$$
Under suitable conditions, the value of $I$ for $y = y_n$ may be shown to be
equal to the eigenvalue $\lambda_n$. For if we multiply the eigenvalue
condition (1.135) by $y_n$ and integrate from $x = a$ to $x = b$ and then
integrate the first term by parts, we get the equation
$$p\,y_n y_n'\Big|_a^b - \int_a^b (p y_n'^2 + q y_n^2)\,dx + \lambda_n \int_a^b r y_n^2\,dx = 0$$
Therefore, $\lambda_n = I$ provided
$$p\,y_n y_n'\Big|_a^b = 0 \tag{1.149}$$
This condition, it should be noted, is different from the boundary condition
(1.133) which defines the Sturm-Liouville problem. Nevertheless, there
are circumstances which will ensure the satisfaction of both, for example,
(a) one of $p$, $y$, or $y'$ = 0 at each end, or
(b) periodicity of $p$, $y$, and $y'$ on the interval $(a, b)$.


In either of these circumstances,
$$I(y_n) = \lambda_n \tag{1.150}$$
and the variational principle satisfied by the eigenvalues may be written as
$$\delta N - \lambda_n\,\delta D = 0 \tag{1.151}$$


14. Approximate Calculation of 
Eigenvalues from the Variational 
Principle 


The Rayleigh-Ritz principle may be used to estimate eigenvalues
when exact calculation proves difficult. The essential fact is that if we
calculate
$$I = \frac{N}{D}$$
for a function which is a reasonably good approximation to an
eigenfunction, the error being of order $\epsilon$, the value of $I$ will
differ from the minimum only to order $\epsilon^2$. If, as under the
conditions specified in the previous section, $I$ is the eigenvalue, we can
thus make a quite accurate estimate of it. This is particularly useful if
there is a lowest eigenvalue, in that we can in this way put an upper bound
on that eigenvalue.

It is, of course, important that we find a "trial function" which can
approximate reasonably well the true eigenfunction. The choice of good
trial functions requires skill. Usually, one's physical intuition suggests
its qualitative behavior. If one then chooses a function which can represent
such a behavior, and containing one or more disposable parameters, one
can minimize $I$ as a function of those parameters. The minimizing values
will then define the best possible approximation to the true eigenfunction
of the form chosen.

It must, of course, be emphasized that the function obtained in this 
way need not be a particularly good approximation to the true eigenfunc- 
tion, even if it gives an excellent approximation to the eigenvalue. This is, 
after all, the whole point of the variational method —that the discrepancy 
in the eigenvalue is only of second order relative to that in the eigen- 
function. 

It should be noted, finally, that one requirement of the trial function is 
that it should satisfy the boundary conditions of the problem. 

To demonstrate the accuracy of the method, consider a simple problem
for which the answer is known exactly, viz. the solution of the equation
$$y'' + \lambda y = 0$$
subject to $y = 0$ at $x = 0$ and $x = 1$. The eigenfunctions are of course
$\sin n\pi x$, where $n$ is an integer, and the eigenvalues are
$$\lambda = n^2\pi^2$$
In this case
$$I = \frac{\int_0^1 y'^2\,dx}{\int_0^1 y^2\,dx} \tag{1.152}$$
A simple trial function, containing only one parameter, is
$$y = x(1 - x)(1 + \alpha x) \tag{1.153}$$
The corresponding value of $\lambda$ is
$$\lambda = 10\,\frac{\tfrac25\alpha^2 + \alpha + 1}{\tfrac27\alpha^2 + \alpha + 1} \tag{1.154}$$
This is found to have zero derivative for $\alpha = 0$ and $\alpha = -2$, the
corresponding values of $\lambda$ being
$$\lambda_1 = 10 \quad\text{and}\quad \lambda_2 = 42$$
These values lie quite close to the lowest two eigenvalues,
$$\lambda_1 = \pi^2 = 9.870 \quad\text{and}\quad \lambda_2 = 4\pi^2 = 39.478$$
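
The values 10 and 42 can be reproduced exactly by integrating the
polynomial trial function x(1 - x)(1 + alpha x) term by term. The sketch
below uses exact rational arithmetic; the helper names (`poly_mul`,
`poly_diff`, `integrate01`, `rayleigh`) are illustrative and not part of
the text.

```python
from fractions import Fraction

def poly_mul(a, b):
    """Product of two polynomials given as coefficient lists (ascending powers)."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out

def poly_diff(a):
    """Derivative of a polynomial."""
    return [i * a[i] for i in range(1, len(a))]

def integrate01(a):
    """Integral of a polynomial over the interval (0, 1)."""
    return sum(c / (i + 1) for i, c in enumerate(a))

def rayleigh(alpha):
    """Ratio of the integrals of y'^2 and y^2 for y = x (1 - x)(1 + alpha x)."""
    alpha = Fraction(alpha)
    x = [Fraction(0), Fraction(1)]
    y = poly_mul(poly_mul(x, [Fraction(1), Fraction(-1)]), [Fraction(1), alpha])
    dy = poly_diff(y)
    return integrate01(poly_mul(dy, dy)) / integrate01(poly_mul(y, y))

print(rayleigh(0))     # stationary value at alpha = 0
print(rayleigh(-2))    # stationary value at alpha = -2
print(float(rayleigh(Fraction(1, 10))), float(rayleigh(Fraction(-1, 10))))
```

Values of alpha slightly on either side of zero give a larger ratio than
alpha = 0, as expected for an upper bound on the lowest eigenvalue.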


It may then be observed that with $\alpha = 0$, (1.153) is an even function
about the central point $x = \tfrac12$ of the interval, while for
$\alpha = -2$ it is odd. In fact, if we put
$$X = x - \tfrac12 \tag{1.155}$$
we obtain
$$y = \left(\tfrac14 - X^2\right)\left[\alpha X + \left(\tfrac12\alpha + 1\right)\right] \tag{1.156}$$
Is this symmetry property of the eigenfunctions fortuitous, or is it of
fundamental significance? It is not an accident at all. To see that this is
so, let us define a symmetry (parity) operator $P$ such that
$$P f(X) = f(-X) \tag{1.157}$$
Now under the operation $P$ both our differential equation and the boundary
conditions are unaltered. Therefore, if $y_n(X)$ is an eigenfunction,
$P y_n(X) = y_n(-X)$ is an eigenfunction with the same eigenvalue. The same
is then true of
$$y_n^{(+)}(X) = \tfrac12\left[y_n(X) + y_n(-X)\right] \tag{1.158}$$
and
$$y_n^{(-)}(X) = \tfrac12\left[y_n(X) - y_n(-X)\right] \tag{1.159}$$


which are, respectively, even and odd functions, and of any linear
combination
$$k_1 y_n^{(+)}(X) + k_2 y_n^{(-)}(X)$$
We may substitute this in (1.150); if we make use of the fact that the
integral of the product of an even and odd function is zero, we find that
$$I = \frac{\int_{-1/2}^{1/2} y'^2(X)\,dX}{\int_{-1/2}^{1/2} y^2(X)\,dX} = \frac{k_1^2 \int_{-1/2}^{1/2} \left[y_n^{(+)\prime}(X)\right]^2 dX + k_2^2 \int_{-1/2}^{1/2} \left[y_n^{(-)\prime}(X)\right]^2 dX}{k_1^2 \int_{-1/2}^{1/2} \left[y_n^{(+)}(X)\right]^2 dX + k_2^2 \int_{-1/2}^{1/2} \left[y_n^{(-)}(X)\right]^2 dX} = \frac{k_1^2 A + k_2^2 B}{k_1^2 C + k_2^2 D}$$
Minimizing with respect to $k_1$ and $k_2$, it is seen that the solutions
correspond either to $k_1 = 0$ or $k_2 = 0$. Thus, the eigenfunctions must be
either even or odd.

This example illustrates the value of symmetry principles in the treat- 
ment of eigenvalue problems for linear operator equations. This point 
is dealt with more fully in Chapter 2 on the general theory of linear 
operators. 

In the context of the present problem, symmetry considerations suggest
how to get better approximations to the lower eigenfunctions. To
improve the calculation of the lowest eigenvalue, we can choose an even
function containing one arbitrary parameter,
$$y = \left(\tfrac14 - X^2\right)(1 + A X^2) \tag{1.160}$$
On calculation it is found that
$$I = 6\,\frac{44A^2 + 28A + 35}{4A^2 + 12A + 21}$$
Minimization leads to the quadratic equation
$$52A^2 + 196A + 21 = 0$$
To approximate the lowest eigenvalue we must choose that root which
leads to the smaller value of $\lambda$; this turns out to be
$$A = -0.1103749$$
The corresponding value of $\lambda$ is
$$\lambda = 9.86975$$
compared with the correct value (to the same number of significant
figures)
$$\lambda = 9.86960$$
Thus, with only one arbitrary parameter, if it is well chosen, it is possible
to obtain the eigenvalue with very high accuracy.
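
The arithmetic of this minimization can be retraced in a few lines. The
sketch below takes the quoted ratio 6(44A^2 + 28A + 35)/(4A^2 + 12A + 21)
and the quadratic 52A^2 + 196A + 21 = 0 as given, solves the quadratic, and
evaluates the ratio at both roots; it does not rederive the ratio from the
trial function, and the name `I_quoted` is illustrative.

```python
import math

def I_quoted(A):
    """The ratio quoted in the text as a function of the parameter A."""
    return 6 * (44 * A**2 + 28 * A + 35) / (4 * A**2 + 12 * A + 21)

# Roots of 52 A^2 + 196 A + 21 = 0
disc = math.sqrt(196**2 - 4 * 52 * 21)
roots = [(-196 + disc) / 104, (-196 - disc) / 104]

best = min(roots, key=I_quoted)   # root giving the smaller value of the ratio
print(best, I_quoted(best))
print(math.pi ** 2)               # exact lowest eigenvalue for comparison
```

The estimate lies above pi squared, as an upper bound must, and differs
from it only in the fifth significant figure.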

What of the calculation of the higher eigenvalues of each symmetry?
Of all functions orthogonal to the lowest eigenfunction,³ that which yields
the minimum value for the integral $I$ is the second lowest. This remark,
however, has limited practical value, since, unless one knows the lowest
eigenfunction exactly, one cannot apply to trial functions the necessary
orthogonality condition. It is, of course, possible to choose only functions
orthogonal to the approximate form of the lowest eigenfunction. It may
be possible, by choosing a trial function judiciously, to get a quite close
estimate of the next eigenvalue. However, one cannot be sure, in this
case, that the estimate will be above the true eigenvalue; it might, in fact,
be either above or below it.


Problem 1-10: Using the trial function
$$y = X\left(\tfrac14 - X^2\right)(1 + B X^2)$$
estimate the second lowest eigenvalue.


It should be emphasized in any case, that whether or not the variational 
principle yields an approximation to the lowest eigenvalue depends 
entirely on whether the trial function can, for suitable value of its param- 
eters, approximate the lowest eigenfunction more closely than any other. 
This point may be emphasized by pointing out that if, by some accident, 
we were to choose a trial function which was orthogonal to the five lowest 
eigenfunctions, it could at best give us an approximation to the sixth 
eigenvalue. 


Problem 1-11: Show that the trial functions given
by the two values of $\alpha$ in (1.153) are mutually orthogonal.


Problem 1-12: Set up the Rayleigh-Ritz variational
principle for the zeros of the Bessel function of order $n$, $J_n(kx)$, which
satisfies the differential equation
$$y'' + \frac{1}{x}\,y' + \left(k^2 - \frac{n^2}{x^2}\right) y = 0$$
and the boundary conditions: (i) at $x = 0$, $J_n$ is an even or odd function
according as $n$ is even or odd, and (ii) $J_n(kx) = 0$ at $x = 1$.
For the zero-order Bessel function, use the trial wave function
$$y = (1 - x^2)(1 + \alpha x^2)$$
to get approximate values for the two lowest zeros.
Show that the trial functions for the two values of $\alpha$ are mutually
orthogonal.


³We use the term "lowest eigenfunction" as an abbreviation for
"eigenfunction corresponding to the lowest eigenvalue."


Problem 1-13: A similar result was obtained for the 
oscillator equation. Can you find a general theorem of which these are 
particular examples? 

Problem 1-14: Using (1—.x?) times a quartic orthog- 
onal to the approximate lowest eigenvalue, estimate the next eigenvalue 
of the same symmetry. 


Problem 1-15: Solve the same problem for the Bessel
function of order 1, using the trial function $y = x(1 - x^2)(1 + a x^2)$.


Answers: For $J_0$, the approximate values of the two
lowest zeros, using the one-parameter trial function, are found to be
2.4050 and 6.0731, compared with correct values 2.4049 and 5.5201.
With $y = (1 - x^2)$ times a quartic, we get 5.5418. For $J_1$, the approximate
values are 3.8343 and 8.0807, and the correct ones 3.8318 and 7.0155.
Note that the higher zero is, in all these examples, given less accurately
than the lower. Can you give a plausible explanation why this is so?
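The one-parameter results quoted in the Answers can be reproduced with a short calculation. The following is only a sketch: it assumes the variational quotient $k^2 = \int_0^1 (x\, y'^2 + n^2 y^2/x)\, dx \big/ \int_0^1 x\, y^2\, dx$, obtained by writing the Bessel equation in self-adjoint form, and the helper names (pmul, pdiff, integral_01, ritz_zeros) are illustrative, not taken from the text.

```python
from fractions import Fraction
from math import sqrt


def pmul(a, b):
    """Product of two polynomials given as coefficient lists, lowest power first."""
    out = [Fraction(0)] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def pdiff(a):
    """Derivative of a polynomial given as a coefficient list."""
    return [i * c for i, c in enumerate(a)][1:]


def integral_01(a, shift):
    """Exact integral over (0, 1) of x**shift times the polynomial a."""
    return sum(Fraction(c) / (i + shift + 1) for i, c in enumerate(a) if c != 0)


def ritz_zeros(n, u, v):
    """Estimates of the two lowest zeros of J_n from the trial function u + a*v."""
    def A(f, g):  # numerator form: int x f' g' dx + n^2 int f g / x dx
        val = integral_01(pmul(pdiff(f), pdiff(g)), 1)
        if n:
            val += n * n * integral_01(pmul(f, g), -1)
        return val

    def B(f, g):  # denominator form: int x f g dx
        return integral_01(pmul(f, g), 1)

    a11, a12, a22 = A(u, u), A(u, v), A(v, v)
    b11, b12, b22 = B(u, u), B(u, v), B(v, v)
    # Stationarity in a gives det(A - k^2 B) = 0, a quadratic in k^2.
    qa = float(b11 * b22 - b12 * b12)
    qb = float(-(a11 * b22 + a22 * b11 - 2 * a12 * b12))
    qc = float(a11 * a22 - a12 * a12)
    root = sqrt(qb * qb - 4 * qa * qc)
    return sqrt((-qb - root) / (2 * qa)), sqrt((-qb + root) / (2 * qa))


# J_0: y = (1 - x^2)(1 + a x^2)   ->  u = 1 - x^2,  v = x^2 - x^4
zeros_j0 = ritz_zeros(0, [1, 0, -1], [0, 0, 1, 0, -1])
# J_1: y = x(1 - x^2)(1 + a x^2)  ->  u = x - x^3,  v = x^3 - x^5
zeros_j1 = ritz_zeros(1, [0, 1, 0, -1], [0, 0, 0, 1, 0, -1])
print(zeros_j0, zeros_j1)
```

The integrals are done exactly with rational arithmetic, so the only rounding enters when the quadratic for $k^2$ is solved.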


15. Expansion in Eigenfunctions 


In the vibrating string problem, the method of separation of variables
can provide solutions which are products of functions of the two variables,
and which satisfy the boundary conditions. The spatial functions are the
eigenfunctions of the Sturm-Liouville problem.

Because of the linearity of the original differential equation, any 
linear combination of these separated (product) solutions also satisfies 
both the equation and the boundary conditions. The most general such 




solution is

$$ y = F(x, t) = \sum_n y_n(x)\, g_n(t) \tag{1.161} $$

where the arbitrary constants are absorbed into the $g_n$ (we assume the $y_n$'s to
be normalized).
Now the initial conditions of the problem are in the form

$$ y = f_0(x) \quad \text{at } t = 0 \tag{1.162} $$

and

$$ \frac{\partial y}{\partial t} = v_0(x) \quad \text{at } t = 0 \tag{1.163} $$

We will be able to satisfy these conditions with the solution (1.161) if we
can choose the two arbitrary constants in the $g_n$'s to satisfy the equations

$$ f_0(x) = \sum_n g_n(0)\, y_n(x) \tag{1.164} $$

and

$$ v_0(x) = \sum_n \dot{g}_n(0)\, y_n(x) \tag{1.165} $$

where $\dot{g}_n = dg_n/dt$.

The question is, then, whether the somewhat arbitrary functions $f_0$
and $v_0$ can be represented as series in the eigenfunctions $y_n(x)$ of the
Sturm-Liouville problem. We say “somewhat” arbitrary because physics
certainly does not permit the functions which are to be expanded to be
anything but a very well-behaved special set out of the totality of
“mathematician's functions.” They must, for instance, be continuous and
bounded.

We assume “without proof” that the functions $y_n(x)$ form an infinite
set, and also that the eigenvalues form a set tending to infinity. To talk of 
“proving” these statements is in fact not to put the problem correctly. In 
all of the physical problems with which we deal, it will be found, by actual 
calculation, that these conditions hold. It is therefore clearly possible 
to “prove” them under some conditions broad enough to encompass all 
physicists’ problems. The mathematical question might then be inverted 
to ask, what are the most general conditions under which the statements 
can be proven? Put this way, however, the question is really not of much 
interest to a physicist. If all of the problems he turns up are of this type, he 
will be interested in whether the expansion theorem can be proven under 
these conditions, and this we show. If, perchance, he should one day meet 
a problem in which these conditions do not hold, he will, either using his 
own intelligence, or by consulting a mathematician, have to find other 
methods of proceeding. 




One further remark should be made before proceeding: We
have made the assumption that the eigenfunctions and eigenvalues form a
discrete set, which may then be enumerated by integers n. There are,
however, physical problems in which the “spectrum” of eigenvalues is, at
least in part, continuous. In this case the eigenvalues will be enumerated
by a continuous variable $\alpha$. At least part of the “sum” in the expansion
theorem will then have to be replaced by an integral, and we have

$$ y(x) = \sum_n c_n y_n(x) + \int c(\alpha)\, y(x; \alpha)\, d\alpha \tag{1.166} $$

The sum may then be either a finite or an infinite one. The orthogonality
theorem still holds, but we can see that it is necessary to modify somewhat
the normalization condition. For suppose we multiply both sides of
(1.166) by $r(x)\, y(x; \alpha')$ and integrate over x. Assume that we can
interchange the order of the integrations. The integral

$$ \int y(x; \alpha)\, y(x; \alpha')\, r(x)\, dx $$

is different from zero only for one value of $\alpha$. Now the left-hand side of the
equation will certainly not be zero, in general. The contribution from the
sum on the right, on the other hand, is zero by orthogonality. Thus, the
integral of $c(\alpha)$ with a function which is zero at all points except $\alpha = \alpha'$
must be finite. Such a “function” is known as a “Dirac $\delta$ function” and
is written $\delta(\alpha - \alpha')$. Then $\delta(\alpha - \alpha') = 0$ except when $\alpha = \alpha'$, yet
$\int c(\alpha)\, \delta(\alpha - \alpha')\, d\alpha$ is finite. We specifically define the “$\delta$ function” so that

$$ \int c(\alpha)\, \delta(\alpha - \alpha')\, d\alpha = c(\alpha') \tag{1.167} $$

whenever the range of integration includes the value $\alpha = \alpha'$. Thus, the
integral of the “$\delta$ function” itself is unity. The “equation”

$$ \int y(x; \alpha)\, y(x; \alpha')\, r(x)\, dx = \delta(\alpha - \alpha') \tag{1.168} $$

then represents a way of “normalizing” the functions y. Multiplying y by a
constant A simply produces a multiple $A^2$ of the “$\delta$ function” on the right.
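The defining property (1.167) can be illustrated numerically by replacing the $\delta$ function with a narrow peak of unit area. The sketch below uses a Gaussian of adjustable width, which is one conventional choice and an assumption made here rather than anything specified in the text; the function names are likewise illustrative.

```python
from math import cos, exp, pi, sqrt


def gaussian(alpha, width):
    """Unit-area Gaussian, used here as a stand-in for the delta function."""
    return exp(-0.5 * (alpha / width) ** 2) / (width * sqrt(2 * pi))


def sift(c, alpha_prime, width, steps=2000):
    """Simpson estimate of the integral of c(alpha) * gaussian(alpha - alpha_prime)."""
    lo, hi = alpha_prime - 10 * width, alpha_prime + 10 * width
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        alpha = lo + i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * c(alpha) * gaussian(alpha - alpha_prime, width)
    return total * h / 3


# As the peak narrows, the integral approaches c(alpha') = cos(0.3).
for width in (0.5, 0.1, 0.01):
    print(width, sift(cos, 0.3, width), cos(0.3))
```

As the width shrinks, the integral picks out the value of $c(\alpha)$ at $\alpha = \alpha'$, which is all that is ever required of the $\delta$ function in what follows.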

The terms “$\delta$ function,” “equation,” and “normalizing” in the preceding
paragraphs have been put between quotation marks because, in the
strict mathematical sense, the “$\delta$ function” is not a function at all, so that
both the “equation” and the “normalization” have not quite the expected
meanings. The “$\delta$ function” (which we refer to henceforth without its
qualifying quotation marks) represents a generalization of the concept of
function known to mathematicians as a “distribution.”⁴ Within the theory


⁴ Lighthill, M. J., Introduction to Fourier Analysis and Generalized Functions. London:
Cambridge University Press, 1958.




of distributions, the operations which we apply to $\delta$ functions in this book
can be made mathematically valid and meaningful. In any case, $\delta$ functions
were first used by physicists for the description of physical reality,
and the validity of the operations carried out on them can be justified on
physical grounds.

Continuous eigenvalue spectra most often arise in cases in which the
independent variable has an infinite range. But infinite ranges of variables
are, from the point of view of physics, only convenient fictions. Normally,
one can approximate actual physical conditions just as well by the
application of boundary conditions at finite, if large, values. Consequently, it
is usually possible to avoid continuous-spectrum problems, although it is
not always technically convenient to do so. For the subsequent
development, we confine our attention to discrete spectra. When, in
Chapter 3, we explore the transition from Fourier series to Fourier
integrals, it will be seen what sort of considerations are involved in the
transition from a finite to an infinite range.

Returning now to the question of expansion in eigenfunctions,
suppose it is assumed that a function f(x) can be so expanded:

$$ f(x) = \sum_n a_n y_n(x) \tag{1.169} $$

where the $y_n$'s are normalized. The $a_n$'s may be found simply by
multiplying the equation by any particular eigenfunction $y_m(x)$ and by the
weighting function r(x), and integrating between the values of x for which
the boundary conditions are specified. Because of the orthogonality of the
eigenfunctions, only one term survives on the right-hand side, and the
coefficient is determined:

$$ a_m = \int f(x)\, y_m(x)\, r(x)\, dx \tag{1.170} $$

Thus, if the expansion is valid, the coefficient is determined.

We may understand (1.170) better if we proceed in a different way.
Suppose one tries to make a best weighted least-squares fit to f(x) by a
finite sum of the first N terms of the series. To do so we want to minimize

$$ \Delta_N = \int \Big[ f(x) - \sum_{n=1}^{N} c_n y_n(x) \Big]^2 r(x)\, dx \tag{1.171} $$

Using the orthonormality of the $y_n$'s, this may be written

$$ \Delta_N = \int [f(x)]^2 r(x)\, dx - 2 \sum_{n=1}^{N} c_n a_n + \sum_{n=1}^{N} c_n^2 $$

where $a_n$ is given by (1.170) and the integrals are over the range $(x_1, x_2)$.




$\Delta_N$ is minimized by the condition

$$ \frac{\partial \Delta_N}{\partial c_n} = 0 $$

from which it is found that

$$ c_n = a_n $$

Thus, the coefficients (1.170) give the best approximation to f(x) by a
finite series of eigenfunctions.

An obvious way of posing the question of the validity of the expansion
theorem is, now, to ask whether $\Delta_N \to 0$ as $N \to \infty$. For if it does,
since the integrand of $\Delta_N$ is never negative, it must be everywhere zero.
Again, this need not be so for “mathematicians' functions”; f(x) and the
series could differ on a set of points of measure zero. This, however,
does not represent a physically realizable situation, due to the
well-behaved nature of the physicists' functions.

Consider the function

$$ \phi_N(x) = f(x) - \sum_{n=1}^{N} a_n y_n(x) \tag{1.172} $$

This may immediately be shown to be orthogonal to all of the $y_n$'s for
$n \le N$. This function may be normalized by dividing it by $\Delta_N^{1/2}$, where

$$ \Delta_N = \int [\phi_N(x)]^2 r(x)\, dx \tag{1.173} $$


Let us now recall the variational theorem as expressed in (1.139)
and (1.141). This theorem stated that the quantity I(y) defined there was
a minimum when y was the lowest eigenfunction of the Sturm-Liouville
problem. It also followed that, of all functions orthogonal to the lowest
eigenfunction, that which made I(y) a minimum was the second lowest
eigenfunction, and so on. Thus, of all functions orthogonal to the first
N eigenfunctions, that which makes I(y) a minimum is the (N + 1)st
eigenfunction, which gives it a value equal to the (N + 1)st eigenvalue
$\lambda_{N+1}$. Therefore,

$$ I(\phi_N) \ge \lambda_{N+1} $$

or

$$ \frac{1}{\Delta_N} \int_{x_1}^{x_2} \left( p\, \phi_N'^2 + q\, \phi_N^2 \right) dx \ge \lambda_{N+1} \tag{1.174} $$

Therefore, when, as $N \to \infty$, $\lambda_{N+1} \to \infty$, subject only to the condition that
the integral exists, it follows that $\Delta_N \to 0$, and the expansion theorem
follows.
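The behaviour of $\Delta_N$ can be watched directly in a simple case. The sketch below takes the uniform string on (0, 1) with weight r(x) = 1, for which the normalized eigenfunctions are $\sqrt{2} \sin n\pi x$, and expands the sample function f(x) = x(1 - x). Both choices are assumptions made for illustration, as are the function names.

```python
from math import pi, sin, sqrt


def simpson(func, a, b, steps=2000):
    """Composite Simpson rule on (a, b)."""
    h = (b - a) / steps
    total = func(a) + func(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3


def f(x):
    """Sample function to expand; it vanishes at both ends of the string."""
    return x * (1.0 - x)


def y(n, x):
    """Normalized eigenfunctions of the uniform string on (0, 1), r(x) = 1."""
    return sqrt(2.0) * sin(n * pi * x)


def coefficient(n):
    """a_n as given by (1.170)."""
    return simpson(lambda x: f(x) * y(n, x), 0.0, 1.0)


def delta(N):
    """Delta_N of (1.171), evaluated directly with c_n = a_n."""
    a = [coefficient(n) for n in range(1, N + 1)]

    def residual_sq(x):
        partial = sum(a[n - 1] * y(n, x) for n in range(1, N + 1))
        return (f(x) - partial) ** 2

    return simpson(residual_sq, 0.0, 1.0)


deltas = [delta(N) for N in (1, 3, 5, 7, 9)]
print(deltas)
```

For this smooth f the mean-square error falls by several orders of magnitude within the first few terms, in line with the argument based on (1.174).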




16. The Inhomogeneous Problem
for the Vibrating String


First, let us recapitulate the solution of the homogeneous equation.
The results of separation of variables were shown in (1.114) and (1.115).
The latter determines the eigenvalues $\lambda = \lambda_n$. Solution of (1.114) shows
that $\sqrt{\lambda_n}$ is the circular frequency of the periodic time-dependent
function g. We therefore write

$$ \lambda_n = \omega_n^2 \tag{1.175} $$


Let us now consider the problem of the vibrating string (which may
be inhomogeneous) under the influence of an external applied force. The
latter will be described by a “force density” or force per unit length
F(x, t), which will of course also in general vary with time. The equation
of motion is then

$$ \rho \frac{\partial^2 y}{\partial t^2} = \frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) + F(x, t) \tag{1.176} $$

It is actually convenient to introduce, in place of the force per unit length
F, the force per unit mass

$$ E(x, t) = F(x, t)/\rho \tag{1.177} $$


A solution of (1.176) may be found by expanding both y and E in
eigenfunctions of the associated Sturm-Liouville problem, the
coefficients being functions of t:

$$ y = \sum_n c_n(t)\, y_n(x) \tag{1.178} $$

$$ E(x, t) = \sum_n e_n(t)\, y_n(x) \tag{1.179} $$


Substituting in (1.176), making use of the fact that $y_n$ satisfies the
eigenvalue equation

$$ \frac{d}{dx} \left( T \frac{dy_n}{dx} \right) = -\rho\, \omega_n^2\, y_n \tag{1.180} $$

and equating coefficients of each $y_n$, we obtain the equation

$$ \frac{d^2 c_n}{dt^2} = -\omega_n^2 c_n + e_n \tag{1.181} $$

It is easily verified that the solution of this equation satisfying the initial
conditions $c_n(0) = \dot{c}_n(0) = 0$ is

$$ c_n(t) = \frac{1}{\omega_n} \int_0^t e_n(t') \sin \omega_n (t - t')\, dt' \tag{1.182} $$




Substitution in (1.178) then gives the solution of the problem when the
string is initially at rest and the force is applied starting at t = 0.

An interesting particular case is that in which the applied force has a
definite frequency $\omega_0$. This may be solved from (1.182) with

$$ e_n(t) = e_n(0) \sin \omega_0 t \tag{1.183} $$


Problem 1-16: Solve the problem of the uniform
string for which the end at x = 0 is fixed and that at x = L is jiggled
according to the law

$$ y(L) = a \cos \omega_0 t $$

and was at rest at time t = 0. [Hint: Introduce as dependent variable
$z = y - a (x/L) \cos \omega_0 t$.]

Problem 1-17: Solve the problem of the motion of a
string fixed at x = 0 and x = L for the external force per unit mass

$$ E(x, t) = \delta(x - x')\, \delta(t - t') $$

[The resulting solution $y = G(x, x'; t - t')$ is the “Green's function” of the
problem.]


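On substitution of (1.183) in (1.182) and evaluation of the integral it is found that

$$ c_n(t) = \frac{e_n(0)}{\omega_n^2 - \omega_0^2} \left( \sin \omega_0 t - \frac{\omega_0}{\omega_n} \sin \omega_n t \right) \tag{1.184} $$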
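The closed form (1.184) can be checked against a direct numerical evaluation of the integral (1.182) for a sinusoidal force. The sketch below uses Simpson's rule; the sample frequencies and the function names are arbitrary choices made for illustration.

```python
from math import sin


def duhamel(e, omega_n, t, steps=4000):
    """(1/omega_n) * integral from 0 to t of e(t') sin(omega_n (t - t')) dt', by Simpson's rule."""
    h = t / steps
    total = 0.0
    for i in range(steps + 1):
        tp = i * h
        weight = 1 if i in (0, steps) else (4 if i % 2 else 2)
        total += weight * e(tp) * sin(omega_n * (t - tp))
    return total * h / 3 / omega_n


def closed_form(omega_n, omega_0, t, amplitude=1.0):
    """Right-hand side of (1.184), valid for omega_0 different from omega_n."""
    return amplitude / (omega_n ** 2 - omega_0 ** 2) * (
        sin(omega_0 * t) - omega_0 / omega_n * sin(omega_n * t)
    )


omega_n, omega_0, t = 3.0, 2.0, 5.0
numeric = duhamel(lambda tp: sin(omega_0 * tp), omega_n, t)
print(numeric, closed_form(omega_n, omega_0, t))
```

Both the displacement and the velocity given by the closed form vanish at t = 0, as the initial conditions require.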
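The energy absorbed by the string per unit time is

$$ \frac{\partial W}{\partial t} = \int_0^L \rho(x')\, E(x', t)\, \frac{\partial y}{\partial t}(x', t)\, dx' \tag{1.185} $$

Using the expansions (1.178) and (1.179) and the orthonormality of the
$y_n$'s, this becomes

$$ \frac{\partial W}{\partial t} = \sum_n e_n(t)\, \frac{dc_n(t)}{dt} \tag{1.186} $$

and from (1.183) and (1.184) we find that

$$ \frac{\partial W}{\partial t} = \sum_n \frac{1}{2} [e_n(0)]^2 \frac{\omega_0}{\omega_n^2 - \omega_0^2} \left[ \sin 2\omega_0 t - \sin (\omega_n + \omega_0) t - \sin (\omega_0 - \omega_n) t \right] \tag{1.187} $$

The average increase of energy of the string over a long period of time is
zero unless $\omega_0 = \omega_n$, in which case all terms average to zero except the
nth; from it, however,

$$ \frac{\partial W}{\partial t} = \frac{1}{4} [e_n(0)]^2\, t\, (1 - \cos 2\omega_n t) \tag{1.188} $$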




In this case the energy of the string increases with time. Mathematically,
it becomes infinitely great; it must be remembered, however, that as soon
as the oscillations cease to be small, the approximations used in setting
up the problem become invalid.

What we have shown, of course, is that the string “resonates” to
applied frequencies equal to any of its natural frequencies. This
resonance occurs, however, only when $\omega_0 = \omega_n$ exactly; that is, the resonance
has no width.

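If there is a frictional or resistive force proportional to the velocity,
such as would be provided in practice by air resistance, the equation of
motion takes the form

$$ \rho \frac{\partial^2 y}{\partial t^2} = \frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) - k \rho \frac{\partial y}{\partial t} + \rho E \tag{1.189} $$

where k is the resistive force constant per unit mass. Expanding both
sides as series in the eigenfunctions $y_n(x)$ and equating coefficients of the
two sides gives equations for the coefficients $c_n$:

$$ \ddot{c}_n + \omega_n^2 c_n + k \dot{c}_n = e_n \tag{1.190} $$

If $e_n = e_n(0) \sin \omega_0 t = e_n(0)\, \mathrm{Im}\, e^{i \omega_0 t}$, where Im means “imaginary part of,”
putting $c_n = \mathrm{Im}\, \gamma_n e^{i \omega_0 t}$ yields a particular integral of (1.190):

$$ c_n = \mathrm{Im}\, \frac{e_n(0)\, e^{i \omega_0 t}}{\omega_n^2 - \omega_0^2 + i k \omega_0} = \frac{e_n(0) \sin (\omega_0 t - \beta_n)}{\sqrt{(\omega_n^2 - \omega_0^2)^2 + k^2 \omega_0^2}} \tag{1.191} $$

where

$$ \tan \beta_n = \frac{k \omega_0}{\omega_n^2 - \omega_0^2} \tag{1.192} $$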


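The rate of energy absorption formula (1.186) now becomes

$$ \frac{\partial W}{\partial t} = \sum_n [e_n(0)]^2 \frac{\omega_0}{\sqrt{(\omega_n^2 - \omega_0^2)^2 + k^2 \omega_0^2}} \sin \omega_0 t \cos (\omega_0 t - \beta_n) \tag{1.193} $$

The average rate is zero except for those terms in which the force and
velocity are in phase. In that case we can average over a number of cycles:

$$ \langle \sin \omega_0 t \cos (\omega_0 t - \beta_n) \rangle = \frac{1}{2} \sin \beta_n = \frac{1}{2} \frac{k \omega_0}{\sqrt{(\omega_n^2 - \omega_0^2)^2 + k^2 \omega_0^2}} \tag{1.194} $$

where the brackets $\langle\ \rangle$ indicate a time average. Therefore

$$ \left\langle \frac{\partial W}{\partial t} \right\rangle = \frac{1}{2} \sum_n \frac{k \omega_0^2}{(\omega_0^2 - \omega_n^2)^2 + k^2 \omega_0^2}\, [e_n(0)]^2 \tag{1.195} $$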
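The averaged absorption rate for a single mode can be compared with a direct integration of the damped equation (1.190). The sketch below integrates the equation from rest with a fourth-order Runge-Kutta step, waits for the transient to decay, and averages the product of force and velocity over whole cycles. The parameter values, step counts and function names are arbitrary choices made for illustration, and the comparison formula is the single-mode term $\tfrac{1}{2} k \omega_0^2 [e_n(0)]^2 / [(\omega_n^2 - \omega_0^2)^2 + k^2 \omega_0^2]$ that follows from the steady-state solution.

```python
from math import pi, sin


def simulated_mean_power(omega_n, omega_0, k, amp=1.0,
                         settle_cycles=40, avg_cycles=10, steps_per_cycle=400):
    """Average of e_n(t) * dc_n/dt in the steady state of (1.190), by RK4."""
    h = 2 * pi / omega_0 / steps_per_cycle

    def force(t):
        return amp * sin(omega_0 * t)

    def accel(t, c, v):
        return force(t) - k * v - omega_n ** 2 * c

    t, c, v = 0.0, 0.0, 0.0
    n_settle = settle_cycles * steps_per_cycle
    n_avg = avg_cycles * steps_per_cycle
    total = 0.0
    for step in range(n_settle + n_avg):
        if step >= n_settle:
            total += force(t) * v  # sample the power once per step
        k1c, k1v = v, accel(t, c, v)
        k2c, k2v = v + 0.5 * h * k1v, accel(t + 0.5 * h, c + 0.5 * h * k1c, v + 0.5 * h * k1v)
        k3c, k3v = v + 0.5 * h * k2v, accel(t + 0.5 * h, c + 0.5 * h * k2c, v + 0.5 * h * k2v)
        k4c, k4v = v + h * k3v, accel(t + h, c + h * k3c, v + h * k3v)
        c += h * (k1c + 2 * k2c + 2 * k3c + k4c) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        t += h
    return total / n_avg


def predicted_mean_power(omega_n, omega_0, k, amp=1.0):
    """Single-mode term of the averaged absorption rate."""
    return 0.5 * amp ** 2 * k * omega_0 ** 2 / (
        (omega_n ** 2 - omega_0 ** 2) ** 2 + k ** 2 * omega_0 ** 2
    )


simulated = simulated_mean_power(2.0, 1.7, 0.5)
predicted = predicted_mean_power(2.0, 1.7, 0.5)
print(simulated, predicted)
```

Because the transient decays like $e^{-kt/2}$, forty driving cycles are ample for the values used here; a much smaller k would need a longer settling time.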




This shows resonant peaks for $\omega_0$ in the neighborhood of the natural
frequencies $\omega_n$, but these peaks now have a “width.” One frequently
specifies the half-width at half-maximum, which is

$$ |\omega_0^2 - \omega_n^2| = k \omega_n \tag{1.196} $$

The “strength” of the resonance (value of $\partial W/\partial t$ at resonance) is
@,|kw,, 


and so decreases as the resonant frequency increases. As $k \to 0$, the
strength of the resonance approaches infinity and its width approaches
zero, as seen above.


17. Green’s Function 


The previous section has been concerned with the solution of the
inhomogeneous problem: the determination of the response of a system to
an external stimulus. The concept of a “Green's function” is a very useful
one for this problem. In the present section we introduce a one-dimensional
Green's function associated with the Sturm-Liouville problem, and
show how it may be used to solve the problem of the vibrating string
under the influence of an external force which is periodic in time but has 
an arbitrary spatial variation. In Chapter 5, we generalize the concept 
of Green’s functions to two or more dimensions, one corresponding to 
each independent variable of the problem. We will then be able to dis- 
cuss the problem of arbitrary time as well as space variation. However, 
this is conveniently done only after the development of the theory of 
Fourier transforms. For the present, the more limited problem will 
serve to introduce the concept and to illustrate the use of Green’s 
functions. 

If, in the string problem, we assume an external force with a periodic
time variation $e^{i \omega t}$,

$$ F(x, t) = \rho(x)\, w(x)\, e^{i \omega t} \tag{1.197} $$

it is clearly possible to find a particular solution of the differential equation
in which y has the same time variation

$$ y = u(x)\, e^{i \omega t} \tag{1.198} $$

The inhomogeneous differential equation for u is then

$$ \frac{d}{dx} \left( T \frac{du}{dx} \right) + \omega^2 \rho\, u = -\rho\, w \tag{1.199} $$


A solution may be found by expanding both sides of the equation as a 




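series in the normalized eigenfunctions $y_n(x)$ of the homogeneous
problem:

$$ u = \sum_n c_n y_n(x) \tag{1.200} $$

$$ w(x) = \sum_n a_n y_n(x) \tag{1.201} $$

The coefficients in (1.201) are

$$ a_n = \int_0^L y_n\, w\, \rho\, dx \tag{1.202} $$

Substituting the expansions in (1.199), equating coefficients, and using the
eigenvalue equation

$$ \frac{d}{dx} \left( T \frac{dy_n}{dx} \right) + \omega_n^2 \rho\, y_n = 0 \tag{1.203} $$

we find that

$$ c_n (\omega^2 - \omega_n^2) = -a_n $$

or

$$ c_n = \frac{a_n}{\omega_n^2 - \omega^2} \tag{1.204} $$

Using (1.202) for $a_n$ and substituting for $c_n$ in (1.200) we obtain the
solution

$$ u(x) = \sum_n \frac{y_n(x)}{\omega_n^2 - \omega^2} \int_0^L y_n(x')\, \rho(x')\, w(x')\, dx' \tag{1.205} $$

This may be written in the form

$$ u(x) = \int_0^L G(x, x')\, \rho(x')\, w(x')\, dx' \tag{1.206} $$

where

$$ G(x, x') = \sum_n \frac{y_n(x)\, y_n(x')}{\omega_n^2 - \omega^2} \tag{1.207} $$

G(x, x') is the “Green's function” of the problem.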
Problem 1-18: Prove that

$$ y_n(x) = (\omega_n^2 - \omega^2) \int \rho(x')\, y_n(x')\, G(x, x')\, dx' $$

Let us now enumerate some properties of the Green's function.


(a) Since $\rho(x')\, w(x')\, dx'$ is the force exerted on the element of string
(x', x' + dx'), G(x, x') is the quantity by which one must multiply that
force to get the resulting displacement at x. The total displacement is then
the sum (integral) of the contributions from the different elements of force
(a result which follows in any case from the linearity of the equation).

(b) $G(x, x_0)$ is the displacement at x due to a $\delta$-function force density
(force per unit length) at $x_0$. This follows on putting

$$ \rho(x')\, w(x') = \delta(x' - x_0) \tag{1.208} $$

in (1.206).

(c) The eigenvalues of the problem are the values at which G, con- 
sidered as a function of the eigenvalue variable, has singularities. This 
again reflects the resonance phenomenon mentioned earlier. 

This last observation may in some instances provide a means of 
obtaining the eigenvalues. For it is sometimes possible to solve directly 
the Green’s function equation 


$$ \frac{d}{dx} \left( T \frac{dG}{dx} \right) + \omega^2 \rho\, G = -\delta(x - x') \tag{1.209} $$


Problem 1-19: By calculating directly the Green's
function for the case in which $\rho$ and T are constant, determine the
eigenvalues of the problem.


At all points except x = x', G satisfies the homogeneous equation. The
$\delta$ function on the right simply provides a boundary condition at x = x'.
For suppose we integrate the equation from $x = x' - \epsilon$ to $x = x' + \epsilon$ and
then let $\epsilon \to 0$. The contribution from the second term approaches 0, and
we get

$$ T \frac{\partial G}{\partial x} \bigg|_{x'+} - T \frac{\partial G}{\partial x} \bigg|_{x'-} = -1 \tag{1.210} $$

If we were to integrate (1.209) from a fixed point, say $x_1$, to an arbitrary
point x, we would get

$$ T \frac{\partial G}{\partial x} \bigg|_{x} - T \frac{\partial G}{\partial x} \bigg|_{x_1} + \int_{x_1}^{x} \omega^2 \rho\, G\, dx'' = \begin{cases} 0, & x < x' \\ -1, & x > x' \end{cases} \tag{1.211} $$

If this is divided by T(x), and the resulting equation is integrated from
$x' - \epsilon$ to $x' + \epsilon$, it follows that as $\epsilon \to 0$, G(x, x') is continuous across the
point x = x'. This condition, along with (1.210), then provides boundary
conditions subject to which the homogeneous equation must be solved to
give G.
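For a uniform string fixed at x = 0 and x = L the two routes to G can be compared numerically. The sketch below sums the eigenfunction series (1.207) with $y_n = \sqrt{2/(\rho L)} \sin (n \pi x/L)$ and $\omega_n = n \pi \sqrt{T/\rho}/L$, and compares it with the function built from solutions of the homogeneous equation that is continuous at x = x' and has a jump of -1 in $T\, \partial G/\partial x$ there. The closed form used, $\sin \kappa x_< \sin \kappa (L - x_>) / (T \kappa \sin \kappa L)$ with $\kappa = \omega \sqrt{\rho/T}$, is worked out here under those assumptions and is not quoted from the text; the function names and parameter values are illustrative.

```python
from math import pi, sin, sqrt


def green_series(x, xp, omega, L=1.0, rho=1.0, T=1.0, terms=20000):
    """Partial sum of the eigenfunction expansion of G for a uniform string."""
    c = sqrt(T / rho)
    total = 0.0
    for n in range(1, terms + 1):
        omega_n = n * pi * c / L
        # y_n(x) * y_n(x') with y_n normalized with weight rho
        product = (2.0 / (rho * L)) * sin(n * pi * x / L) * sin(n * pi * xp / L)
        total += product / (omega_n ** 2 - omega ** 2)
    return total


def green_closed(x, xp, omega, L=1.0, rho=1.0, T=1.0):
    """Piecewise solution of the homogeneous equation matched at x = x'."""
    kappa = omega * sqrt(rho / T)
    lo, hi = min(x, xp), max(x, xp)
    return sin(kappa * lo) * sin(kappa * (L - hi)) / (T * kappa * sin(kappa * L))


g_series = green_series(0.3, 0.7, 2.0)
g_closed = green_closed(0.3, 0.7, 2.0)
print(g_series, g_closed)

# Jump of T dG/dx across x = x' (with T = 1), estimated by one-sided differences.
h = 1e-5
jump = (green_closed(0.5 + h, 0.5, 2.0) - 2 * green_closed(0.5, 0.5, 2.0)
        + green_closed(0.5 - h, 0.5, 2.0)) / h
print(jump)
```

The series converges only like 1/n², which is why many terms are needed; the closed form also makes property (c) visible, since it becomes singular wherever $\sin \kappa L = 0$.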




18. Effect of a Perturbation of 
Density 


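Let us suppose that we are able to solve the string problem for a
density distribution $\rho_0(x)$; in this section we show how we may solve
approximately in the case

$$ \rho(x) = \rho_0(x) + \lambda \rho_1(x) \tag{1.212} $$

where $\lambda \rho_1 \ll \rho_0$ everywhere. It is useful to introduce the parameter $\lambda$ as a
small constant, so that $\rho_1$ may be comparable with $\rho_0$; however, aside from
that, the choice of $\lambda$ is not important, since only $\lambda \rho_1$ is physically
significant. The string equation now takes the form

$$ \frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) = (\rho_0 + \lambda \rho_1) \frac{\partial^2 y}{\partial t^2} \tag{1.213} $$

The time variation may again be taken as periodic:

$$ y = u(x)\, e^{i \omega t} \tag{1.214} $$

Thus (1.213) becomes

$$ \frac{d}{dx} \left( T \frac{du}{dx} \right) + \omega^2 (\rho_0 + \lambda \rho_1)\, u = 0 \tag{1.215} $$

The method of solution is now the following: we expand u and $\omega^2$ as
power series in $\lambda$:

$$ u = \sum_n \lambda^n u^{(n)} \tag{1.216} $$

$$ \omega^2 = \sum_n \lambda^n \omega^{(n)2} \tag{1.217} $$

We then substitute in (1.215) and equate corresponding powers of $\lambda$:

$$ \frac{d}{dx} \left( T \frac{du^{(0)}}{dx} \right) + \rho_0\, \omega^{(0)2} u^{(0)} = 0 \tag{1.218} $$

$$ \frac{d}{dx} \left( T \frac{du^{(1)}}{dx} \right) + \rho_0\, \omega^{(0)2} u^{(1)} + \rho_0\, \omega^{(1)2} u^{(0)} + \rho_1\, \omega^{(0)2} u^{(0)} = 0 \tag{1.219} $$

etc. Since the usefulness of the perturbation method depends on getting
reasonably good results with a small number of terms of the series, we will
not go beyond Eq. (1.219).

Let the eigenvalues of (1.218) be $\omega_n^2$ and the eigenfunctions $y_n$. We
consider the perturbation of a particular solution, let us say $y_s$; that is, we
take $u^{(0)} = y_s$, $\omega^{(0)2} = \omega_s^2$.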

The last two terms of (1.219) may be taken to the right-hand side; the 




equation is then an inhomogeneous equation for $u^{(1)}$. However, we cannot
directly apply the results of the previous section, because one of the
denominators in G would be zero (that corresponding to n = s).
Nevertheless, we may proceed as before and expand

$$ u^{(1)} = \sum_n c_n y_n \tag{1.220} $$

In addition, let us write

$$ \rho_1 u^{(0)} = \rho_1 y_s = \rho_0 \sum_n b_{sn} y_n \tag{1.221} $$

$b_{sn}$ is then determined by the equation

$$ b_{sn} = \int \rho_1 y_s y_n\, dx \tag{1.222} $$

where we omit the limits of the integral throughout for convenience.


Problem 1-20: By the variational method find
corrections to $\omega$ up to the third order.


If each term in (1.219) is now expanded in terms of the functions $y_n$,
the coefficient of each $y_n$ on the left-hand side will be zero. It is necessary,
however, to treat the n = s term differently from the others. In that case
the first two terms cancel and we have simply

$$ \omega^{(1)2} = -b_{ss}\, \omega_s^2 = -\omega_s^2 \int \rho_1 y_s^2\, dx \tag{1.223} $$

This shows, in particular, that a positive perturbation decreases the
frequency, while a negative one increases it. This result might have been
surmised, of course, from the dependence of frequency on density in the
case of uniform density.

Going now to the coefficients in (1.219) of $y_n$ ($n \ne s$),

$$ (\omega_s^2 - \omega_n^2)\, c_n = -b_{sn}\, \omega_s^2 \tag{1.224} $$


the third term in this case having contributed nothing. The coefficients in
(1.220) are then determined, and to this approximation the perturbed
eigenfunction is

$$ u = y_s + \sum_{n \ne s} \frac{\omega_s^2\, y_n}{\omega_n^2 - \omega_s^2} \int \lambda \rho_1 y_s y_n\, dx \tag{1.225} $$

This is, of course, not normalized as it stands.

Problem 1-21: Solve the case in which the perturbation
is a small $\delta$-function mass $\epsilon\, \delta(x - x_1)$.


It is interesting to note that if we were to use the variational principle
to calculate the eigenvalue, and use as “trial function” simply the
unperturbed eigenfunction, we would obtain

$$ \omega^2 = \frac{\int T\, (dy_s/dx)^2\, dx}{\int (\rho_0 + \lambda \rho_1)\, y_s^2\, dx} = \frac{\omega_s^2}{1 + \int \lambda \rho_1 y_s^2\, dx} \tag{1.226} $$

which agrees to first powers in $\lambda$ with the result of the perturbation
calculation. This is consistent with the general features of the variational
method, according to which an error of order $\lambda$ in the trial eigenfunction
leads only to an error of order $\lambda^2$ in the eigenvalue. The eigenvalue is thus
given correctly to first order in $\lambda$ without using the correction terms in
(1.225). If (1.225) is used in the variational integral, the eigenvalue will, by
the same argument, be in error by terms of order $\lambda^4$; that is, it will be
correct to the third order.
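The first-order agreement can be made explicit by expanding (1.226) in powers of $\lambda$. Writing $b_{ss} = \int \rho_1 y_s^2\, dx$ as in (1.222), a sketch of the comparison with the perturbation result (1.223) is:

```latex
\omega^2_{\text{variational}}
  = \frac{\omega_s^2}{1 + \lambda b_{ss}}
  = \omega_s^2 \left( 1 - \lambda b_{ss} + \lambda^2 b_{ss}^2 - \cdots \right),
\qquad
\omega^2_{\text{perturbation}}
  = \omega^{(0)2} + \lambda\, \omega^{(1)2}
  = \omega_s^2 \left( 1 - \lambda b_{ss} \right)
```

The two expressions first differ in the term of order $\lambda^2$.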


19. The JWKB Method 


A useful approximate method, usually associated with the names of
Jeffreys, Wentzel, Kramers, and Brillouin, applies to the case of a slowly
varying density. We start from the spatial equation (1.115) obtained from
the method of separation of variables

$$ \frac{d}{dx} \left( T \frac{df}{dx} \right) + \omega^2 \rho f = 0 $$

If T and $\rho$ are constant, the solution has the form

$$ f(x) = A\, e^{\pm i \omega x / c}, \qquad c = \sqrt{T/\rho} \tag{1.227} $$

For slowly variable $\rho$, we try a solution of the form

$$ f(x) = A(x)\, e^{i S(x)} \tag{1.228} $$

Substituting in the equation and cancelling $e^{iS}$, we derive the equation

$$ A'' + 2 i S' A' + i S'' A - S'^2 A + \frac{\omega^2}{u^2} A = 0 \tag{1.229} $$

where

$$ u^2(x) = \frac{T}{\rho(x)} \tag{1.230} $$

and primes indicate derivatives with respect to x. The real and imaginary
parts of the left-hand side must be separately zero, which leads to the
equations

$$ 2 S' A' + S'' A = 0 \tag{1.231} $$

and

$$ A'' + \frac{\omega^2}{u^2} A - S'^2 A = 0 \tag{1.232} $$


If u(x) is slowly varying, the derivatives of both S and A will be small
compared with the functions themselves. In particular, in the second
equation (1.232) $A'' \ll (\omega^2/u^2) A$ and will be neglected. This equation may
then be solved to give

$$ S = \omega \int^x \frac{dx'}{u(x')} \tag{1.233} $$

The lower limit may be chosen arbitrarily, but it is convenient to take it
to be x = 0, which we assume to correspond with the left-hand end of the
string.

Equation (1.231) may be integrated to give

$$ A = \frac{\alpha}{\sqrt{S'}} = \alpha \sqrt{\frac{u}{\omega}} \tag{1.234} $$

where $\alpha$ is a constant. The approximate solution of the form (1.228) is then

$$ f(x) = \text{constant} \cdot \sqrt{u}\, \exp \left[ \pm i \omega \int_0^x \frac{dx'}{u(x')} \right] \tag{1.235} $$

This is a formal (complex) solution. The real solution satisfying the
boundary condition f = 0 at x = 0 is

$$ f(x) = \text{constant} \cdot \sqrt{u}\, \sin \left[ \omega \int_0^x \frac{dx'}{u(x')} \right] \tag{1.236} $$

If the other fixed end of the string is at x = L, the eigenvalue $\omega$ is
determined by

$$ \omega \int_0^L \frac{dx'}{u(x')} = n \pi \tag{1.237} $$
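The integration of (1.231) that leads to the amplitude (1.234) can be spelled out in one line. The sketch below multiplies (1.231) by A, recognizes a total derivative, and then uses $S' = \omega/u$, which is what (1.232) gives once $A''$ is neglected; the constant is written as $\alpha^2$ purely for convenience.

```latex
A \left( 2 S' A' + S'' A \right) = \frac{d}{dx} \left( A^2 S' \right) = 0
\;\Longrightarrow\;
A^2 S' = \alpha^2
\;\Longrightarrow\;
A = \frac{\alpha}{\sqrt{S'}} = \alpha \sqrt{\frac{u}{\omega}}
\qquad \left( S' = \frac{\omega}{u} \right)
```

The factor $\sqrt{u}$ in the amplitude means that the string swings more widely where it is lighter.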


20. An Example 


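As an illustration of both the perturbation method and JWKB,
consider the problem of a string of density

$$ \rho = \rho_0 + \epsilon \sin \frac{\pi x}{L} \tag{1.238} $$

stretching from x = 0 to x = L. $\rho_0$ and $\epsilon$ are taken as constants. If $\epsilon$ is
small, the conditions for the validity of each approximation are satisfied.

The modified frequency in the perturbation method, in which
$\epsilon \sin (\pi x/L)$ is taken to be the perturbation, is given by (1.223):

$$ \omega^2 = \omega_s^2 \left[ 1 - \int_0^L \rho_1 y_s^2\, dx \right] $$

where $\omega_s$ is the unperturbed frequency. Evaluating the integral with the
normalized eigenfunctions $y_s = \sqrt{2/(\rho_0 L)}\, \sin (s \pi x/L)$, it is
found that

$$ \omega^2 = \omega_s^2 \left[ 1 - \frac{2 \epsilon}{\pi \rho_0} \left( 1 + \frac{1}{4 s^2 - 1} \right) \right] \tag{1.239} $$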


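To determine the answer by the JWKB method, it is necessary to evaluate
the integral

$$ \int_0^L \sqrt{\frac{\rho(x')}{T}}\, dx' = \frac{1}{c} \int_0^L \sqrt{1 + \frac{\epsilon}{\rho_0} \sin \frac{\pi x'}{L}}\, dx' $$

where $c = \sqrt{T/\rho_0}$. Now, in the notation of (1.232), $A'' \ll (\omega^2/u^2) A$ only if $\epsilon \ll \rho_0$;
consequently a sufficiently good approximation is obtained on putting

$$ \sqrt{1 + \frac{\epsilon}{\rho_0} \sin \frac{\pi x'}{L}} \approx 1 + \frac{\epsilon}{2 \rho_0} \sin \frac{\pi x'}{L} $$

the integral of which is $L\, (1 + \epsilon/\rho_0 \pi)$. Thus

$$ \omega = \omega_s\, \frac{1}{1 + \epsilon/(\pi \rho_0)} \approx \omega_s \left[ 1 - \frac{\epsilon}{\pi \rho_0} \right] \tag{1.240} $$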


Problem 1-22: Calculate the neglected term using 
the approximate solution obtained by neglecting it. Determine the condi- 
tion that this is small, and thus the condition under which the approxima- 
tion is justified. 

Can you suggest how the approximation could be improved? 


In contrast with (1.239), this predicts that all frequencies change in the
same proportion. The perturbation method gives the more accurate result
in this case, since it is exact to first powers in $\epsilon$. The difference
between the two results approaches zero as the frequency becomes large.
The tendency of the JWKB method to approach the correct value for
large eigenvalues is related to the “correspondence principle” of quantum
mechanics.
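The two approximations can be set against a direct numerical solution for the density $\rho = \rho_0 + \epsilon \sin (\pi x/L)$. The sketch below finds the frequency of mode s by shooting: it integrates $T f'' + \omega^2 \rho f = 0$ from x = 0 with a Runge-Kutta step and bisects on $\omega$ until f(L) = 0. The perturbation estimate uses the first-order shift $-\omega_s^2 \int \rho_1 y_s^2\, dx$ evaluated with $y_s = \sqrt{2/(\rho_0 L)} \sin (s \pi x/L)$, and the JWKB estimate uses $\omega_s/(1 + \epsilon/\pi \rho_0)$. The value of $\epsilon$, the bracketing interval and the function names are assumptions made for illustration.

```python
from math import pi, sin, sqrt


def shoot(omega, eps, L=1.0, rho0=1.0, T=1.0, steps=1000):
    """Integrate T f'' + omega^2 rho(x) f = 0 from f(0) = 0, f'(0) = 1; return f(L)."""
    h = L / steps

    def acc(x, f):
        return -omega ** 2 * (rho0 + eps * sin(pi * x / L)) / T * f

    x, f, v = 0.0, 0.0, 1.0
    for _ in range(steps):
        k1f, k1v = v, acc(x, f)
        k2f, k2v = v + 0.5 * h * k1v, acc(x + 0.5 * h, f + 0.5 * h * k1f)
        k3f, k3v = v + 0.5 * h * k2v, acc(x + 0.5 * h, f + 0.5 * h * k2f)
        k4f, k4v = v + h * k3v, acc(x + h, f + h * k3f)
        f += h * (k1f + 2 * k2f + 2 * k3f + k4f) / 6
        v += h * (k1v + 2 * k2v + 2 * k3v + k4v) / 6
        x += h
    return f


def unperturbed(s, L=1.0, rho0=1.0, T=1.0):
    return s * pi * sqrt(T / rho0) / L


def numerical_frequency(s, eps, L=1.0, rho0=1.0, T=1.0):
    """Bisection on omega; the bracket below is assumed adequate for small eps > 0."""
    lo, hi = 0.9 * unperturbed(s, L, rho0, T), unperturbed(s, L, rho0, T)
    f_lo = shoot(lo, eps, L, rho0, T)
    for _ in range(40):
        mid = 0.5 * (lo + hi)
        f_mid = shoot(mid, eps, L, rho0, T)
        if f_lo * f_mid <= 0:
            hi = mid
        else:
            lo, f_lo = mid, f_mid
    return 0.5 * (lo + hi)


def perturbation_frequency(s, eps, L=1.0, rho0=1.0, T=1.0):
    shift = 2 * eps / (pi * rho0) * (1 + 1 / (4 * s * s - 1))
    return unperturbed(s, L, rho0, T) * sqrt(1 - shift)


def jwkb_frequency(s, eps, L=1.0, rho0=1.0, T=1.0):
    return unperturbed(s, L, rho0, T) / (1 + eps / (pi * rho0))


eps = 0.05
results = {s: (numerical_frequency(s, eps), perturbation_frequency(s, eps),
               jwkb_frequency(s, eps)) for s in (1, 5)}
for s, (num, pert, jwkb) in results.items():
    print(s, num, pert, jwkb)
```

For the lowest mode the first-order perturbation value lies closer to the numerical frequency than the JWKB value does, while the relative error of the JWKB estimate shrinks markedly by s = 5. At larger $\epsilon$ the neglected second-order terms can change this ordering, so the comparison should be read only for small $\epsilon$.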




Problem 1-23: Do exactly the case of density
$\rho'/x^2$ between x = a and x = b > a. Solve by JWKB when $(b - a)/a \ll 1$.
The normal modes are given by

$$ y_n = c_n \sin \left[ \frac{n \pi}{\ln (b/a)} \ln \left( \frac{x}{a} \right) \right] $$

and the normal frequencies are

$$ \omega_n = \frac{n \pi \sqrt{T}}{\sqrt{\rho'}\, \ln (b/a)} $$

By JWKB the normal frequencies are given correctly, but the normal
modes are determined approximately to be

$$ y_n = c_n x^{1/2} \sin \left[ \frac{n \pi}{\ln (b/a)} \ln \left( \frac{x}{a} \right) \right] $$


21. Lagrangian and Hamiltonian 
Formulations of the Vibrating 
String Problem 


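Equations (1.31) and (1.34) give formulas for the kinetic and potential
energies, respectively, of the vibrating string. From them we can
formulate the Lagrangian of the system

$$ L = K - V = \frac{1}{2} \int \rho \left( \frac{\partial y}{\partial t} \right)^2 dx - \frac{1}{2} \int T \left( \frac{\partial y}{\partial x} \right)^2 dx \tag{1.241} $$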


For systems with a finite number of degrees of freedom, the Lagrangian 
involves sums over quadratic forms in the generalized coordinates and 
their time derivatives. In the string problem, the system is continuous 
and the number of coordinates or degrees of freedom is effectively infinite. 
(This is not strictly true, of course; the string has a finite number of par- 
ticles in it. However, to describe it on that basis we would need to use 
quantum, not classical mechanics. We have somewhat idealized the prob- 
lem, by considering it as truly continuous —a reasonable assumption from 
the macroscopic viewpoint—and of negligible cross section. The conti- 
nuity of the system, and of the coordinates that describe it, are characteristic 
of this idealized string, and represent a reasonable compromise with literal 
truth dictated by our interest only in the macroscopic features of the 
system.) 

The coordinates of the problem are the displacements y(x, t). We
expect them to be functions of time, but what of the variation with x? We
see that x simply serves to enumerate the coordinates; to each value of x
there is a coordinate. Rather than summing over coordinates we must,
when they form a continuum, integrate over them.

Suppose that we wish to write down Lagrange's equations of motion.
They may be expressed formally as

$$ \frac{d}{dt} \left( \frac{\partial L}{\partial \dot{y}(x, t)} \right) - \frac{\partial L}{\partial y(x, t)} = 0 \tag{1.242} $$

or, in this case,

$$ \frac{d}{dt} \frac{\partial K}{\partial \dot{y}} + \frac{\partial V}{\partial y} = 0 \tag{1.243} $$

We must first, then, differentiate the Lagrangian with respect to
$\dot{y}(x, t) = \partial y/\partial t$. That is, we must calculate the rate of change of the
kinetic energy K with respect to a change in $\dot{y}$.

The problem is complicated, however, by the fact that, whereas, in
the case of discrete coordinates, the contribution from each coordinate
is finite, in the continuous case it is not. As a formal step around this
difficulty, it is useful to consider, not just the point x, but a small region
$(x - \delta/2,\ x + \delta/2)$ around it; $\delta$ can ultimately be made as small as we like.

The contribution to K whose rate of change we wish to consider is,
then,

$$ \tfrac{1}{2} \rho(x)\, [\dot{y}(x, t)]^2\, \delta + \text{terms of order } \delta^2 $$

The rate of change with respect to $\dot{y}$ is, to the same order,

$$ \rho\, \dot{y}(x, t)\, \delta $$

The time rate of change of this quantity, which constitutes the first term
in the Lagrangian equation of motion for the variable y(x, t), is

$$ \rho\, \delta\, \ddot{y}(x, t) = \rho\, \delta\, \frac{\partial^2 y}{\partial t^2} $$

Consider now the second term in the Lagrangian. The contribution to
V from the range $\delta$ is

$$ \frac{1}{2} \int_{x - \delta/2}^{x + \delta/2} T \left( \frac{\partial y}{\partial x'} \right)^2 dx' $$

To calculate the derivative of this with respect to y, we must change y
only within this interval; let the change be $\eta(x, t)$, which can be taken to go
to zero at the ends of the interval. The change in V is then

$$ \int_{x - \delta/2}^{x + \delta/2} T\, \frac{\partial y}{\partial x'} \frac{\partial \eta}{\partial x'}\, dx' $$




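This may be integrated by parts to give

$$ -\int_{x - \delta/2}^{x + \delta/2} \eta(x')\, \frac{\partial}{\partial x'} \left( T \frac{\partial y}{\partial x'} \right) dx' $$

where we have neglected second powers of $\eta$, since they make no
contribution to the required rate of change. If, now, the interval $\delta$ is made
vanishingly small, $(\partial/\partial x')(T\, \partial y/\partial x')$ may be treated as a constant in the
evaluation to order $\delta$, giving

$$ -\frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) \int_{x - \delta/2}^{x + \delta/2} \eta(x')\, dx' = -\frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) \bar{\eta}\, \delta $$

where $\bar{\eta}$ is the mean value of the change in y in the interval. The rate of
change of V with respect to this quantity is then

$$ -\frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) \delta $$

Cancelling out the $\delta$, we then arrive at the Lagrangian equation

$$ \rho \frac{\partial^2 y}{\partial t^2} = \frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) $$

which is, of course, the equation of motion previously derived.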

The preceding derivation represents an attempt to follow literally 
and directly the usual procedures for deriving equations of motion from a 
Lagrangian. It is a procedure singularly lacking in elegance, whatever its 
virtues in directness. The job may, however, be done more painlessly if a 
different starting point is used. It is shown in texts on analytic mechanics 
[see, for example, Goldstein, Classical Mechanics, Chapter 2, p. 30.] that 
Hamilton’s variational principle 


$$ \delta \int_{t_1}^{t_2} L\, dt = 0 \tag{1.244} $$


is precisely equivalent to the set of Lagrange’s equations for conservative 
systems. Equation (1.244) envisages “paths’’ in coordinate space, pre- 
scribed by using time as a parameter, all paths going through the same end 
points. The “true” path, the one corresponding to the actual history of the 
physical system, is that which minimizes the integral in (1.244). 

There is no special difficulty in formulating the dynamics of our 
continuous system in this way. For the string problem, the required 
variational principle is 


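$$ \delta \int_{t_1}^{t_2} dt \int_{x_1}^{x_2} \left[ \frac{1}{2} \rho \left( \frac{\partial y}{\partial t} \right)^2 - \frac{1}{2} T \left( \frac{\partial y}{\partial x} \right)^2 \right] dx = 0 \tag{1.245} $$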




The variational calculation parallels rather closely that in the section,
earlier in this chapter, on the Rayleigh-Ritz method; we replace y by
$y + \epsilon \phi$ in the integral, and differentiate with respect to $\epsilon$ at $\epsilon = 0$. This
leads to the equation

$$ \int_{t_1}^{t_2} dt \int_{x_1}^{x_2} \left[ \rho\, \frac{\partial y}{\partial t} \frac{\partial \phi}{\partial t} - T\, \frac{\partial y}{\partial x} \frac{\partial \phi}{\partial x} \right] dx = 0 \tag{1.246} $$

Since (1.246) must be satisfied for arbitrary values of $\phi$, we integrate by
parts (the first term with respect to t, the second with respect to x) so as
to obtain $\phi$ as a common factor of the integrand. The integrated term goes
out in the t-integration by virtue of the formulation of Hamilton's
Principle; that in the x-integration by virtue of the boundary conditions of the
problem. Thus, (1.246) becomes

$$ \int_{t_1}^{t_2} dt \int_{x_1}^{x_2} \phi \left[ -\rho \frac{\partial^2 y}{\partial t^2} + \frac{\partial}{\partial x} \left( T \frac{\partial y}{\partial x} \right) \right] dx = 0 \tag{1.247} $$

The expression in the square bracket must then vanish for all values of x,
and this leads to the Lagrangian equations of motion.

Thus, the equation for the vibrating string, in its usual form, is a
formulation of the Lagrangian equations when the point-by-point displacements
y(x, t) are taken as generalized coordinates.

This calculation may be related to the previous one in the following
way: If we choose for $\phi$ a $\delta$ function

$$\phi = \delta(x - x') \tag{1.248}$$

then we pick out simply the equation of motion for the coordinate $x'$; we
vary only this coordinate. This is precisely what we did, rather awkwardly,
in our direct calculation.

Finally, let us consider a different set of generalized coordinates for 
the problem. The Sturm—Liouville equation associated with the string 
problem was given in Eq. (1.115): 


$$\frac{d}{dx}\left(T \frac{dy}{dx}\right) + \lambda \rho y = 0$$

where $\lambda = \omega^2$, the square of the circular frequency. If the eigenvalues are
$\omega_n^2$ and the eigenfunctions $y_n(x)$, we have shown that any function
satisfying the boundary conditions of the problem can be expanded in a series
in the eigenfunctions

$$f(x) = \sum_n a_n y_n(x)$$

In particular, then, the displacement $y(x, t)$ can be so expanded, the
coefficients (amplitudes) of the various modes being functions of $t$:

$$y(x, t) = \sum_n c_n(t)\, y_n(x) \tag{1.249}$$




The quantities $c_n(t)$ also serve as generalized coordinates for the
problem; they determine the motion completely. If the expansion (1.249) is
substituted into the Lagrangian as given in (1.241), the Lagrangian may be
expressed in terms of the $c_n$'s. It is immediately evident, by virtue of the
orthogonality of the $y_n$'s, that is, the fact that

$$\int_{x_1}^{x_2} \rho\, y_n y_m\, dx = 0, \qquad n \neq m$$

that the kinetic energy may be written (the eigenfunctions being normalized
so that $\int \rho\, y_n^2\, dx = 1$)

$$K = \frac{1}{2} \sum_n \dot{c}_n^2 \tag{1.250}$$


This follows the customary pattern, that the kinetic energy is a bilinear 
combination of the generalized velocities. 

Turning to the potential energy, the situation is at first sight more 
complicated. Suppose, however, that we integrate V by parts, and make 
use of the boundary condition that the displacement y vanishes at the 
ends of the interval. V may then be written 


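$$V = -\frac{1}{2} \int_{x_1}^{x_2} y\, \frac{\partial}{\partial x}\left(T \frac{\partial y}{\partial x}\right) dx \tag{1.251}$$

$x_1$ and $x_2$ being, as usual, the coordinates of the ends of the string.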
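Problem 1-24: If, in the derivation of the string equation, we had kept
the next term in the expansion of the tensional force in terms of
$\partial y/\partial x$, we should have obtained the equation

$$\frac{\partial^2 y}{\partial x^2}\left[1 - \frac{3}{2}\left(\frac{\partial y}{\partial x}\right)^2\right] = \frac{1}{c^2} \frac{\partial^2 y}{\partial t^2}$$

for the case of constant density and tension. Show that, to this
approximation, the potential energy is

$$V = \frac{1}{2} T \int_0^L \left(\frac{\partial y}{\partial x}\right)^2 \left[1 - \frac{1}{4}\left(\frac{\partial y}{\partial x}\right)^2\right] dx$$
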
when the string stretches from x = 0 to x = L. Show that, in terms of the 
amplitudes of the normal modes, the Lagrangian becomes 


L= 4 > le? = Wn’ Cn’ | —% > A mnarsCmCnCr©s 
n mnrs 


where 




L 
A mars = P [ Yn Yn tO dx 
__ _ rst 
aT ee 
+8(m—n—r—s)+6(m—n+r—s)+8(m—n—-rts) 
—8(m+nt+rt+s)—8(m+n—r—s)—8(m+n+r—s) 
—8(m+n—rts)] 


[86(m—n4+rts) 


the $\delta$ being 1 when its argument is zero and zero otherwise.


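Let us now substitute in (1.251) the eigenfunction expansion of y.
Making use of the eigenvalue equation

$$\frac{\partial}{\partial x}\left(T \frac{\partial y_n}{\partial x}\right) = -\omega_n^2 \rho\, y_n \tag{1.252}$$

and once again using orthogonality, we find that V can be written in terms
of the $c_n(t)$'s:

$$V = \frac{1}{2} \sum_n \omega_n^2 c_n^2 \tag{1.253}$$

Combining with (1.250) gives

$$L = K - V = \frac{1}{2} \sum_n \left[\dot{c}_n^2(t) - \omega_n^2 c_n^2(t)\right] \tag{1.254}$$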


Thus, using as dynamical coordinates the amplitudes of the normal 
modes of the motion, we obtain the Lagrangian in a particularly simple 
form in which there is no coupling between the coordinates in the equa- 
tions of motion. The Lagrangian is, in fact, simply that of a system of an 
infinite number of isolated, noninteracting, harmonic oscillators. 
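
The diagonal forms (1.250) and (1.253) can be checked numerically. The sketch below assumes, purely for illustration, a uniform string with $\rho = T = 1$ stretched between fixed ends at $x = 0$ and $x = 1$, for which the normalized eigenfunctions are $y_n = \sqrt{2}\sin n\pi x$ with $\omega_n = n\pi$. The amplitudes and velocities are arbitrary made-up numbers, and `integrate` is a small helper written for this example.

```python
import numpy as np

def integrate(f, x):
    """Trapezoidal rule along the last axis."""
    return np.sum(0.5 * (f[..., 1:] + f[..., :-1]) * np.diff(x), axis=-1)

# Assumed system: rho = T = 1 on 0 <= x <= 1 with fixed ends.
x = np.linspace(0.0, 1.0, 20001)
n = np.arange(1, 6)
omega = n * np.pi
y_n = np.sqrt(2.0) * np.sin(np.outer(omega, x))                     # normalized modes
dy_n = np.sqrt(2.0) * omega[:, None] * np.cos(np.outer(omega, x))   # their x-derivatives

c = np.array([0.3, -0.2, 0.1, 0.05, -0.4])      # illustrative amplitudes c_n
c_dot = np.array([1.0, 0.5, -0.3, 0.2, 0.1])    # illustrative velocities

# Energies computed from the field y(x, t) itself ...
K_direct = 0.5 * integrate((c_dot @ y_n) ** 2, x)
V_direct = 0.5 * integrate((c @ dy_n) ** 2, x)
# ... and from the normal-mode expressions (1.250) and (1.253).
K_modes = 0.5 * np.sum(c_dot**2)
V_modes = 0.5 * np.sum(omega**2 * c**2)
print(K_direct, K_modes, V_direct, V_modes)
```

The two ways of computing each energy agree because every cross term between different modes integrates to zero.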

The peculiar simplicity of the problem when expressed in terms of 
these coordinates is worth noting, and has formed the basis for the com- 
monest method of dealing with the quantum mechanics of continuous 
systems. The quantum dynamics of a system depends on a knowledge of 
the Hamiltonian as a function of generalized coordinates and momenta. 
[In the string problem, the momenta are $p_n = \partial L/\partial \dot{c}_n = \dot{c}_n$, and the
Hamiltonian is $H = K + V = \frac{1}{2} \sum_n (p_n^2 + \omega_n^2 c_n^2)$.] Since these coordinates are
denumerable (even though infinite in number) the dynamics of the
continuous system is mathematically equivalent to that of an infinite system
of particles, in this case noninteracting. This is both a simple problem and
one that is more or less familiar.

This technique, which we have applied to a particular one-dimension- 
al continuous system, may be applied to more complicated systems; for 




example, by analyzing the normal modes of the electromagnetic field in a 
given geometry, and with a knowledge of the field energy we may reduce 
the problems of electromagnetic theory to a form similar to those of the 
dynamics of particles. This makes it evident, then, that the quantum 
mechanics of continuous fields is no more difficult than that of systems of 
particles. 

Two further remarks complete our discussion of this problem. One 
concerns the apparent discrepancy according to which our system may be 
equally well described by a continuous system of coordinates y(x, t), or a
discrete one $c_n(t)$. One has the feeling that these cannot describe the
same number of degrees of freedom (though in fact the number is in each 
case infinite). The situation seems less strange, however, when we consider 
that in the second (normal mode) description we have used the boundary 
conditions in defining our coordinates, whereas this was not the case 
when we used y(x, t). Furthermore, if the coordinates y(x, t) are taken 
a priori as completely independent of each other, we could in principle 
describe completely discontinuous motions of the string. The expansion 
in normal modes, on the other hand, requires that the function describing 
the point-by-point displacement be a reasonably well-behaved function. 

The fact is, then, that the description in terms of normal modes is, in
the mathematical sense, a more restrictive one. This is not, however, an
issue in the physical problem itself, since the boundary conditions and the
good mathematical behavior of the displacement are essential requirements
imposed on us by the physics of the system. The description in
normal modes, then, does not, in fact, put any significant restrictions on 
the problem. 

The second remark concerns the problem of the forced motion of the 
system. The problem is to determine the generalized force associated
with the coordinate $c_n$. This problem is one which is solved by a
well-known prescription: The work done in a virtual displacement is calculated
in terms of the virtual infinitesimal change in the generalized coordinates; 
the coefficients of these infinitesimal coordinate changes are the general- 
ized forces which appear as inhomogeneous terms in the Lagrangian 
equations. 

In terms of our original coordinates, we can write down this virtual
work very simply. If $F(x, t)$ is the force per unit mass on the string at $x$,
it is

$$\delta W = \int_{x_1}^{x_2} \rho(x)\, F(x, t)\, \delta y(x, t)\, dx \tag{1.255}$$

From (1.249),

$$\delta y(x, t) = \sum_n \delta c_n(t)\, y_n(x) \tag{1.256}$$




If we similarly expand $F(x, t)$,

$$F(x, t) = \sum_n F_n(t)\, y_n(x) \tag{1.257}$$

and use the orthogonality of the $y_n$'s,

$$\delta W = \sum_n F_n(t)\, \delta c_n(t) \tag{1.258}$$

The generalized forces are then the coefficients of the expansion of the
force per unit mass in terms of the eigenfunctions of the associated
Sturm–Liouville problem. The equations of forced motion are then

$$\ddot{c}_n + \omega_n^2 c_n = F_n(t) \tag{1.259}$$
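
As an illustration of (1.257), the generalized forces can be obtained by projecting the force onto the eigenfunctions. The sketch assumes the same uniform string as before ($\rho = T = 1$ on $0 \le x \le 1$, fixed ends, $y_n = \sqrt{2}\sin n\pi x$) and a made-up force profile $F = x(1 - x)$ at one instant; neither choice comes from the text.

```python
import numpy as np

def integrate(f, x):
    """Trapezoidal rule along the last axis."""
    return np.sum(0.5 * (f[..., 1:] + f[..., :-1]) * np.diff(x), axis=-1)

x = np.linspace(0.0, 1.0, 20001)
n = np.arange(1, 51)
y_n = np.sqrt(2.0) * np.sin(np.outer(n * np.pi, x))   # assumed normalized modes, rho = 1

force = x * (1.0 - x)                # hypothetical force per unit mass at a fixed time
F_n = integrate(force * y_n, x)      # F_n = integral of rho * F * y_n dx
rebuilt = F_n @ y_n                  # partial sum of the expansion (1.257)
max_error = np.max(np.abs(rebuilt - force))
print(F_n[:4], max_error)
```

Each $F_n$ then drives its own oscillator through (1.259), independently of the others. For this symmetric profile the even-numbered coefficients vanish.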


PRELUDE TO CHAPTER 2 


In a sense, this chapter represents the theoretical 
foundation on which all the rest of the book is built. 
We sketch briefly the theory of linear vector spaces, 
using the notation and terminology introduced by 
Dirac in his treatment of quantum mechanics. In this 
theory, the eigenvalue problem is represented geo- 
metrically in a multidimensional space, and the con- 
cept of orthogonality of eigenstates (normal modes) is 
given a geometrical interpretation. 

The theory is very general; its application to the 
various linear problems of mathematical physics 
depends in each case on the definition of a scalar 
product of arbitrarily chosen vectors in the space. 
These vectors may, for instance, be functions, and the 
scalar product defined in terms of these functions. 
Using the so-called “Schmidt orthogonalization pro- 
cedure,” which permits the construction of mutually 
orthogonal “vectors”? out of arbitrary independent 
ones, orthogonal polynomials are constructed using 
different definitions of orthogonality. Legendre, 
Laguerre, Hermite, and Tschebycheff polynomials 
are introduced in this way. 

It is shown that in general vectors may be repre- 
sented by column (or row) matrices and operators 
by square matrices. Thus, linear differential operators 
may be put in one-to-one correspondence with matrix 
ones. The equivalence of Schrödinger wave mechanics
and Heisenberg matrix mechanics follows from such 
a correspondence. 

All vectors may be expressed as linear combina- 
tions of a set of “basis vectors,’’ a complete set of 
normalized orthogonal vectors. Transformations be- 
tween one basis and another leave the magnitudes of 
vectors unchanged. Such transformations are said to 
be “unitary.”’ The unitary transformation of operators 
is determined. 

The general theory of Hermitian operators is 
introduced. Since Hermitian operators are operators 




with real eigenvalues, and the physical quantities 
represented by those eigenvalues must be real, these 
operators play a central role in physical applications. 

In addition to having real eigenvalues, Hermitian 
operators are shown to have the property that their 
eigenvectors are mutually orthogonal (provided that 
they correspond to different eigenvalues). 

Fourier series are interpreted in vector terms, the 
vectors of the basis being the periodic expansion func- 
tions. This is in turn a particular case of the Sturm- 
Liouville problem. The differential operator of the 
problem (along with boundary conditions) determines 
a Hermitian operator. The eigenfunctions correspond 
to mutually orthogonal (basis) states. Thus, all the 
Sturm-Liouville problems occurring throughout the 
book are eigenvalue problems of Hermitian operators 
in linear vector spaces. 

A more mathematically rigorous and complete 
treatment of linear vector spaces may be found in 
books on pure mathematics. Particular attention is 
called to T. Kato’s Perturbation Theory for Linear 
Operators, especially Secs. 3 and 5 of Chapter 1 and
Secs. 1 and 3 of Chapter 5. 


REFERENCES 


Dirac, P. A. M., Principles of Quantum Mechanics, 4th ed. London: Oxford, 
Clarendon Press, 1958. 


Goertzel, G., and N. Tralli, Some Mathematical Methods of Physics. New York:
McGraw-Hill, 1960. 


Hochstadt, H., Special Functions of Mathematical Physics. New York: Holt, 
Rinehart and Winston, 1961. 


Jackson, D., Fourier Series and Orthogonal Polynomials. Carus Mathematical 
Monograph No. 6. Oberlin, Ohio: Mathematical Association of America, 
1941. 


Jackson, J. D., Mathematics for Quantum Mechanics. New York: Benjamin, 
1962. 


Kato, T., Perturbation Theory for Linear Operators. Berlin: Springer, 1966. 


Mackey, G. W., Mathematical Foundations of Quantum Mechanics. New York: 
Benjamin, 1963. 




Messiah, A., Mécanique Quantique. Paris: Dunod, 1959, Chapter 7. 


Sagan, H., Boundary and Eigenvalue Problems in Mathematical Physics. New 
York: Wiley, 1961. 


Temple, G., General Principles of Quantum Theory. New York: Barnes and 
Noble, 1934. 


LINEAR VECTOR 
SPACES 


"Don't let your simplicity be imposed on"
R. B. Sheridan, The Rivals 


1. Introduction 


In the previous chapter we have developed a theory of eigenvalue 
problems in ordinary linear differential equations of the second order. 
These differential equations are formed with linear differential operators, 
and, in conjunction with boundary conditions, determine eigenvalue prob- 
lems. Linear combinations of the eigenfunctions serve to describe a gener- 
al class of functions satisfying the boundary conditions. This class of 
functions constitutes a particular example of what is called a “linear 
vector space’’; the differential operators (with their boundary conditions) 
are operators in this vector space. 

The concept of a linear vector space is, however, much more general 
than this example. It is, in particular, at the root of the formulation of 
quantum mechanics. It is, in fact, a concept so broad that it is able to 
encompass a large part of mathematical physics. We try therefore to 
formulate it in as comprehensive a form as possible. The elaboration of 
our scheme proceeds in the following way. First, we label the entities 
out of which the vector space is to be constructed. We adapt, for this 
purpose, a notation introduced by Dirac in his book on quantum mechanics
and commonly used in quantum mechanics. These entities, accordingly,
are designated by the symbol $|\;\rangle$, which we call, following Dirac,
"ket" vectors, and, in one-to-one correspondence with these vectors,
their "conjugates" $\langle\;|$, which are called "bra" vectors.

We then proceed to define certain operations on these vectors; that is,
to specify their algebra. Some of these operations are defined, in the first
instance, for the "ket" vectors; in each case we indicate the corresponding
operations for the "bra" vectors. For simplicity, we refer in
future to the $|\psi\rangle$'s simply as "vectors," and to the $\langle\psi|$'s as their conjugate
vectors.

The algebraic postulates which follow are given an abstract formula- 
tion. Any set of entities conforming to these postulates will then consti- 
tute a particular realization of a linear vector space. It may be useful to 
consider the set of functions expansible in a series of the eigenfunctions 
of a Sturm-Liouville problem as a particular example, though, as we see, 
formally quite different realizations exist. 

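(a) We first define an operation of summation, that is, we associate
with two vectors $|\psi_1\rangle$ and $|\psi_2\rangle$ another vector which we call
$(|\psi_1\rangle + |\psi_2\rangle)$. This summation is assumed commutative:

$$|\psi_1\rangle + |\psi_2\rangle = |\psi_2\rangle + |\psi_1\rangle \tag{2.1}$$

and associative:

$$|\psi_1\rangle + (|\psi_2\rangle + |\psi_3\rangle) = (|\psi_1\rangle + |\psi_2\rangle) + |\psi_3\rangle = (|\psi_1\rangle + |\psi_3\rangle) + |\psi_2\rangle \tag{2.2}$$

Also, denoting the fact that $\langle\psi|$ is the conjugate of $|\psi\rangle$ thus:

$$\langle\psi| \leftrightarrow |\psi\rangle$$

we assume that

$$\langle\psi_1| + \langle\psi_2| \leftrightarrow |\psi_1\rangle + |\psi_2\rangle$$

The relations (2.1) and (2.2) then imply similar relations for the conjugate
vectors.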


(b) Before proceeding to the next postulate, we note that it would be
natural to define $|\psi\rangle + |\psi\rangle$ as $2|\psi\rangle$. In fact we define a "ray" subspace of
our vector space, each element of which is associated with a complex
number $a$ in association with the fixed vector $|\psi\rangle$. The vectors of this
subspace are designated $a|\psi\rangle$. The conjugate of $a|\psi\rangle$ is designated $\langle\psi|a^*$,
where $a^*$ is the complex conjugate of $a$.


These vectors are assumed to have the following properties; 
corresponding relations exist for their conjugates. 


(ii) $(a_1 + a_2)|\psi\rangle = a_1|\psi\rangle + a_2|\psi\rangle$ (2.4)
(iii) $a(b|\psi\rangle) = b(a|\psi\rangle) = ab|\psi\rangle$ (2.5)




(iv) $a|\psi\rangle = |\psi\rangle$ when $a = 1$ (2.6)

(v) $a|\psi\rangle$ will be a vector known as the "zero" vector, and will be
independent of $|\psi\rangle$, when $a = 0$:

$$0|\psi\rangle = |0\rangle \tag{2.7}$$

When the zero vector appears in a vector equation, it will often simply be
designated as 0.

We note that if we add to $|\psi\rangle$ the vector $(-1)|\psi\rangle$ we get the zero
vector. Therefore, $(-1)|\psi\rangle = -|\psi\rangle$ is called the negative of $|\psi\rangle$. It also
follows from (2.4) that the result of adding the zero vector to $|\psi\rangle$ is to
give $|\psi\rangle$ itself.


2. Vector Spaces 


Let us now say what we mean by a "vector space." Suppose we have
a set of vectors $|\psi_i\rangle$, enumerated by real numbers $i$ which may form either
a discrete or a continuous set. (In the former case the $i$'s will be the
integers 1, 2, ..., N, but with the possibility that $N \to \infty$.) Then the set
of vectors

$$\sum_i a_i |\psi_i\rangle \tag{2.8}$$

where the $a_i$'s are arbitrary complex numbers, defines a "vector space";
that is to say, the space consists of this set of vectors.

In the continuous case, the vectors are designated by $|\psi_x\rangle$, specified
by the continuous real variable $x$ ($x_1 < x < x_2$). Then the vectors of the
vector space are

$$\int_{x_1}^{x_2} a(x)\, |\psi_x\rangle\, dx \tag{2.9}$$

where $a(x)$ is an arbitrary complex function.


3. Linear Independence, 
Dimensionality, and Bases 


We say that n vectors $|\psi_i\rangle$ are "linearly independent" if there exists
no set of numbers $c_i$, not all zero, such that

$$\sum_{i=1}^{n} c_i |\psi_i\rangle = 0 \tag{2.10}$$

It is evident that a set of linearly independent vectors is sufficient to
define a vector space. If, in the set used to define the vector space in
Sec. 2.2 above, there are relations of linear dependence such as (2.10),
they may be used to eliminate some of the vectors by expressing them in
terms of the others. Clearly, all the vectors (2.8) may then be expressed
in terms of the smaller set.




The set of vectors used to define a vector space may be called a
"basis." We know, then, that a basis need only consist of linearly
independent vectors.

By following this line of argument, we can define the dimensionality
of our vector space. This is the maximum number N of linearly independent
vectors which may be chosen from among the vectors of the space.
Thus, between any set of more than N vectors $|\psi_i\rangle$ ($i = 1, 2, \ldots, n$; $n > N$)
there will exist a relation of linear dependence, that is, a relation of the
form of (2.10).

In the case of infinite-dimensional vector spaces, the space will be 
defined by what we call a complete set of basis vectors. This is a set of 
linearly independent vectors of the space such that, if any further vector 
of the space is added to the set, there will exist a relation of linear de- 
pendence connecting the vectors of the enhanced set. 
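
For vectors realized as columns of complex numbers, linear dependence can be tested by computing a matrix rank. The short sketch below uses three made-up vectors, the third of which is deliberately the sum of the first two; the choice of ordinary complex 3-component arrays as the realization is an assumption for illustration only.

```python
import numpy as np

# Three illustrative vectors (rows); the third is the sum of the first two.
vectors = np.array([
    [1, 0, 1j],
    [0, 1, 1],
    [1, 1, 1 + 1j],
], dtype=complex)

# The rank is the number of linearly independent vectors in the set.
rank = np.linalg.matrix_rank(vectors)

# A relation of the form (2.10) with coefficients that are not all zero.
coefficients = np.array([1, 1, -1], dtype=complex)
relation = coefficients @ vectors
print(rank, relation)
```

Only two of the three vectors are needed as a basis for the space they define.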


4. Scalar Products 


At this point, we introduce into our scheme the concept of the scalar
product of vectors. It is in specifying the scalar products of the vectors
that the properties of the space are truly defined, and the concept of
multiplication of a vector by a number given meaning.¹

We confine attention to vectors defined over the field of complex
numbers; that is to say, to vector spaces in which the scalar product of
any two vectors is a complex number.

It is convenient to define the scalar product of two vectors in terms of
one (ket) vector $|\psi_1\rangle$ and the conjugate (bra) of another, $\langle\psi_2|$. This product
is specified by the symbol $\langle\psi_2|\psi_1\rangle$ and is referred to as the product of $|\psi_1\rangle$
with $|\psi_2\rangle$. It need not, of course, be commutative, and in fact scalar
multiplication will be defined in such a way that

$$\langle\psi_1|\psi_2\rangle = \langle\psi_2|\psi_1\rangle^* \tag{2.11}$$

that is, so that if the vectors are interchanged in the multiplication, the
product is transformed into its complex conjugate.

It is now necessary to specify some further properties of scalar
products. It is assumed that

(a) the scalar product of $|\psi_1\rangle + |\psi_2\rangle$ with $|\psi_3\rangle$ is

$$\langle\psi_3|\psi_1\rangle + \langle\psi_3|\psi_2\rangle$$

(distributive law)


¹It is true that in certain mathematical problems in particle physics use has been made of
spaces in which only the norms (magnitudes squared) of vectors are defined (Banach spaces).
Such problems are, however, beyond the scope of this book.




(b) the product of $a|\psi_2\rangle$ with $|\psi_1\rangle$ is

$$a\langle\psi_1|\psi_2\rangle$$

It follows from (2.11) that the product of $|\psi_1\rangle$ with $a|\psi_2\rangle$ is

$$a^*\langle\psi_2|\psi_1\rangle$$

and the product of $a_1|\psi_1\rangle$ with $a_2|\psi_2\rangle$ is

$$a_2^* a_1 \langle\psi_2|\psi_1\rangle$$

The following deductions may then be made:

(c) By (2.11) the scalar product of a vector with itself is of necessity a
real number; however, we assume further that it is positive, and refer to it
as the magnitude squared of the vector. Vectors whose magnitude squared
is unity are called "normalized" or "unit" vectors. Clearly, any vector
may be normalized by dividing it by the square root of its magnitude squared.

(d) By virtue of (2.7) the scalar product of the zero vector with any
other vector is zero.
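
These postulates are satisfied by ordinary complex column vectors with $\langle\psi_2|\psi_1\rangle = \sum_k (\psi_2)_k^* (\psi_1)_k$. The sketch below takes that realization as an assumption and checks the stated properties on made-up numbers; the helper name `braket` is illustrative.

```python
import numpy as np

def braket(bra, ket):
    """<bra|ket> = sum_k conj(bra_k) * ket_k (np.vdot conjugates its first argument)."""
    return np.vdot(bra, ket)

psi1 = np.array([1 + 2j, 0.5, -1j])
psi2 = np.array([2.0, 1j, 1 - 1j])
a = 0.3 - 0.7j

# (2.11): interchanging the vectors gives the complex conjugate.
conjugate_symmetric = np.isclose(braket(psi1, psi2), np.conj(braket(psi2, psi1)))
# A number multiplying the ket comes out unchanged ...
ket_linear = np.isclose(braket(psi1, a * psi2), a * braket(psi1, psi2))
# ... while a number multiplying the vector used as the bra comes out conjugated.
bra_antilinear = np.isclose(braket(a * psi2, psi1), np.conj(a) * braket(psi2, psi1))
# The product of a vector with itself is real and positive.
magnitude_squared = braket(psi1, psi1)
print(conjugate_symmetric, ket_linear, bra_antilinear, magnitude_squared)
```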


5. Schmidt Inequality and 
Orthogonalization 


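If the scalar product of two vectors $|\psi_1\rangle$ and $|\psi_2\rangle$ is written as a
complex number $z$, it can be shown that $|z|^2 = \zeta^2$ is less than or equal to
the product of the squares of the magnitudes of $|\psi_1\rangle$ and $|\psi_2\rangle$:

$$\zeta^2 = |\langle\psi_2|\psi_1\rangle|^2 \le \langle\psi_1|\psi_1\rangle \langle\psi_2|\psi_2\rangle \tag{2.12}$$

To prove this result, we observe that the magnitude squared of the
vector

$$|\psi_1\rangle + \lambda|\psi_2\rangle$$

is greater than or equal to zero for arbitrary complex $\lambda$, i.e.,

$$\langle\psi_1|\psi_1\rangle + \lambda\langle\psi_1|\psi_2\rangle + \lambda^*\langle\psi_2|\psi_1\rangle + |\lambda|^2 \langle\psi_2|\psi_2\rangle \ge 0$$

Putting

$$\langle\psi_2|\psi_1\rangle = z = \zeta e^{i\alpha} \tag{2.13a}$$

$$\lambda = \mu e^{i\beta} \tag{2.13b}$$

this becomes

$$\mu^2 \langle\psi_2|\psi_2\rangle + 2\mu\zeta \cos(\alpha - \beta) + \langle\psi_1|\psi_1\rangle \ge 0$$

for all $\mu$ and $\beta$, or

$$\langle\psi_2|\psi_2\rangle \left[\mu + \frac{\zeta \cos(\alpha - \beta)}{\langle\psi_2|\psi_2\rangle}\right]^2 - \frac{\zeta^2 \cos^2(\alpha - \beta)}{\langle\psi_2|\psi_2\rangle} + \langle\psi_1|\psi_1\rangle \ge 0$$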




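under the same conditions. Choosing $\beta = \alpha$ and $\mu$ such that the first term
is then equal to zero, it follows that

$$\langle\psi_1|\psi_1\rangle - \frac{\zeta^2}{\langle\psi_2|\psi_2\rangle} \ge 0$$

which is Eq. (2.12).

In the case of a real vector space, that is, one in which all scalar
products are real, one has simply that

$$\langle\psi_2|\psi_1\rangle^2 \le \langle\psi_1|\psi_1\rangle \langle\psi_2|\psi_2\rangle \tag{2.14}$$

This relation is known as the Schmidt inequality. It is then possible to
define an angle between two vectors $|\psi_2\rangle$ and $|\psi_1\rangle$ by the equation

$$\cos\theta = \frac{\langle\psi_2|\psi_1\rangle}{\sqrt{\langle\psi_1|\psi_1\rangle \langle\psi_2|\psi_2\rangle}} \tag{2.15}$$

where (2.14) ensures that $|\cos\theta| \le 1$.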
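
A quick numerical check of (2.12) and (2.15) may be helpful. It again assumes the realization by ordinary column vectors; the complex vectors are drawn from a seeded random generator and the real pair is made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
psi1 = rng.normal(size=4) + 1j * rng.normal(size=4)
psi2 = rng.normal(size=4) + 1j * rng.normal(size=4)

# Inequality (2.12) for a pair of complex vectors.
lhs = abs(np.vdot(psi2, psi1)) ** 2
rhs = np.vdot(psi1, psi1).real * np.vdot(psi2, psi2).real

# Angle (2.15) for a pair of real vectors.
u = np.array([1.0, 2.0, 2.0])
v = np.array([2.0, 0.0, 1.0])
cos_theta = (v @ u) / np.sqrt((u @ u) * (v @ v))
print(lhs <= rhs, cos_theta)
```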

If $\cos\theta = 0$, that is to say, if $\langle\psi_2|\psi_1\rangle = 0$, it is then natural to say that
the vectors are orthogonal. This terminology is used also in the case of
complex vector spaces, though the concept of "angle" is not then generally
applicable.

It is easily seen that the angle between arbitrary vectors $x|a\rangle$ and
$y|b\rangle$ of two different "rays" is independent of the particular vectors (that
is, of $x$ and $y$), since these coefficients cancel from numerator and
denominator of (2.15).

We now define a method, known as the "Schmidt orthogonalization
procedure," for constructing, out of any set of linearly independent
vectors, a set whose members are normalized and mutually orthogonal. Let
the vectors be

$$|\psi_1\rangle, |\psi_2\rangle, \ldots$$

The procedure may be described as follows: We may refer to the
component of any vector $|\psi_j\rangle$ in the direction of a unit vector $|\varphi_i\rangle$ as
$|\varphi_i\rangle\langle\varphi_i|\psi_j\rangle$. If this component is subtracted from $|\psi_j\rangle$, the resulting vector,
$|\psi_j\rangle - |\varphi_i\rangle\langle\varphi_i|\psi_j\rangle$, is orthogonal to $|\varphi_i\rangle$ since its scalar product with $|\varphi_i\rangle$ is

$$\langle\varphi_i|\psi_j\rangle - \langle\varphi_i|\varphi_i\rangle\langle\varphi_i|\psi_j\rangle = 0$$

If, now, the first vector $|\psi_1\rangle$ of the set considered is normalized by dividing
it by its magnitude, we may subtract from the second vector its component
in the direction of the first, to obtain a new vector which is orthogonal to
the first. This in turn may be normalized. We may now take the third
vector, and subtract from it its components in the directions of the
first two, thus producing a vector orthogonal to each of the first two.
This procedure may be continued until a mutually orthogonal set is
arrived at.


Problem 2-1: What happens to the Schmidt pro- 
cedure if the original set of vectors are not all linearly independent? 


The above remarks may be formalized by a mathematical induction.
Let the normalized vector obtained from $|\psi_i\rangle$ by subtracting from it its
components in the directions of $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_{i-1}\rangle$ be designated as
$|\varphi_i\rangle$. Then the vector

$$|\psi_{i+1}\rangle - \sum_{j=1}^{i} |\varphi_j\rangle\langle\varphi_j|\psi_{i+1}\rangle \tag{2.16}$$

is orthogonal to the first i vectors. For if we take the scalar product with
one of them, say the kth, by virtue of the orthogonality of all the vectors
up to the ith, only the j = k term will survive from the sum, leaving

$$\langle\varphi_k|\psi_{i+1}\rangle - \langle\varphi_k|\varphi_k\rangle\langle\varphi_k|\psi_{i+1}\rangle = 0$$

The vector (2.16) may now be normalized to give $|\varphi_{i+1}\rangle$, and the procedure
repeated. In this way, orthogonal and normalized vectors may be
generated until the original set is exhausted.

This method may be used to create, from any given basis for a vector 
space, a basis of normalized, mutually orthogonal vectors. Such a basis is 
called an “orthonormal” basis. 
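
The procedure translates directly into a few lines of code. The sketch below assumes vectors realized as complex arrays and assumes the input set is linearly independent. It subtracts the components one at a time from the running remainder, which in exact arithmetic gives the same result as forming the sum in (2.16) all at once. The function name `schmidt` and the sample vectors are illustrative.

```python
import numpy as np

def schmidt(vectors):
    """Orthonormalize a list of linearly independent complex vectors, in the given order."""
    basis = []
    for v in vectors:
        w = np.asarray(v, dtype=complex).copy()
        for phi in basis:
            # Subtract the component |phi><phi|w> along each earlier unit vector.
            w = w - phi * np.vdot(phi, w)
        # Normalize by the magnitude (square root of <w|w>).
        basis.append(w / np.sqrt(np.vdot(w, w).real))
    return np.array(basis)

original = [
    [1, 1j, 0],
    [1, 0, 1],
    [0, 1, 1],
]
orthonormal = schmidt(original)
# Matrix of scalar products <phi_i|phi_j>; it should be the unit matrix.
gram = orthonormal.conj() @ orthonormal.T
print(np.round(gram, 12))
```

Passing the same vectors in a different order produces a different orthonormal set, in line with the remark on non-uniqueness that follows.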

There is, of course, nothing unique in the Schmidt procedure. If one 
operates with the original set of vectors in a different order, a different 
orthonormal set will be generated. Still further orthonormal sets will be 
produced by using, in place of the given set, an arbitrary set of linearly 
independent combinations of them. 


6. An Example of the Schmidt 
Procedure 


As an illustration of a linear vector space, and of the generation of
orthonormal bases, consider as the elements of our vector space all
polynomials with real coefficients. These may be represented in terms of the
basis

$$1, x, x^2, \ldots, x^n, \ldots$$

where the variable x has a range to be specified. Their scalar products
are determined in terms of those of the pairs of the basis vectors. The
definition of scalar products is arbitrary except for the limitation of
satisfying the postulates given above. Consider the definition

$$\langle x^m|x^n\rangle = \int_{-1}^{1} x^m x^n\, dx = \begin{cases} \dfrac{2}{m+n+1}, & m+n \text{ even} \\ 0, & m+n \text{ odd} \end{cases} \tag{2.17}$$

Thus, the scalar product of two polynomials $p_1(x)$, $p_2(x)$ is

$$\int_{-1}^{1} p_1(x)\, p_2(x)\, dx$$


With this definition of scalar product, it is now easy to verify that the 
postulates of a linear vector space are satisfied. Addition and multiplication
by a constant are the operations of ordinary algebra. It is immediately
evident that this vector space comprises two subspaces —those of even- 
and odd-ordered polynomials, respectively, all vectors of one of which 
are orthogonal to all vectors of the other. Thus, the Schmidt procedure 
can be carried out in each subspace separately. 

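Consider first the even subspace. Let the successive vectors of the
orthonormal set be designated $|\varphi_0\rangle, |\varphi_2\rangle, |\varphi_4\rangle, \ldots$, while those of the odd
subspace will be $|\varphi_1\rangle, |\varphi_3\rangle, |\varphi_5\rangle, \ldots$. The first vector of the even subspace
is

$$|\varphi_0\rangle = \frac{1}{\sqrt{2}} \tag{2.18}$$

The second is

$$N_2\left(x^2 - \frac{1}{\sqrt{2}} \cdot \frac{2}{3} \cdot \frac{1}{\sqrt{2}}\right) = N_2\left(x^2 - \frac{1}{3}\right)$$

where $N_2$ is the normalizing factor. It is determined by the condition

$$N_2^2 \left(\langle x^2|x^2\rangle - \tfrac{2}{3}\langle x^2|1\rangle + \tfrac{1}{9}\langle 1|1\rangle\right) = 1$$

or

$$N_2 = \sqrt{\frac{45}{8}} = \frac{3}{2}\sqrt{\frac{5}{2}}$$

Therefore,

$$|\varphi_2\rangle = \sqrt{\frac{5}{2}}\; \frac{3x^2 - 1}{2} \tag{2.19}$$

Problem 2-2: Show that

$$|\varphi_4\rangle = \sqrt{\frac{9}{2}}\; \frac{35x^4 - 30x^2 + 3}{8}$$

Similarly, the first odd vector is $|x\rangle$. Its magnitude squared is $\frac{2}{3}$ so that

$$|\varphi_1\rangle = \sqrt{\frac{3}{2}}\, x \tag{2.20}$$

Subtracting from $x^3$ the component in the direction of $|\varphi_1\rangle$ gives

$$x^3 - \tfrac{3}{2}\langle x^3|x\rangle x = x^3 - \tfrac{3}{5}x$$

$|\varphi_3\rangle$ is obtained by multiplying this by $N_3$ such that

$$N_3^2 \langle x^3 - \tfrac{3}{5}x \,|\, x^3 - \tfrac{3}{5}x\rangle = 1$$

This yields the value $N_3 = \frac{5}{2}\sqrt{\frac{7}{2}}$. Therefore,

$$|\varphi_3\rangle = \sqrt{\frac{7}{2}}\; \frac{5x^3 - 3x}{2} \tag{2.21}$$

Problem 2-3: Show that

$$|\varphi_5\rangle = \sqrt{\frac{11}{2}}\; \frac{63x^5 - 70x^3 + 15x}{8}$$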


The orthogonal polynomials which we have calculated here are the 
normalized Legendre polynomials. Other sets of orthogonal polynomials 
may be determined by choosing different definitions of scalar product. 
Some familiar cases are the following: 


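(a) Laguerre polynomials

$$\langle x^m|x^n\rangle = \int_0^{\infty} e^{-x} x^m x^n\, dx = (m+n)! \tag{2.22}$$

(b) Hermite polynomials

$$\langle x^m|x^n\rangle = \int_{-\infty}^{\infty} e^{-x^2/2} x^m x^n\, dx = \begin{cases} 0, & m+n \text{ odd} \\ 2^{(m+n+1)/2}\, \Gamma\!\left(\dfrac{m+n+1}{2}\right) = \dfrac{(m+n)!}{[(m+n)/2]!\; 2^{(m+n)/2}} \sqrt{2\pi}, & m+n \text{ even} \end{cases} \tag{2.23}$$

(c) Tchebycheff polynomials

$$\langle x^m|x^n\rangle = \int_{-1}^{1} \frac{x^m x^n}{\sqrt{1-x^2}}\, dx = \begin{cases} 0, & m+n \text{ odd} \\ \pi\, \dfrac{(m+n)!}{2^{m+n}\, \{[(m+n)/2]!\}^2}, & m+n \text{ even} \end{cases} \tag{2.24}$$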
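
Since each definition of scalar product is fixed by the moments $\langle x^m|x^n\rangle$, the Schmidt procedure can be carried out on coefficient vectors alone. The sketch below does this for the moments (2.17) and (2.22). Representing polynomials by coefficient arrays in ascending powers, and the function names used, are choices made for this illustration.

```python
import numpy as np
from math import factorial

def legendre_moment(m, n):
    """<x^m|x^n> from (2.17): integral over (-1, 1) with unit weight."""
    return 2.0 / (m + n + 1) if (m + n) % 2 == 0 else 0.0

def laguerre_moment(m, n):
    """<x^m|x^n> from (2.22): integral over (0, infinity) with weight exp(-x)."""
    return float(factorial(m + n))

def orthonormal_polys(moment, degree):
    """Schmidt procedure on 1, x, x^2, ...; rows are coefficients in ascending powers."""
    size = degree + 1
    gram = np.array([[moment(m, n) for n in range(size)] for m in range(size)])
    inner = lambda p, q: p @ gram @ q      # scalar product of two real polynomials
    polys = []
    for k in range(size):
        p = np.zeros(size)
        p[k] = 1.0                          # the monomial x^k
        for q in polys:
            p = p - q * inner(q, p)         # remove the component along each earlier polynomial
        polys.append(p / np.sqrt(inner(p, p)))
    return np.array(polys)

legendre = orthonormal_polys(legendre_moment, 3)
laguerre = orthonormal_polys(laguerre_moment, 2)
print(legendre)
print(laguerre)
```

With the moments (2.17) the rows reproduce (2.18) to (2.21). With the moments (2.22) the first three rows come out as $1$, $x - 1$, and $(x^2 - 4x + 2)/2$.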




These polynomials may be generated out of Sturm—Liouville equations as 
defined in the last chapter. Consider the four differential equations: 


LEGENDRE EQUATION

$$\frac{d}{dx}\left[(1 - x^2)\frac{dy}{dx}\right] + \lambda y = 0 \tag{2.25}$$

defined on the range (−1, 1), and with boundary condition that y is to be
finite at ±1. We shall see in the next chapter that we must have $\lambda = l(l+1)$,
where $l$ is an integer, and that in this case the eigenfunctions are
polynomials. Since the "weighting function" r(x) [Eq. (1.122)] is unity, these
polynomials will be mutually orthogonal in the sense of (2.17). Thus, the
Legendre polynomials generated above are the solutions of this equation.
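
The statement that the Legendre polynomials satisfy (2.25) with $\lambda = l(l+1)$ can be verified by direct substitution. The sketch below relies on NumPy's polynomial classes to supply the polynomials and the differentiation; the function name `legendre_residual` is illustrative.

```python
import numpy as np
from numpy.polynomial import Legendre, Polynomial

x = Polynomial([0.0, 1.0])   # the polynomial "x"

def legendre_residual(l):
    """Left-hand side of (2.25) with lambda = l(l+1), evaluated on P_l."""
    P = Legendre.basis(l).convert(kind=Polynomial)   # P_l as an ordinary power series
    return ((1 - x**2) * P.deriv(1)).deriv(1) + l * (l + 1) * P

# Largest coefficient of the residual polynomial for l = 0, ..., 5.
residuals = [np.max(np.abs(legendre_residual(l).coef)) for l in range(6)]
print(residuals)
```

Every residual vanishes to rounding error, so each $P_l$ is an eigenfunction with eigenvalue $l(l+1)$.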


LAGUERRE EQUATION

$$x \frac{d^2 y}{dx^2} + (1 - x)\frac{dy}{dx} + \lambda y = 0 \tag{2.26}$$

on the range ($0 < x < \infty$), with the boundary condition of finiteness at the
limits. By (1.127) this is converted into a Sturm–Liouville equation by the
"integrating factor"

$$g = \frac{1}{x} \exp\left[\int^x \frac{1 - x'}{x'}\, dx'\right] = e^{-x} \tag{2.27}$$

The equation becomes

$$\frac{d}{dx}\left[x e^{-x} \frac{dy}{dx}\right] + \lambda e^{-x} y = 0 \tag{2.28}$$

Thus, the weighting function defining orthogonality for the polynomial
solutions of this equation (which occur for $\lambda$ = positive integers $n$) is $e^{-x}$,
as in (2.22).


HERMITE EQUATION

$$\frac{d^2 y}{dx^2} - x \frac{dy}{dx} + \lambda y = 0 \tag{2.29}$$

defined for ($-\infty < x < \infty$) with the boundary condition of finiteness as
$|x| \to \infty$. Polynomial solutions arise for $\lambda$ = positive integers $n$. The
integrating factor required to make this a Sturm–Liouville equation is

$$g = \exp \int^x (-x')\, dx' = e^{-x^2/2} \tag{2.30}$$

The equation then becomes

$$\frac{d}{dx}\left[e^{-x^2/2} \frac{dy}{dx}\right] + \lambda e^{-x^2/2} y = 0 \tag{2.31}$$

The weighting function is therefore $e^{-x^2/2}$, as in (2.23).
TCHEBYCHEFF EQUATION

$$(1 - x^2)\frac{d^2 y}{dx^2} - x \frac{dy}{dx} + \lambda y = 0 \tag{2.32}$$

on the range ($-1 \le x \le 1$) with the boundary condition of finiteness at the
limits. Polynomial solutions are found for $\lambda = n^2$, where n is an integer.
The integrating factor (1.122) is

$$g(x) = \frac{1}{1 - x^2} \exp\left[-\int^x \frac{x'}{1 - x'^2}\, dx'\right] = \frac{1}{\sqrt{1 - x^2}} \tag{2.33}$$

The equation in Sturm–Liouville form is, then,

$$\frac{d}{dx}\left(\sqrt{1 - x^2}\, \frac{dy}{dx}\right) + \frac{\lambda y}{\sqrt{1 - x^2}} = 0 \tag{2.34}$$

The weighting factor in the orthogonality relation, and therefore in the
definition of scalar product (2.24) is thus $r(x) = 1/\sqrt{1 - x^2}$.
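
The orthogonality implied by this weighting factor can be checked with Gauss–Tchebycheff quadrature, which integrates polynomials against $1/\sqrt{1 - x^2}$ exactly up to a degree fixed by the number of nodes. The sketch uses NumPy's `chebgauss` and `Chebyshev` helpers, in which the polynomials carry the conventional normalization $T_n(1) = 1$; they are not normalized to unit magnitude.

```python
import numpy as np
from numpy.polynomial.chebyshev import Chebyshev, chebgauss

# 8 nodes: exact for polynomial integrands up to degree 15 with weight 1/sqrt(1 - x^2).
nodes, weights = chebgauss(8)

# Values of T_0, ..., T_3 at the quadrature nodes.
T = np.array([Chebyshev.basis(n)(nodes) for n in range(4)])

# Matrix of scalar products <T_m|T_n> under the weighting factor 1/sqrt(1 - x^2).
gram = (T * weights) @ T.T

# The moment <x|x>, integral of x^2 / sqrt(1 - x^2) over (-1, 1).
moment_xx = np.sum(weights * nodes**2)
print(np.round(gram, 12), moment_xx)
```

The off-diagonal elements vanish, and the diagonal ones are $\pi$ for $n = 0$ and $\pi/2$ otherwise.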


Problem 2-4: Calculate the Laguerre, Hermite, and 
Tchebycheff polynomials up to n = 3. 


Problem 2-5: Show that in the case of the associated
Laguerre equation

$$x \frac{d^2 y}{dx^2} + (\alpha + 1 - x)\frac{dy}{dx} + \lambda y = 0$$

the weighting function in the corresponding Sturm–Liouville equation is
$x^{\alpha} e^{-x}$. Thus, determine the definition of the scalar product of
polynomials, and calculate the polynomials up to the third order.


Problem 2-6: Show that the substitution $x = \cos z$
makes it possible to find solutions of (2.34) in the form

$$y = \cos(n \cos^{-1} x) \tag{2.35}$$


Thus, find in an alternative way the Tchebycheff polynomials up to n = 4. 




7. Matrix Representation of 
Vectors and Transformation 
of Basis 


We now show that it is possible to represent our vectors by column 
or row matrices. This representation, however, is relative to an ortho- 


normal basis; if one changes the basis, the vector is represented by a 
different matrix. 


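Suppose that the vectors $|\varphi_i\rangle$ constitute an orthonormal basis for
our vector space. Any vector in the space may then be written

    $|\psi\rangle = \sum_i a_i\,|\varphi_i\rangle$    (2.36)

Taking the scalar product with $|\varphi_j\rangle$, and using the orthogonality of the
$|\varphi\rangle$'s, we get

    $a_j = \langle\varphi_j|\psi\rangle$    (2.37)

so that

    $|\psi\rangle = \sum_i |\varphi_i\rangle\langle\varphi_i|\psi\rangle$    (2.38)

The quantities $a_i$ may be taken to be the elements of a column matrix

    $\begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_i \\ \vdots \end{pmatrix}$

which represents the vector $|\psi\rangle$ relative to the basis $|\varphi_i\rangle$.
If we consider two vectors $|\psi\rangle$ and $|\psi'\rangle$, where

    $|\psi\rangle = \sum_i a_i\,|\varphi_i\rangle$    (2.39)

and

    $|\psi'\rangle = \sum_i b_i\,|\varphi_i\rangle$    (2.40)

the scalar product of $|\psi\rangle$ with $|\psi'\rangle$ is

    $\langle\psi'|\psi\rangle = \sum_i b_i^* a_i$    (2.41)

If $\langle\psi'|$ is represented by the row matrix

    $(b_1^* \;\; b_2^* \;\; \cdots \;\; b_i^* \;\; \cdots)$

then the scalar product $\langle\psi'|\psi\rangle$ can be represented by the matrix product

    $(b_1^* \;\; b_2^* \;\; \cdots \;\; b_i^* \;\; \cdots) \begin{pmatrix} a_1 \\ a_2 \\ \vdots \\ a_i \\ \vdots \end{pmatrix}$

Conjugate vectors $\langle\psi|$ will then be represented by matrices which are the
transpose conjugates (Hermitian conjugates) of those which represent the
vectors $|\psi\rangle$. The matrix multiplication representing $\langle\psi|\psi'\rangle$ is derived
from that representing $\langle\psi'|\psi\rangle$ by taking the transpose conjugate of the
matrix product.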
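The row-times-column rule for the scalar product can be illustrated with a few lines of Python. This is only a sketch under stated assumptions: the component values and the helper names `bra` and `scalar_product` are invented for the illustration and do not come from the text.

```python
def bra(components):
    """Row matrix of the conjugate vector: complex-conjugate every component."""
    return [c.conjugate() for c in components]

def scalar_product(b, a):
    """<psi'|psi> = sum_i b_i^* a_i, the row matrix of b^* times the column matrix of a."""
    return sum(bi * ai for bi, ai in zip(bra(b), a))

a = [1 + 2j, 3 - 1j]   # illustrative components a_i of |psi>
b = [2 - 1j, 1j]       # illustrative components b_i of |psi'>

forward = scalar_product(b, a)   # <psi'|psi>
reverse = scalar_product(a, b)   # <psi|psi'>
print(forward, reverse)
```

Interchanging the two vectors replaces the product by its complex conjugate, which is what taking the transpose conjugate of the matrix product amounts to for a one-by-one result.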


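Consider a transformation from a basis $|\varphi_i\rangle$ to a basis $|\varphi'_i\rangle$. The
vectors of the second basis may be expressed in terms of those of the first:

    $|\varphi'_i\rangle = \sum_j |\varphi_j\rangle\langle\varphi_j|\varphi'_i\rangle$    (2.42)

Alternatively,

    $|\varphi_i\rangle = \sum_j |\varphi'_j\rangle\langle\varphi'_j|\varphi_i\rangle$    (2.43)

Formally,

    $\sum_j |\varphi_j\rangle\langle\varphi_j|$  and  $\sum_j |\varphi'_j\rangle\langle\varphi'_j|$

may be considered to represent the unit operator. The vector $|\psi\rangle$ may be
expressed relative to either basis:

    $|\psi\rangle = \sum_i a_i\,|\varphi_i\rangle$
    $|\psi\rangle = \sum_i a'_i\,|\varphi'_i\rangle$

Substituting in the first of these equations for $|\varphi_i\rangle$ from (2.43) we find that

    $\sum_{i,j} |\varphi'_j\rangle\langle\varphi'_j|\varphi_i\rangle\,a_i = \sum_j |\varphi'_j\rangle\,a'_j$

from which it follows, on comparing the coefficients of each $|\varphi'_j\rangle$ and
relabeling the indices, that

    $a'_i = \sum_j \langle\varphi'_i|\varphi_j\rangle\,a_j$    (2.44)

This may be considered as a matrix equation, in which the column matrix
$(a_j)$ is transformed into the column matrix $(a'_i)$ by the square matrix $S$
whose elements are

    $\langle\varphi'_i|\varphi_j\rangle$

the index i designating the row and j designating the column; thus
$\langle\varphi'_i|\varphi_j\rangle$ is the element of the ith row and the jth column of $S$.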
Using the orthogonality of the bases, it can be shown that

    $S^\dagger S = 1$    (2.45)

where $S^\dagger$ is the transpose conjugate of $S$, that is, the matrix with elements
$\langle\varphi_i|\varphi'_j\rangle$. For if we take the scalar product of (2.43) with $|\varphi_k\rangle$ we find that

    $\delta_{ik} = \sum_j \langle\varphi_k|\varphi'_j\rangle\langle\varphi'_j|\varphi_i\rangle$    (2.46)

$\delta_{ik}$ being the Kronecker delta, which is equal to unity when i = k and zero
when i ≠ k. In matrix terms, this is simply Eq. (2.45).
If Eq. (2.42) is multiplied by $|\varphi'_k\rangle$ it follows similarly that

    $S S^\dagger = 1$    (2.47)

A basis transformation of the sort considered here is known as a
unitary transformation. The transformation on the conjugate vectors is
represented by the Hermitian conjugate matrix $S^\dagger$, which is the inverse of
the matrix of the original transformation.
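As a numerical illustration of the unitarity of a basis change, the following Python sketch builds the matrix with elements $\langle\varphi'_i|\varphi_j\rangle$ for two orthonormal bases of a two-dimensional complex space. The particular bases, the test vector, and all helper names are assumptions chosen for the example, not taken from the text.

```python
import math

def inner(u, v):
    """Scalar product <u|v> = sum_k u_k^* v_k."""
    return sum(uk.conjugate() * vk for uk, vk in zip(u, v))

def matmul(A, B):
    """Plain matrix product of nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def dagger(A):
    """Transpose conjugate (Hermitian conjugate) of a matrix."""
    return [[A[j][i].conjugate() for j in range(len(A))]
            for i in range(len(A[0]))]

def is_identity(M, tol=1e-12):
    return all(abs(M[i][j] - (1 if i == j else 0)) < tol
               for i in range(len(M)) for j in range(len(M)))

r = 1 / math.sqrt(2)
old_basis = [[1 + 0j, 0j], [0j, 1 + 0j]]            # |phi_1>, |phi_2>
new_basis = [[r + 0j, r * 1j], [r + 0j, -r * 1j]]   # |phi'_1>, |phi'_2> (illustrative)

# S_ij = <phi'_i|phi_j>: row index i, column index j
S = [[inner(new_basis[i], old_basis[j]) for j in range(2)] for i in range(2)]

a = [[1 + 1j], [2 + 0j]]   # column matrix of |psi> in the old basis
a_new = matmul(S, a)       # components in the new basis, a'_i = sum_j S_ij a_j

norm_old = sum(abs(row[0]) ** 2 for row in a)
norm_new = sum(abs(row[0]) ** 2 for row in a_new)
print(is_identity(matmul(dagger(S), S)), is_identity(matmul(S, dagger(S))))
print(norm_old, norm_new)
```

Because the matrix is unitary, the squared length of the vector is the same in both representations.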


8. Linear Operators and Their 
Matrix Representations 


A linear operator $L$ on a vector $|\psi\rangle$ is an operator which produces
another vector $|\bar\psi\rangle$:

    $|\bar\psi\rangle = L|\psi\rangle$    (2.48)

and is such that if

    $|\bar\psi_1\rangle = L|\psi_1\rangle$

and

    $|\bar\psi_2\rangle = L|\psi_2\rangle$

then

    $a_1|\bar\psi_1\rangle + a_2|\bar\psi_2\rangle = L\,(a_1|\psi_1\rangle + a_2|\psi_2\rangle)$    (2.49)

Let us write the vectors in (2.48) in terms of a basis $|\varphi_i\rangle$:

    $|\psi\rangle = \sum_i a_i\,|\varphi_i\rangle$

and

    $|\bar\psi\rangle = \sum_i \bar a_i\,|\varphi_i\rangle$

Then (2.48) becomes

    $\sum_i \bar a_i\,|\varphi_i\rangle = \sum_i a_i\,L|\varphi_i\rangle$

Taking the scalar product of both sides with $|\varphi_j\rangle$ yields the equation

    $\bar a_j = \sum_i \langle\varphi_j|(L|\varphi_i\rangle)\,a_i$    (2.50)

Thus the matrix with elements $\langle\varphi_j|(L|\varphi_i\rangle)$, operating on the matrix
representing $|\psi\rangle$, produces the matrix representative of $|\bar\psi\rangle$. For brevity, the
elements of this matrix operator are sometimes designated simply $L_{ji}$.
Once again, this matrix representative of $L$ depends on the basis chosen.

This then suggests the question: What is the relation between the
matrices representing the operator in two different bases? If the matrices
representing $|\psi\rangle$, $|\bar\psi\rangle$, the basis transformation, and the linear operator $L$
are designated by $A$, $\bar A$, $S$, and $L$, respectively, (2.48) may be written

    $\bar A = L A$    (2.51)

Let the matrices representing $|\psi\rangle$ and $|\bar\psi\rangle$ relative to the transformed basis
be $A'$ and $\bar A'$. Then, by (2.44),

    $A' = S A$    (2.52)

and

    $\bar A' = S \bar A$    (2.53)

Since, from (2.45) and (2.47), $S$ has the inverse $S^{-1} = S^\dagger$, (2.52) may be
written

    $A = S^{-1} A'$    (2.54)

Multiplying (2.51) by $S$, using (2.53) on the left-hand side and substituting
for $A$ from (2.54) on the right, yields the equation

    $\bar A' = S L S^{-1} A'$    (2.55)

It follows that the matrix representing the operator relative to the new basis
is

    $L' = S L S^{-1}$    (2.56)

This transformation of the linear operator $L$ is often referred to as a
“similarity transformation.”
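A short Python check can make the similarity transformation concrete. It is a sketch only: the rotation angle, the matrix standing for the operator, and the column matrix are arbitrary illustrative choices, and a real orthogonal matrix is used for the basis change so that its inverse is simply its transpose.

```python
import math

def matmul(A, B):
    """Plain matrix product of nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(col) for col in zip(*A)]

angle = 0.3                          # illustrative rotation angle
c, s = math.cos(angle), math.sin(angle)
S = [[c, s], [-s, c]]                # real orthogonal, hence unitary
S_inv = transpose(S)                 # inverse of S equals its transpose here

L = [[2.0, 1.0], [0.0, 3.0]]         # illustrative operator matrix in the old basis
A = [[1.0], [2.0]]                   # illustrative column matrix of |psi>

A_bar = matmul(L, A)                 # Eq. (2.51) in the old basis
A_new = matmul(S, A)                 # |psi> in the new basis
A_bar_new = matmul(S, A_bar)         # transformed vector in the new basis
L_new = matmul(matmul(S, L), S_inv)  # Eq. (2.56)

check = matmul(L_new, A_new)         # should reproduce A_bar_new
print(check, A_bar_new)
```

The transformed operator acting on the transformed column matrix gives the same result as transforming the output of the original operator, and the trace of the matrix is unchanged.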


Problem 2-7: A general rotation in three dimensions
may be defined in terms of the “Euler angles” $\varphi, \theta, \psi$, introduced in
Goldstein's Classical Mechanics.² The rotation is described by the sequence of
three rotations on an original orthogonal triad: (i) rotation through angle $\varphi$
about the z axis, (ii) rotation through angle $\theta$ about the new x axis, and
(iii) rotation through angle $\psi$ about the direction of the z axis following
the two preceding rotations. The matrix for this rotation is

    $A = \begin{pmatrix}
    \cos\psi\cos\varphi - \cos\theta\sin\varphi\sin\psi & \cos\psi\sin\varphi + \cos\theta\cos\varphi\sin\psi & \sin\psi\sin\theta \\
    -\sin\psi\cos\varphi - \cos\theta\sin\varphi\cos\psi & -\sin\psi\sin\varphi + \cos\theta\cos\varphi\cos\psi & \cos\psi\sin\theta \\
    \sin\theta\sin\varphi & -\sin\theta\cos\varphi & \cos\theta
    \end{pmatrix}$

² See bibliography for Chapter 1.


(a) Show that it is unitary, that is, that $A^\dagger A = AA^\dagger = 1$.

(b) Find the components of the final vectors of the triad relative to the 
original ones, and verify their orthogonality. 

(c) Show directly that, if two vectors are orthogonal, they remain so 
under the transformation A. 

(d) Determine the transformed form of the operator defining rotation 
through angle @ about the original z axis. 
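Before attempting part (a) analytically, it can be reassuring to check the statement numerically. The sketch below assumes the entries of the Euler matrix as read from the problem statement; the function name `euler_matrix` and the sample angles are invented for the illustration. Since the matrix is real, its Hermitian conjugate is just its transpose.

```python
import math

def euler_matrix(phi, theta, psi):
    """Rotation matrix for Euler angles (phi, theta, psi), entries as in Problem 2-7."""
    cf, sf = math.cos(phi), math.sin(phi)
    ct, st = math.cos(theta), math.sin(theta)
    cp, sp = math.cos(psi), math.sin(psi)
    return [
        [cp * cf - ct * sf * sp, cp * sf + ct * cf * sp, sp * st],
        [-sp * cf - ct * sf * cp, -sp * sf + ct * cf * cp, cp * st],
        [st * sf, -st * cf, ct],
    ]

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def transpose(A):
    return [list(col) for col in zip(*A)]

A = euler_matrix(0.4, 1.1, -0.7)   # arbitrary sample angles
AtA = matmul(transpose(A), A)
AAt = matmul(A, transpose(A))

# Largest deviation of either product from the unit matrix
max_dev = max(abs(M[i][j] - (1.0 if i == j else 0.0))
              for M in (AtA, AAt) for i in range(3) for j in range(3))
print(max_dev)
```

A numerical check at one set of angles is of course not a proof; it only guards against a misread sign before the algebra is done.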


9. The Eigenvalue Problem for 
Hermitian Operators 


Suppose that an operator $L$, acting on a vector $|\psi\rangle$, produces a vector
$|\bar\psi\rangle$. We denote the fact that it operates on a vector to its right, a ket
vector, by an arrow pointing to the right:

    $|\bar\psi\rangle = \overrightarrow{L}|\psi\rangle$    (2.57)

The corresponding operator which transforms the bra vector $\langle\psi|$ into
$\langle\bar\psi|$ will be written as $\overleftarrow{L}$:

    $\langle\bar\psi| = \langle\psi|\overleftarrow{L}$    (2.58)

Consider then the quantities $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle$ and $\langle\psi_2|\overleftarrow{L}|\psi_1\rangle$.³ They are not in
general equal; one is the product of $\overrightarrow{L}|\psi_1\rangle$ with $|\psi_2\rangle$ and the other the
product of $|\psi_1\rangle$ with $\overrightarrow{L}|\psi_2\rangle$. An operator for which these quantities are
equal will be called a Hermitian operator.

Hermitian operators are of very great importance in all branches of
mathematical physics. Their importance stems from the fact that their
eigenvalues are necessarily real and, conversely, an operator for which
$\langle\psi|L|\psi\rangle$ is real for every vector $|\psi\rangle$ is necessarily Hermitian. Since, as
we shall see, many of the most important problems of mathematical physics
are eigenvalue problems, in which the eigenvalues are physically observable
quantities and must therefore be real, the theory of Hermitian operators is
of central importance.

The condition for a Hermitian operator is, then, that for any two
vectors $|\psi_1\rangle$ and $|\psi_2\rangle$

    $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = \langle\psi_2|\overleftarrow{L}|\psi_1\rangle$    (2.59)

Thus, for Hermitian operators, it will not be necessary to put an arrow
over the operator.⁴ This condition may be written in another way. By
virtue of Eq. (2.11), the right-hand side of (2.59) may be written as
$\langle\psi_1|\overrightarrow{L}|\psi_2\rangle^*$, leading to a formula in which right-hand operators $\overrightarrow{L}$ occur on
both sides of the equation:

    $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = \langle\psi_1|\overrightarrow{L}|\psi_2\rangle^*$    (2.60)

³ Note that, in the present notation, the operator $L$ in (2.50) should be written $\overrightarrow{L}$.
⁴ Which may equally well be taken to operate in either direction.


Two remarks should be made at this point concerning the matrix 
representations of Hermitian operators. First, from (2.50) and (2.60) it 
follows that Hermitian matrices (matrices of Hermitian operators) are 
their own transpose conjugates; that is, if one interchanges rows and 
columns and takes the complex conjugate of every element, they are un- 
changed. Second, from (2.59) it follows that the matrix which transforms 
the row matrices representing bra vectors is the same as that transforming 
the column matrices representing the corresponding ket vectors. In the 
first case, however, it multiplies the vector from the right, and in the 
second from the left. 

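The eigenvalue problem is the following: To find vectors $|\psi\rangle$ and
numbers $\lambda$ such that

    $L|\psi\rangle = \lambda|\psi\rangle$    (2.61)

that is, to find vectors which are transformed by the operator $L$ into
multiples of themselves, and the corresponding multiples.⁵

⁵ The vectors satisfying (2.61) will be called eigenvectors, and the corresponding
numbers $\lambda$ the eigenvalues.

We shall now prove the following three theorems:

(A) The eigenvalues of Hermitian operators are real.

(B) If $\langle\psi|\overrightarrow{L}|\psi\rangle$ is real for all vectors $|\psi\rangle$, then $L$ is Hermitian.

(C) The eigenvectors of a Hermitian operator corresponding to
different eigenvalues are orthogonal.


THEOREM A

Let $|\psi\rangle$ be an eigenvector and $\lambda$ its eigenvalue. Then

    $\overrightarrow{L}|\psi\rangle = \lambda|\psi\rangle$

The conjugate equation is

    $\langle\psi|\overleftarrow{L} = \lambda^*\langle\psi|$

Multiply the first of these equations on the left by $\langle\psi|$, and the second on
the right by $|\psi\rangle$, and subtract. The left-hand sides cancel by virtue of Eq.
(2.59). Therefore

    $(\lambda - \lambda^*)\langle\psi|\psi\rangle = 0$

If $|\psi\rangle$ is not the zero vector, $\langle\psi|\psi\rangle > 0$, so $\lambda = \lambda^*$ and $\lambda$ is real.


THEOREM B

If $\langle\psi|\overrightarrow{L}|\psi\rangle$ is always real, then by (2.11) it is equal to $\langle\psi|\overleftarrow{L}|\psi\rangle$, so
we can omit the arrow over the $L$ in quantities of this form. Now take
$|\psi\rangle = |\psi_1\rangle + |\psi_2\rangle$, where $|\psi_1\rangle$ and $|\psi_2\rangle$ are arbitrary vectors. Since
$\langle\psi_1|L|\psi_1\rangle$ and $\langle\psi_2|L|\psi_2\rangle$ are real, it follows that

    $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle + \langle\psi_1|\overrightarrow{L}|\psi_2\rangle$

is real, or

    $\mathrm{Im}\,\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = -\mathrm{Im}\,\langle\psi_1|\overrightarrow{L}|\psi_2\rangle$    (2.62)

(“Im” means “imaginary part of.”)
Consider next $|\psi\rangle = |\psi_1\rangle - i|\psi_2\rangle$. Then $\langle\psi|L|\psi\rangle$ is equal to

    $\langle\psi_1|L|\psi_1\rangle + \langle\psi_2|L|\psi_2\rangle - i\,[\langle\psi_1|\overrightarrow{L}|\psi_2\rangle - \langle\psi_2|\overrightarrow{L}|\psi_1\rangle]$

which must also be real. Thus,

    $\mathrm{Re}\,\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = \mathrm{Re}\,\langle\psi_1|\overrightarrow{L}|\psi_2\rangle$    (2.63)

where “Re” means “real part of.”
Adding to Eq. (2.63) i times Eq. (2.62) gives Eq. (2.60)

    $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = \langle\psi_1|\overrightarrow{L}|\psi_2\rangle^*$

which is the condition that $L$ is Hermitian.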


THEOREM C

Let $|\psi_1\rangle$ be an eigenvector of $L$ with eigenvalue $\lambda_1$, and $|\psi_2\rangle$ an
eigenvector with eigenvalue $\lambda_2$. These statements may be expressed by the
equations:

    $L|\psi_1\rangle = \lambda_1|\psi_1\rangle$    (2.64)

and

    $\langle\psi_2|L = \lambda_2\langle\psi_2|$    (2.65)

In (2.65) we have made use of the reality of the $\lambda$'s. If (2.64) is
multiplied on the left by $\langle\psi_2|$ and (2.65) on the right by $|\psi_1\rangle$, and the equations
are subtracted, we obtain

    $(\lambda_1 - \lambda_2)\langle\psi_2|\psi_1\rangle = 0$

If $\lambda_1$ and $\lambda_2$ are different, the second factor must be zero:

    $\langle\psi_2|\psi_1\rangle = 0$,

so that the vectors are orthogonal.
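Theorems A and C can be checked on the smallest nontrivial case, a two-by-two Hermitian matrix, for which the eigenvalues are available in closed form. The following Python sketch is an illustration under assumptions: the matrix entries and the function names are invented for the example, and the eigenvector formula used requires a nonzero off-diagonal element.

```python
import math

def inner(u, v):
    """Scalar product <u|v> = sum_k u_k^* v_k."""
    return sum(uk.conjugate() * vk for uk, vk in zip(u, v))

def apply(H, v):
    return [sum(H[i][k] * v[k] for k in range(2)) for i in range(2)]

def hermitian_eigen_2x2(H):
    """Eigenvalues and (unnormalized) eigenvectors of [[a, b], [b*, d]], b != 0."""
    a, b, d = H[0][0].real, H[0][1], H[1][1].real
    mean = (a + d) / 2
    radius = math.sqrt(((a - d) / 2) ** 2 + abs(b) ** 2)
    # For each root lam of the characteristic equation, (b, lam - a) is an eigenvector.
    return [(lam, [b, lam - a]) for lam in (mean + radius, mean - radius)]

H = [[2 + 0j, 1 - 1j], [1 + 1j, 3 + 0j]]   # illustrative Hermitian matrix
(lam1, v1), (lam2, v2) = hermitian_eigen_2x2(H)

# Residuals of L|psi> = lambda|psi> for both eigenpairs
res1 = max(abs(x - lam1 * y) for x, y in zip(apply(H, v1), v1))
res2 = max(abs(x - lam2 * y) for x, y in zip(apply(H, v2), v2))
overlap = inner(v2, v1)                    # should vanish by Theorem C
print(lam1, lam2, overlap)
```

Both eigenvalues come out real because the quantity under the square root is a sum of squares, and the two eigenvectors have zero scalar product.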

It may, of course, sometimes happen that several eigenvectors have
the same eigenvalue. This is referred to as the “case of degeneracy”;
a set of independent eigenvectors corresponding to a single eigenvalue is
called a degenerate set. In this case they need not be orthogonal.
However, if $|\psi_1\rangle, |\psi_2\rangle, \ldots, |\psi_n\rangle$ all belong to the eigenvalue $\lambda$, so does any
linear combination of them. Consequently, by means of the Schmidt
orthogonalization procedure, it is possible to construct from the given
degenerate set a set of mutually orthogonal vectors. If this is done, all the
eigenvectors of $L$ will be mutually orthogonal. We shall see that such a
choice has great convenience, and so will commonly be adopted.

It is in general assumed that the eigenvectors of a Hermitian operator
$L$ form a basis for the vector space over which the operator is defined.
In other words, any vector of the space will be expressible in terms of
these eigenvectors $|\psi_i\rangle$:

    $|\psi\rangle = \sum_i c_i\,|\psi_i\rangle$    (2.66)

We do not discuss here the conditions of validity of this assumption; it is a
purely mathematical point which the reader will find discussed elsewhere.

The orthogonality of the $|\psi_i\rangle$'s makes it possible to write (2.66) in
the form

    $|\psi\rangle = \sum_i |\psi_i\rangle\langle\psi_i|\psi\rangle$    (2.67)

It is worth noting, incidentally, that $|\varphi\rangle\langle\chi|$, where $|\varphi\rangle$ and $\langle\chi|$ are
arbitrary, is an operator, which may operate either to the right on a ket
vector or to the left on a bra vector. From (2.42) or (2.67) it may be seen
that

    $\sum_i |\psi_i\rangle\langle\psi_i|$

is the unit operator if the $|\psi_i\rangle$ form a complete set of basis vectors. It also
follows that

    $\sum_i \langle\psi|\psi_i\rangle\langle\psi_i|\psi\rangle = 1$    (2.68)

if $|\psi\rangle$ is a normalized vector. We call this relation the “completeness
relation.”


10. Another Example 


A further example of a vector space is one consisting of the totality
of functions expansible in Fourier series on the interval (−1, 1)

    $f(x) = a_0 + a_1\cos\pi x + a_2\sin\pi x + a_3\cos 2\pi x + a_4\sin 2\pi x + \cdots + a_{2n-1}\cos n\pi x + a_{2n}\sin n\pi x + \cdots$    (2.69)

The set of coefficients may be taken to be the components of a vector or
column matrix representing the function $f(x)$.

The vector space postulates are satisfied if the operation of addition
is ordinary addition, and the scalar product of $f_1$ and $f_2$ is taken as

    $(f_1, f_2) = \int_{-1}^{1} f_1^*(x)\,f_2(x)\,dx$    (2.70)


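If we consider only the space of those functions whose derivative
also has a convergent Fourier expansion, then the operator of
differentiation is a linear operator. It converts the vector

    $\begin{pmatrix} a_0 \\ a_1 \\ a_2 \\ a_3 \\ \vdots \end{pmatrix}$  into  $\begin{pmatrix} 0 \\ \pi a_2 \\ -\pi a_1 \\ 2\pi a_4 \\ \vdots \end{pmatrix}$

and is represented by the square matrix

    $\begin{pmatrix}
    0 & 0 & 0 & 0 & 0 & \cdots \\
    0 & 0 & \pi & 0 & 0 & \cdots \\
    0 & -\pi & 0 & 0 & 0 & \cdots \\
    0 & 0 & 0 & 0 & 2\pi & \cdots \\
    0 & 0 & 0 & -2\pi & 0 & \cdots \\
    \vdots & \vdots & \vdots & \vdots & \vdots & \ddots
    \end{pmatrix}$    (2.71)

which is, of course, of infinite order.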
The operator d/dx is not Hermitian, since

    $\int_{-1}^{1} f_1^*\,\frac{d}{dx} f_2\,dx = f_1^* f_2\,\Big|_{-1}^{1} - \int_{-1}^{1} \frac{df_1^*}{dx}\,f_2\,dx$

If we define the functions outside the interval (−1, 1) in such a way that
they are periodic over this interval, so that the integrated term vanishes, it
is clear that the condition for a Hermitian operator is not satisfied: the
remaining integral appears with the opposite sign. The
operator −i(d/dx) is, however, Hermitian. The determination of its
eigenvectors and eigenvalues is left as a problem.
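The matrix form of these statements can be examined on a finite truncation of the infinite matrix (2.71). The Python sketch below is illustrative only: the truncation to three harmonics and the name `derivative_matrix` are assumptions made for the example. The row and column belonging to the constant term vanish, so the fact that the constant function is not normalized to unity on (−1, 1) does not affect the check.

```python
import math

def derivative_matrix(n_harmonics):
    """Truncation of (2.71) acting on (a_0, a_1, a_2, ...), with a_{2k-1}, a_{2k}
    the coefficients of cos(k pi x) and sin(k pi x)."""
    size = 2 * n_harmonics + 1
    D = [[0.0] * size for _ in range(size)]
    for k in range(1, n_harmonics + 1):
        D[2 * k - 1][2 * k] = k * math.pi    # derivative of sin gives +k*pi*cos
        D[2 * k][2 * k - 1] = -k * math.pi   # derivative of cos gives -k*pi*sin
    return D

D = derivative_matrix(3)
size = len(D)

# d/dx: the transpose is the negative of the matrix, so it is not Hermitian
antisymmetric = all(D[i][j] == -D[j][i] for i in range(size) for j in range(size))

# -i d/dx: equal to its own transpose conjugate
M = [[-1j * D[i][j] for j in range(size)] for i in range(size)]
hermitian = all(M[i][j] == M[j][i].conjugate()
                for i in range(size) for j in range(size))

# Apply D to f(x) = sin(pi x), i.e. a_2 = 1: the result should be pi*cos(pi x)
a = [0.0, 0.0, 1.0, 0.0, 0.0, 0.0, 0.0]
da = [sum(D[i][j] * a[j] for j in range(size)) for i in range(size)]
print(antisymmetric, hermitian, da[1])
```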


Problem 2-8: For functions which, along with their 
first derivative, are expansible in a Fourier series on the interval (— 1, 1), 
show that —i(d/dx) is Hermitian and find its eigenvalues and eigen- 
vectors. 




It should be noted in passing that we have established in this
particular case a correspondence between a differential operator −i(d/dx) and
a matrix operator [−i times (2.71)]. This correspondence is of the sort
which is involved in the connection between the Schrödinger and
Heisenberg schemes of quantum mechanics (see Chapter 8).


11. Sturm-Liouville Problem and
Linear Vector Spaces⁶


In this section we interpret the problem of the Sturm—Liouville 
equation, discussed in Chapter 1, as a problem in linear vector spaces. 
The linear vector space involved is, however, a slightly special one, in 
that it will be possible to define scalar products as real quantities rather 
than complex ones. 

Consider the Sturm-Liouville equation

    $-\frac{d}{dx}\left(p\,\frac{dy}{dx}\right) + q\,y = L(y) = \lambda\,r\,y$    (2.72)

defined on the range (a, b) and subject to boundary conditions such that,
if $y_m$ and $y_n$ are two solutions

    $p\,(y_m y_n' - y_n y_m')\,\Big|_a^b = 0$    (1.133)

Then, as we have shown, $y_n$ and $y_m$ are orthogonal, in the sense that

    $\int_a^b y_m(x)\,y_n(x)\,r(x)\,dx = 0, \quad m \neq n$    (1.132)

There may be at most a twofold degeneracy (two y's for a given
eigenvalue $\lambda$); two independent solutions may then be found, using the Schmidt
procedure, which are mutually orthogonal. We may also normalize all the
eigenfunctions $y_n$:

    $\int_a^b y_n^2(x)\,r(x)\,dx = 1$    (1.137)

The vectors of our linear vector space will then be taken to be the set
of functions which may be expanded in terms of the eigenfunctions of $L$:

    $|\psi\rangle \leftrightarrow \sum_n a_n\,y_n(x) = f(x)$    (2.73)

where the $a_n$'s are taken to be real constants. The operations of addition
and multiplication by a constant can obviously be defined, and the
existence of the zero vector (all $a_n$'s = 0) verified.

As for the conjugate vectors, we may in this case define them as
identical with the vectors themselves (self-conjugate system). The scalar
product of two vectors

    $|\psi_1\rangle \leftrightarrow \sum_n a_n^{(1)}\,y_n = f_1(x)$    (2.74)

and

    $|\psi_2\rangle \leftrightarrow \sum_n a_n^{(2)}\,y_n = f_2(x)$    (2.75)

will be defined as

    $\langle\psi_1|\psi_2\rangle = \langle\psi_2|\psi_1\rangle = \int_a^b f_1(x)\,f_2(x)\,r(x)\,dx$    (2.76)

Here “$\leftrightarrow$” designates a correspondence. The vector $|\psi_1\rangle$ is the function
$f_1$; we, however, do not use the equality symbol “=” because we preempt
it to indicate relations of equality between elements designated in the
same way [either in the vector notation $|\psi\rangle$ or the conventional functional
notation $f(x)$].

⁶ For a more critical and rigorous discussion of this question the reader is referred to the
book of Kato mentioned earlier, Chapter 5, Sec. 3.6.

It follows immediately from (2.76) that the $y_n$'s constitute an
orthonormal basis

    $|\varphi_n\rangle \leftrightarrow y_n$    (2.77)

Arbitrary vectors can be represented by column matrices whose elements
are the expansion coefficients $a_n$ where

    $a_n = \int_a^b f\,y_n\,r\,dx = \langle\varphi_n|\psi\rangle$    (2.78)

Similarly, the conjugate vectors may be represented by row matrices
having the same set of components. The vectors $|\varphi_n\rangle$ correspond to
matrices with all components zero except the nth, which is unity. The
scalar product of $|\psi_1\rangle$ and $|\psi_2\rangle$ ($f_1$ and $f_2$) is

    $\langle\psi_1|\psi_2\rangle = \int_a^b f_1 f_2\,r\,dx = \sum_n a_n^{(1)} a_n^{(2)}$    (2.79)

Various kinds of linear operators on the functions f can be defined;
for example, any linear differential operator, or linear integral operator,
of which an example could be

    $I(f) = \int_a^b f(x')\,g(x - x')\,dx'$    (2.80)

where g is a suitably chosen function.

It is easily seen that the operators associated with the
Sturm-Liouville equation are Hermitian; this follows from our definition of the
set of vectors as self-conjugate. We have

    $\overrightarrow{L}|\psi\rangle \leftrightarrow \frac{1}{r}\,L(f)$    (2.81)

The conjugate vector

    $\langle\psi|\overleftarrow{L}$

also corresponds to the function $(1/r)L(f)$. Then

    $\langle\psi_2|\overrightarrow{L}|\psi_1\rangle = \int_a^b f_2\,L(f_1)\,dx$    (2.82)

and

    $\langle\psi_2|\overleftarrow{L}|\psi_1\rangle = \int_a^b L(f_2)\,f_1\,dx$    (2.83)

Now, using the form (2.72) for $L(f)$,

    $\int_a^b f_2\,L(f_1)\,dx - \int_a^b f_1\,L(f_2)\,dx = p\,(f_1 f_2' - f_2 f_1')\,\Big|_a^b$    (2.84)

Since the f's satisfy the same boundary conditions as the eigenfunctions
$y_n$ in terms of which they may be expanded, it follows from (1.133) that
this is zero. We have therefore verified the Hermitian property (2.59).

It is useful, finally, to write the Rayleigh-Ritz variational principle
(1.139) in terms of vector notation. From (2.82)

    $\langle\psi|L|\psi\rangle = \int_a^b f\,L(f)\,dx = \int_a^b f\left\{-\frac{d}{dx}\left[p(x)\frac{df}{dx}\right] + q(x)\,f\right\}dx = \int_a^b (p f'^2 + q f^2)\,dx$    (2.85)

under the condition of validity of the Rayleigh-Ritz principle, that is, that

    $p f f'\,\Big|_a^b = 0$    (1.149)

Thus, the numerator in (1.139) is $\langle\psi|L|\psi\rangle$. But the denominator is $\langle\psi|\psi\rangle$.
Thus, (1.139) becomes

    $\delta I(f) = 0$    (2.86)

where

    $I(f) = \frac{\langle\psi|L|\psi\rangle}{\langle\psi|\psi\rangle}$    (2.87)

This is equivalent, by (1.151), to

    $\delta\,[\langle\psi|L|\psi\rangle - \lambda\langle\psi|\psi\rangle] = 0$    (2.88)

If the $|\psi\rangle$'s are expanded in terms of an orthonormal basis set $|n\rangle$ (not
necessarily eigenstates of $L$),

    $|\psi\rangle = \sum_n |n\rangle\langle n|\psi\rangle = \sum_n |n\rangle\,a_n$    (2.89)

where the $a_n$'s are to be determined, (2.88) becomes

    $\delta \sum_{m,n} a_m^*\,(\langle m|L|n\rangle - \lambda\,\delta_{mn})\,a_n = 0$    (2.90)

Treating the a's as variational parameters, this leads to the equations

    $\sum_n (\langle m|L|n\rangle - \lambda\,\delta_{mn})\,a_n = 0$    (2.91)

or

    $\sum_n \langle m|L|n\rangle\,a_n = \lambda\,a_m$    (2.92)

This is, of course, simply the matrix formulation for the eigenvalue
equation for $L$ in the representation $|n\rangle$.

In summary: The Rayleigh-Ritz variational principle for the
eigenvalues (and eigenvectors) of $L$ may be written in the vector form (2.87). If
the vector is expressed in terms of an arbitrary basis, the Rayleigh-Ritz
principle leads to the matrix eigenvalue problem in which the matrix
representing $L$ has the “matrix elements” $\langle m|L|n\rangle$.

If a nonorthogonal basis $|n'\rangle$ is used, the principle takes the form

    $\sum_{n'} (\langle m'|L|n'\rangle - \lambda\langle m'|n'\rangle)\,a_{n'} = 0$    (2.93)
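The link between the variational quotient and the matrix eigenvalue problem can be seen numerically in two dimensions. In the Python sketch below the real symmetric matrix standing for $\langle m|L|n\rangle$ is an arbitrary illustrative choice, and the trial vector is parametrized by a single angle; none of these names or values come from the text.

```python
import math

L = [[2.0, 1.0], [1.0, 3.0]]   # illustrative real symmetric matrix <m|L|n>

def rayleigh_quotient(a):
    """I = <psi|L|psi> / <psi|psi> for a vector with real components a_n."""
    num = sum(a[m] * L[m][n] * a[n] for m in range(2) for n in range(2))
    den = sum(x * x for x in a)
    return num / den

# Scan normalized trial vectors (cos t, sin t) over half a turn
n_grid = 20000
values = [rayleigh_quotient([math.cos(t), math.sin(t)])
          for t in (math.pi * k / n_grid for k in range(n_grid))]

# Exact eigenvalues of this matrix: roots of lambda^2 - 5*lambda + 5 = 0
lowest_exact = (5 - math.sqrt(5)) / 2
highest_exact = (5 + math.sqrt(5)) / 2
print(min(values), lowest_exact)
print(max(values), highest_exact)
```

The stationary values of the quotient over the scan coincide with the two eigenvalues of the matrix, which is the content of (2.91) and (2.92) in this small case.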


Further Problems for Chapter 2 


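Problem 2-9: Consider the vector space in which the
vectors are the column matrices

    $|V\rangle = \begin{pmatrix} V_1 \\ V_2 \\ V_3 \end{pmatrix}$

the V's being complex numbers, and the operator

    $L_n = i \begin{pmatrix} 0 & -n_3 & n_2 \\ n_3 & 0 & -n_1 \\ -n_2 & n_1 & 0 \end{pmatrix}$

where $n_1^2 + n_2^2 + n_3^2 = 1$.

(a) Show that $L_n$ is Hermitian.

(b) Show that its eigenvalues are ±1 and 0, and that the
corresponding eigenvectors are

    $\frac{1}{\sqrt{2}} \begin{pmatrix} p_1 \pm i q_1 \\ p_2 \pm i q_2 \\ p_3 \pm i q_3 \end{pmatrix}$  and  $\begin{pmatrix} n_1 \\ n_2 \\ n_3 \end{pmatrix}$,  respectively,

where

    $\begin{pmatrix} p_1 \\ p_2 \\ p_3 \end{pmatrix}$  and  $\begin{pmatrix} q_1 \\ q_2 \\ q_3 \end{pmatrix}$

are unit vectors orthogonal to

    $\begin{pmatrix} n_1 \\ n_2 \\ n_3 \end{pmatrix}$

and to each other.

(c) Show directly that $L_n^3 = L_n$, and that this permits an alternative
derivation of the eigenvalues.

(d) Show that

    $S = \begin{pmatrix} p_1 & p_2 & p_3 \\ q_1 & q_2 & q_3 \\ n_1 & n_2 & n_3 \end{pmatrix}$

is a unitary matrix. Determine the transform of the matrix $L_n$, namely,
$S L_n S^{-1}$. By transformation, determine its eigenvectors.

(e) Find a unitary matrix which transforms $L_n$ to diagonal form.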


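Problem 2-10: Consider the vector space in which
the vectors are the two-dimensional complex matrices $|s\rangle = \begin{pmatrix} s_1 \\ s_2 \end{pmatrix}$, and the
operator

    $\sigma_n = \begin{pmatrix} n_3 & n_1 - i n_2 \\ n_1 + i n_2 & -n_3 \end{pmatrix}$

where $n_1^2 + n_2^2 + n_3^2 = 1$.

(a) Show that $\sigma_n$ is Hermitian.

(b) Show that the eigenvalues are ±1, and that if the n's are written
in polar coordinates $n_1 = \sin\theta\cos\varphi$, $n_2 = \sin\theta\sin\varphi$, $n_3 = \cos\theta$, the
eigenvectors are

    $\begin{pmatrix} \cos(\theta/2)\,e^{-i\varphi/2} \\ \sin(\theta/2)\,e^{i\varphi/2} \end{pmatrix}$  and  $\begin{pmatrix} \sin(\theta/2)\,e^{-i\varphi/2} \\ -\cos(\theta/2)\,e^{i\varphi/2} \end{pmatrix}$

respectively.

(c) Determine the eigenvalues by verifying that $\sigma_n^2 = 1$.

(d) Show that

    $S = \begin{pmatrix} \cos(\theta/2)\,e^{i\varphi/2} & \sin(\theta/2)\,e^{-i\varphi/2} \\ \sin(\theta/2)\,e^{i\varphi/2} & -\cos(\theta/2)\,e^{-i\varphi/2} \end{pmatrix}$

is a unitary matrix. Use it to find the transform $S\sigma_n S^{-1}$ of the matrix $\sigma_n$.
Solve for the eigenvectors of the transformed matrix, and from them derive
the eigenvectors of $\sigma_n$.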
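A numerical spot check of the statements in Problem 2-10 can be made before working them out by hand. The following Python sketch assumes the matrix and eigenvectors as read from the problem statement; the sample angles and the names `sigma_n`, `up`, and `down` are hypothetical choices for the illustration.

```python
import cmath
import math

def sigma_n(theta, phi):
    """The operator of Problem 2-10 for a unit vector n given in polar angles."""
    n1 = math.sin(theta) * math.cos(phi)
    n2 = math.sin(theta) * math.sin(phi)
    n3 = math.cos(theta)
    return [[n3, n1 - 1j * n2], [n1 + 1j * n2, -n3]]

def apply(M, v):
    return [sum(M[i][k] * v[k] for k in range(2)) for i in range(2)]

theta, phi = 0.7, 1.9   # arbitrary sample angles
sigma = sigma_n(theta, phi)

up = [math.cos(theta / 2) * cmath.exp(-0.5j * phi),
      math.sin(theta / 2) * cmath.exp(0.5j * phi)]
down = [math.sin(theta / 2) * cmath.exp(-0.5j * phi),
        -math.cos(theta / 2) * cmath.exp(0.5j * phi)]

# Residuals of sigma|s> = +|s> and sigma|s> = -|s>
res_up = max(abs(x - y) for x, y in zip(apply(sigma, up), up))
res_down = max(abs(x + y) for x, y in zip(apply(sigma, down), down))

# sigma applied twice should return the original vector
twice = apply(sigma, apply(sigma, [1 + 0j, 2 - 1j]))
res_square = max(abs(x - y) for x, y in zip(twice, [1 + 0j, 2 - 1j]))
print(res_up, res_down, res_square)
```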


Problem 2-11: Representing the fourth-degree
polynomial

    $a_0 + a_1 x + a_2 x^2 + a_3 x^3 + a_4 x^4$

by the column vector whose components are the coefficients, show that
the operator

    $I\,|x^n\rangle = \int_{-\infty}^{\infty} x'^{\,n}\,e^{-(x - x')^2}\,dx'$

is a linear operator in the vector space. Find its matrix form.


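Problem 2-12: Find the eigenvalues and normalized
eigenvectors of the matrices

    $\begin{pmatrix} 0 & 0 & 0 & 1 \\ 0 & 0 & 1 & 0 \\ 0 & 1 & 0 & 0 \\ 1 & 0 & 0 & 0 \end{pmatrix}$

and

    $\begin{pmatrix}
    b & a & 0 & 0 & 0 & a \\
    a & b & a & 0 & 0 & 0 \\
    0 & a & b & a & 0 & 0 \\
    0 & 0 & a & b & a & 0 \\
    0 & 0 & 0 & a & b & a \\
    a & 0 & 0 & 0 & a & b
    \end{pmatrix}$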


Problem 2-13: Without calculating the eigenvalues, 
find for the matrices in the preceding problem 


(a) the sum of the eigenvalues 
(b) the sum of their squares. 


PRELUDE TO CHAPTER 3 


This chapter is devoted to potential problems, in which
the potential satisfies either the Laplace equation
($\nabla^2\phi = 0$) or the Poisson (inhomogeneous) equation
$\nabla^2\phi = -\rho$, where $\rho$ represents a source. The problem to
which most attention is given is that of electrostatic 
potentials. However, the same equation occurs in 
magnetostatics, the theory of steady-state heat con- 
duction, irrotational fluid dynamics, or steady current 
flow in electrodynamics. 

We first show that irrotational vector fields can be
represented as the gradient of a scalar potential. This is then true
of the electrostatic field. In electrostatics, by virtue of 
the field equation (Gauss’ law) the potential satisfies in 
general Poisson’s equation. 

However, in the magnetostatic problem a similar
equation is arrived at in a different way. Since magnetic
fields are solenoidal ($\nabla\cdot\mathbf{B} = 0$) it is shown that they
can be written as the curl of a vector potential. But
since the curl of a static magnetic field is
proportional to a steady current, it follows that the vector
potential also satisfies an equation of Poisson type, the source term
now involving the current.

Taking first the case of the homogeneous (Laplace)
equation which holds away from sources, we use the
method of separation of variables to solve the equation
in spherical polar coordinates. (It is assumed that the
reader knows the form of the vector operators and the
Laplacian in arbitrary orthogonal curvilinear
coordinate systems; they are derived in all books on
elementary vector analysis.) The angular functions are
the so-called “spherical harmonics,” which in turn
depend on the polar angle $\theta$ through associated
Legendre polynomials. These functions are determined from
their separated differential equations by the
“factorization method” first introduced by Schrödinger and later
elaborated by L. Infeld and T. Hull.¹ This method is
used subsequently in certain quantum problems in
Chapter 9 (harmonic oscillator, hydrogen atom).

¹ The Factorization Method, Rev. Mod. Phys. 23, 21 (1951).

Making use of a physical argument involving the 
potential of a charge on the polar axis we find an alter- 
native way to generate the axially symmetric Legendre 
polynomials, without solving a differential equation. 
What is obtained is in fact a generating function for 
these polynomials. It is used to derive some useful 
recursion formulas for them. 

Returning to the same generating function and 
expressing it in terms of Cartesian coordinates, it is 
shown that the Legendre polynomials can be expressed 
in terms of derivatives of the potential of a point 
source at the origin, namely, 1/r. The derivatives of 
various orders may be interpreted as the potentials of 
electrostatic multipoles. It follows that the expansion
of any potential in spherical harmonics is a multipole 
expansion, the coefficients being essentially multipole 
moments of the distribution. 

Non-axially-symmetric multipoles are shown to 
be generated by appropriate derivatives of 1/r with 
respect to all three Cartesian coordinates. By comparing
the $r$ and $\varphi$ dependence of these solutions with
those obtained by separation of variables, we are able 
to express associated Legendre functions also in terms 
of such derivatives. Again, certain useful properties 
of these functions may be derived. In particular, an 
essential addition theorem for spherical harmonics, 
which is needed in Chapter 5 in the development of 
radiation theory, is derived. 

Electrostatic problems are of two types. In one, 
charge distributions are given, and the problem is to 
calculate the resulting potentials. These problems may 
be solved by a Green’s function method, in which the 
potential is considered to be the sum (integral) of those 
due to the point sources comprising the distribution. 

A particular case which is easy to solve is that in 
which there is axial symmetry. In this case, it is shown
how the potential at any point may be determined if 
that on the symmetry axis is known. 

The second type of problem involves boundary
conditions (for example, on conducting surfaces);
charge distributions will in general exist on the
boundaries to enable these conditions to be satisfied, but
are not known a priori. In certain cases (for example,
plane surfaces, spherical surfaces), methods of
“images” may be evolved; these are fictitious charges
which may be seen by symmetry to ensure the
satisfaction of the boundary conditions. It is shown how to
calculate the charge distributions on these surfaces
once the potentials are known.

We next derive relations, applicable to any linear 
problem (and therefore applicable also in wave, 
diffusion, heat conduction problems, and so forth) 
between the Green’s functions in different geometries. 
These connect the potentials of unit point source, unit 
plane source, and unit spherical shell source. 

The chapter ends with a number of worked prob- 
lems. There is an appendix on a recursion relation for 
associated Legendre functions, and another on linear 
differential equations of the second order. The latter 
appendix shows how to derive the Green’s function 
for any such equation, and thus, how to solve the 
inhomogeneous equation knowing how to solve the 
homogeneous one. 


REFERENCES 


Coulson, C. A., Electricity. Edinburgh: Oliver and Boyd, 1951. 


Jackson, J. D., Classical Electrodynamics. New York: Wiley, 1962.


MacRobert, T. M., Spherical Harmonics. New York: Dover, 1947.


Sagan, H., Boundary and Eigenvalue Problems in Mathematical Physics. New 
York: Wiley, 1961, Chapter 8. 


Sommerfeld, A., Partial Differential Equations in Physics. New York: Academic,
1949, Chapter 4.


3


THE POTENTIAL 
EQUATION 


“The diversity of physical arguments and opinions
embraces all sorts of methods”


de Montaigne 


1. Introduction; Electrostatic 
Potential 


The potential equation makes its appearance in physics whenever
one has fields $\mathbf{F}(\mathbf{r})$ which, in some portion of space at least, are both
source-free ($\nabla\cdot\mathbf{F} = 0$) and irrotational ($\nabla\times\mathbf{F} = 0$) ($\mathbf{r}$ is the position
vector of an arbitrary point). Any irrotational field can be expressed as the
gradient of a potential

    $\mathbf{F}(\mathbf{r}) = -\nabla\phi(\mathbf{r})$    (3.1)

where

    $\phi(\mathbf{r}) = -\int_{\mathbf{r}_0}^{\mathbf{r}} \mathbf{F}\cdot d\mathbf{s}$    (3.2)

the integral being a line integral along an arbitrary path from an arbitrary
base point $\mathbf{r}_0$.

The irrotational feature ensures that $\phi$ depends only on the point $\mathbf{r}$,
and not on the path of integration. For if c and c' are two contours
between $\mathbf{r}_0$ and $\mathbf{r}$

    $\int_c \mathbf{F}\cdot d\mathbf{s} - \int_{c'} \mathbf{F}\cdot d\mathbf{s}$

is the line integral of $\mathbf{F}$ around a closed contour. By Stokes' theorem, this
is equal to the surface integral

    $\int \nabla\times\mathbf{F}\cdot\mathbf{n}\,dS$

over any surface bounded by the contour. $\mathbf{n}$ is the unit normal vector to
the surface. Since $\mathbf{F}$ is irrotational, this is zero, and therefore the two
line integrals are equal.

[Figure 3.1]
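Path independence is easy to observe numerically. The Python sketch below uses a hypothetical two-dimensional potential, $\phi = x^2 y$, chosen only for illustration, forms the field as minus its gradient, and integrates along two different paths from (0, 0) to (1, 1) with a simple midpoint rule; the function names are likewise assumptions of the example.

```python
def field(x, y):
    """F = -grad(phi) for the illustrative potential phi = x^2 * y."""
    return (-2.0 * x * y, -x * x)

def line_integral(path, n_steps=2000):
    """Midpoint-rule estimate of the integral of F . ds along path(t), 0 <= t <= 1."""
    total = 0.0
    for k in range(n_steps):
        x0, y0 = path(k / n_steps)
        x1, y1 = path((k + 1) / n_steps)
        fx, fy = field(0.5 * (x0 + x1), 0.5 * (y0 + y1))
        total += fx * (x1 - x0) + fy * (y1 - y0)
    return total

def straight(t):
    """Straight line from (0, 0) to (1, 1)."""
    return (t, t)

def corner(t):
    """Along the x axis to (1, 0), then parallel to the y axis up to (1, 1)."""
    return (2 * t, 0.0) if t < 0.5 else (1.0, 2 * t - 1.0)

I_straight = line_integral(straight)
I_corner = line_integral(corner)
print(I_straight, I_corner)
```

Both estimates approach minus the change in the potential between the end points, here −1, regardless of the route taken.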

If we substitute from (3.1) into the equation $\nabla\cdot\mathbf{F} = 0$ we obtain the
potential equation

    $\nabla^2\phi = 0$    (3.3)

This is the “Laplace equation.”

It should be noted that, although we started with the problem of 
determining a vector field, it has now been reduced to one of solving for a 
scalar field. 

It is, of course, possible to introduce the potential for any irrotational
field, even when sources are present. Since the divergence of a vector
is its efflux per unit volume, a source (of flux), which may be described by
a density function $\rho(\mathbf{r})$, appears on the right-hand side of the divergence
equation, so that one has

    $\nabla\cdot\mathbf{F} = \rho$    (3.4)

in which case the potential satisfies the inhomogeneous equation

    $\nabla^2\phi = -\rho$    (3.5)

This is known as “Poisson's equation.” It is generally true in physics
that if one can solve the homogeneous (source-free) problem, one can
construct from it the solution of the inhomogeneous one. We will see
that this is the case with potential problems.

There is another type of problem, in which the Laplace or Poisson
equation is satisfied by a vector field. In this problem the sources are
sources of rotation (as in the problem of vortices in hydrodynamics), but
the field is everywhere solenoidal ($\nabla\cdot\mathbf{F} = 0$). It is another standard
theorem of vector calculus that a solenoidal field can always be expressed
as the curl of another vector, which we call the vector potential:

    $\mathbf{F} = \nabla\times\mathbf{A}$    (3.6)

Designating the density of the sources of circulation by the vector $\mathbf{J}$:

    $\nabla\times\mathbf{F} = \mathbf{J}$    (3.7)

If (3.6) is substituted into (3.7), an equation is obtained for $\mathbf{A}$

    $\nabla\times(\nabla\times\mathbf{A}) = \mathbf{J}$    (3.8)

Using a familiar vector identity on the left-hand side this becomes

    $\nabla(\nabla\cdot\mathbf{A}) - \nabla^2\mathbf{A} = \mathbf{J}$    (3.9)

Now $\mathbf{A}$ is not uniquely determined by (3.6) when $\mathbf{F}$ is known. In
fact,

    $\mathbf{A}' = \mathbf{A} + \nabla\chi$    (3.10)

represents the same field, since the curl of a gradient vanishes. But it is
always possible to choose $\chi$ so that $\nabla\cdot\mathbf{A}' = 0$; it is a solution of the equation

    $\nabla^2\chi = -\nabla\cdot\mathbf{A}$    (3.11)

If the vector potential is so chosen, (3.9) takes on the form of a Poisson
equation:

    $\nabla^2\mathbf{A} = -\mathbf{J}$    (3.12)

If there are no sources, $\mathbf{J} = 0$ and we have a Laplace equation for $\mathbf{A}$

    $\nabla^2\mathbf{A} = 0$    (3.13)

However, in this case it is clearly simpler to represent our field by a
scalar potential.
The difference in the two cases which we have discussed is this:
Sources of flux are scalar, and so the potential, too, is scalar. Sources of
circulation, on the other hand, have a vectorial (directional) character,
and hence require a vector potential.

Both of these situations are met in electromagnetic theory. To see
this let us start from Maxwell's equations:

(I)
    $\nabla\cdot\mathbf{D} = \rho$    (3.14)
which expresses Gauss' theorem, that the sources of electric
flux are charges; $\rho$ is the charge density.

(II)
    $\nabla\cdot\mathbf{B} = 0$    (3.15)
which says that there are no magnetic sources (magnetic
poles).

(III)
    $\nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t}$    (3.16)
This is the vector expression of Faraday's law of induction;
it states that a time-varying magnetic flux produces a
circulation in the electric field.

(IV)
    $\nabla\times\mathbf{H} = \mathbf{J} + \frac{\partial\mathbf{D}}{\partial t}$    (3.17)

This equation is a generalization of Ampère's law. In the steady state
$\partial\mathbf{D}/\partial t = 0$, and the equation states that a current produces a circulation of
magnetic field. The second term on the right-hand side expresses the
discovery of Maxwell that a “displacement current” also produces a
circulation in the magnetic field.¹

The preceding equations must be supplemented by one which relates
the displacement $\mathbf{D}$ to the electric field $\mathbf{E}$:

    $\mathbf{D} = \epsilon\mathbf{E}$    (3.18)

where $\epsilon$ is the “dielectric constant,” and another which relates the
magnetic induction $\mathbf{B}$ to the magnetic field $\mathbf{H}$:

    $\mathbf{B} = \mu\mathbf{H}$    (3.19)

where $\mu$ is the “magnetic permeability.” We consider only problems in
which $\epsilon$ and $\mu$ are constant.


1See J. D. Jackson, Classical Electrodynamics, p. 178. 




If we confine our attention to “steady-state” problems, the time 
derivatives on the right-hand sides of (3.16) and (3.17) are zero. Coupling 
between the electric and magnetic fields then ceases to exist, and the 
theory of the electrostatic field is to be found in Eqs. (3.14) and (3.16), 
while that of the magnetostatic field is given by (3.15) and (3.17). 

Considering first the electrostatic case, we may write 


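$$\mathbf{E} = -\nabla \phi \tag{3.20}$$
where $\phi$ is the electrostatic potential, and
$$\nabla^2 \phi = -\frac{\rho}{\epsilon} \tag{3.21}$$
It will be found convenient, to avoid writing awkward factors, to introduce
$$\Phi = 4\pi\epsilon\,\phi \tag{3.22}$$
from which it follows that
$$\nabla^2 \Phi = -4\pi\rho \tag{3.23}$$
In terms of $\Phi$, the electric field is
$$\mathbf{E} = -\frac{1}{4\pi\epsilon}\nabla\Phi \tag{3.24}$$

Turning to the magnetostatic case, the situation closely resembles
that envisaged in Eqs. (3.6) and (3.7). From (3.15),

$$\mathbf{B} = \nabla \times \mathbf{A}$$
If A is chosen so that $\nabla \cdot \mathbf{A} = 0$, (3.17) yields for A the equation
$$\nabla^2 \mathbf{A} = -\mu \mathbf{J} \tag{3.25}$$
There is some convenience in introducing
$$\boldsymbol{\mathcal{A}} = \frac{4\pi}{\mu}\,\mathbf{A} \tag{3.26}$$
so that
$$\nabla^2 \boldsymbol{\mathcal{A}} = -4\pi \mathbf{J} \tag{3.27}$$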


which is quite analogous to (3.23) except that it is a vector rather than a 
scalar equation. 


2. Solution of Laplace’s Equation 
in Spherical Coordinates 


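In spherical polar coordinates $(r, \theta, \varphi)$, Laplace's equation takes
the form
$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial\Phi}{\partial r}\right) + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial\Phi}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\Phi}{\partial\varphi^2} = 0 \tag{3.28}$$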




The equation may be solved by using the method of separation of variables 
in two stages. First, put 


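$$\Phi = R(r)\,S(\theta, \varphi) \tag{3.29}$$
Substituting in the equation, multiplying by $r^2$, and dividing by $\Phi$ gives

$$\frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{1}{S\sin^2\theta}\left[\sin\theta\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial S}{\partial\theta}\right) + \frac{\partial^2 S}{\partial\varphi^2}\right] = 0 \tag{3.30}$$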


The first term is a function of the radial coordinate only, and the second 
of the angular coordinates. For the sum to be zero, each must be a 
constant, that is, 


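$$\frac{1}{R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) = \lambda \tag{3.31}$$
$$\frac{1}{S\sin^2\theta}\left[\sin\theta\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial S}{\partial\theta}\right) + \frac{\partial^2 S}{\partial\varphi^2}\right] = -\lambda \tag{3.32}$$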

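We next multiply (3.32) by $S\sin^2\theta$, put
$$S = f(\theta)\,g(\varphi) \tag{3.33}$$
and divide by $fg$ to get
$$\left\{\frac{\sin\theta}{f}\frac{d}{d\theta}\left(\sin\theta\frac{df}{d\theta}\right) + \lambda\sin^2\theta\right\} + \frac{1}{g}\frac{d^2 g}{d\varphi^2} = 0 \tag{3.34}$$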


Again, the term in the curly bracket and the last term must be equal and
opposite constants, which we write $m^2$, $-m^2$. Thus

$$\frac{1}{\sin\theta}\frac{d}{d\theta}\left(\sin\theta\frac{df}{d\theta}\right) + \left(\lambda - \frac{m^2}{\sin^2\theta}\right) f = 0 \tag{3.35}$$
and
$$\frac{d^2 g}{d\varphi^2} = -m^2 g \tag{3.36}$$
Since the solution of (3.36) is
$$g = a\cos m\varphi + b\sin m\varphi \tag{3.37}$$

and the solution must be single-valued (that is, must remain unaltered
when $\varphi$ is increased by $2\pi$), m must be a real integer.


3. The θ Equation and the
Factorization Method


The next equation to be solved is (3.35), for the function $f(\theta)$. We see
that this defines an eigenvalue problem, since f is finite for all angles $\theta$
only if $\lambda$ has appropriately chosen values.




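The method which we use for the solution of this problem is the
"factorization method," which is discussed in detail for a wide range of
problems in an article by L. Infeld and T. Hull.²

Equation (3.35) for $f(\theta)$ may be written

$$\frac{d^2 f}{d\theta^2} + \cot\theta\,\frac{df}{d\theta} - m^2\csc^2\theta\, f = -\lambda f \tag{3.38}$$

We try to "factorize" the left-hand side into two first-order operators:

$$\left(\frac{d}{d\theta} + \alpha\cot\theta\right)\left(\frac{d}{d\theta} + \beta\cot\theta\right) f = \frac{d^2 f}{d\theta^2} + (\alpha+\beta)\cot\theta\,\frac{df}{d\theta} - \beta\csc^2\theta\, f + \alpha\beta\cot^2\theta\, f$$

$$= \frac{d^2 f}{d\theta^2} + (\alpha+\beta)\cot\theta\,\frac{df}{d\theta} - \beta(1-\alpha)\csc^2\theta\, f - \alpha\beta f \tag{3.39}$$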


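The first three terms of this expression can be made to correspond with
the left-hand side of (3.38) by choosing $\alpha$ and $\beta$ so that

$$\alpha + \beta = 1 \tag{3.40}$$
$$\beta(1 - \alpha) = m^2 \tag{3.41}$$
These equations have two solutions:
(a) $\beta = m$, $\alpha = -(m-1)$
(b) $\beta = -m$, $\alpha = m+1$
Let us define two operators
$$M_m^{(+)} = \frac{d}{d\theta} - m\cot\theta \tag{3.42}$$
$$M_m^{(-)} = \frac{d}{d\theta} + (m+1)\cot\theta \tag{3.43}$$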


The signs used to designate the M’s, which seem at first sight illogical, will 
be justified as we proceed. 

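Using (3.39), (3.42), and (3.43), Eq. (3.38) may now be written in
either of two ways:

$$M_{m-1}^{(+)} M_{m-1}^{(-)} f_m = -[\lambda - m(m-1)]\, f_m \tag{3.44}$$
or
$$M_m^{(-)} M_m^{(+)} f_m = -[\lambda - m(m+1)]\, f_m \tag{3.45}$$


² "The Factorization Method," Rev. Mod. Phys. 23, 21 (1951).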




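Let us operate on the first of these equations with $M_{m-1}^{(-)}$:
$$M_{m-1}^{(-)} M_{m-1}^{(+)}\left(M_{m-1}^{(-)} f_m\right) = -[\lambda - m(m-1)]\left(M_{m-1}^{(-)} f_m\right)$$
Comparing with (3.45), in which m is replaced by m − 1, it may be seen that
$M_{m-1}^{(-)} f_m$ is proportional to $f_{m-1}$; raising the index by one, we may put

$$M_m^{(-)} f_{m+1} = p_m f_m \tag{3.46}$$
where $p_m$ is a constant.
Similarly, operating on (3.45) with $M_m^{(+)}$ gives

$$M_m^{(+)} M_m^{(-)}\left(M_m^{(+)} f_m\right) = -[\lambda - m(m+1)]\left(M_m^{(+)} f_m\right)$$
which, on comparison with (3.44), permits us to conclude that

$$M_m^{(+)} f_m = q_m f_{m+1} \tag{3.47}$$

Equations (3.46) and (3.47) show how, starting with the solution for a
given m, we may get those for m increased or decreased by unity.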
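Let us next prove the following theorem: That

$$\int_0^\pi \bar f\, M_m^{(-)} f \,\sin\theta\, d\theta = -\int_0^\pi f\, M_m^{(+)} \bar f \,\sin\theta\, d\theta \tag{3.48}$$

where $f$ and $\bar f$ are arbitrary bounded functions of $\theta$. The proof is quite
direct. For

$$\int_0^\pi \sin\theta\,\bar f\left[\frac{d}{d\theta} + (m+1)\cot\theta\right] f\, d\theta = \Big[\sin\theta\,\bar f f\Big]_0^\pi - \int_0^\pi f\left[\frac{d}{d\theta}\left(\sin\theta\,\bar f\right) - (m+1)\cos\theta\,\bar f\right] d\theta$$

$$= -\int_0^\pi \sin\theta\, f\left[\frac{d}{d\theta} - m\cot\theta\right]\bar f\, d\theta$$
We now apply this theorem to two different cases:

(i) $f = f_{m+1}$, $\bar f = M_m^{(-)} f_{m+1}$

and
(ii) $f = M_m^{(+)} f_m$, $\bar f = f_m$
In case (i) Eq. (3.48) yields

$$\int_0^\pi f_{m+1}\, M_m^{(+)} M_m^{(-)} f_{m+1}\,\sin\theta\, d\theta = -\int_0^\pi \left(M_m^{(-)} f_{m+1}\right)\left(M_m^{(-)} f_{m+1}\right)\sin\theta\, d\theta$$

By (3.44) the left-hand side is
$$-[\lambda - m(m+1)]\int_0^\pi f_{m+1}^2 \sin\theta\, d\theta$$
whereas by (3.46) the right-hand side is
$$-p_m^2\int_0^\pi f_m^2 \sin\theta\, d\theta$$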
0 


We may normalize our $f_m$ functions so that the integrals are equal to
unity; in that case

$$p_m = \sqrt{\lambda - m(m+1)} \tag{3.49}$$
Proceeding in an exactly similar way in case (ii), it is also found that
$$q_m = \sqrt{\lambda - m(m+1)} \tag{3.50}$$

Thus, with these values of $p_m$ and $q_m$, (3.46) and (3.47) show that $M_m^{(-)}$
and $M_m^{(+)}$, respectively, convert normalized solutions of a given order
into normalized solutions of order lower and higher, respectively, by one
in m.

Since, however, (3.49) and (3.50) show that $m(m+1) \le \lambda$, m can
neither be raised nor lowered indefinitely. Letting $m_1$ be the maximum
value of m, and $m_2$ its minimum, each must satisfy the equation

$$m(m+1) = \lambda \tag{3.51}$$

since otherwise (3.46) or (3.47) would produce solutions with m lower
than the minimum or higher than the maximum, respectively. Thus we
must have

$$\lambda = l(l+1) \tag{3.52}$$
where l is an integer. The maximum m is then seen to be l in (3.47), the
minimum −(l + 1) in (3.46). Therefore, for a given l, the f's run from $f_{-l}$
to $f_l$.

Equations (3.46) with m = −(l + 1) and (3.47) with m = l give equations
for $f_{-l}$ and $f_l$, respectively, as follows:

$$\left(\frac{d}{d\theta} - l\cot\theta\right) f_{-l} = 0 \tag{3.53}$$
and
$$\left(\frac{d}{d\theta} - l\cot\theta\right) f_l = 0 \tag{3.54}$$

Aside from the question of normalization, the solution for each is $\sin^l\theta$.
But normalization requires that it be multiplied by C, where

$$C^2\int_0^\pi \sin^{2l+1}\theta\, d\theta = 1$$

Evaluating the integral,

$$C^2 = \frac{(2l+1)!}{2^{2l+1}\,(l!)^2} \tag{3.55}$$




Let the normalized solution for a given l and m be designated $\Theta_{lm}$.
Then what has just been shown is that

$$\Theta_{ll} = \sqrt{\frac{(2l+1)!}{2}}\;\frac{1}{2^l\, l!}\,\sin^l\theta \tag{3.56}$$

The normalized solutions for the same l and other m's are then obtained,
as may be seen from (3.46), (3.43), and (3.49), by the equation

$$\Theta_{l,m-1} = \frac{1}{\sqrt{l(l+1) - m(m-1)}}\left[\frac{d}{d\theta} + m\cot\theta\right]\Theta_{lm} \tag{3.57}$$


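The following are the explicit forms of these functions for l = 0, l = 1,
and l = 2.

(i) l = 0:
$$\Theta_{00} = \frac{1}{\sqrt{2}}$$
(ii) l = 1:
$$\Theta_{11} = \frac{\sqrt{3}}{2}\sin\theta, \qquad \Theta_{10} = \sqrt{\tfrac{3}{2}}\cos\theta, \qquad \Theta_{1,-1} = -\frac{\sqrt{3}}{2}\sin\theta$$
(iii) l = 2:
$$\Theta_{22} = \frac{\sqrt{15}}{4}\sin^2\theta, \qquad \Theta_{21} = \frac{\sqrt{15}}{2}\sin\theta\cos\theta, \qquad \Theta_{20} = \frac{\sqrt{5}}{2\sqrt{2}}\,(3\cos^2\theta - 1),$$
$$\Theta_{2,-1} = -\frac{\sqrt{15}}{2}\sin\theta\cos\theta, \qquad \Theta_{2,-2} = \frac{\sqrt{15}}{4}\sin^2\theta \tag{3.58}$$

The minus signs appearing in the solutions for negative m are purely
conventional; since we determined only $p_m^2$ and $q_m^2$ we might just as well
have chosen them with negative signs for negative m's. Were that done,
we would have found in each case that $\Theta_{l,-m} = \Theta_{l,m}$. This is, in fact, the
more usual convention.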
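The lowering rule (3.57) and the normalization can be checked numerically against the l ≤ 2 entries of (3.58). The following is a minimal sketch, not part of the original derivation: the table entries are typed in as reconstructed from (3.56) and (3.57), and the names `THETA`, `lowered`, and `norm_integral` are illustrative choices. The derivative is taken by a central difference and the integral by the midpoint rule, so agreement is only to numerical accuracy.

```python
import math

S3, S5, S15 = math.sqrt(3), math.sqrt(5), math.sqrt(15)

# Entries of (3.58), keyed by (l, m); typed in by hand (an assumption of this sketch).
THETA = {
    (0, 0): lambda t: 1 / math.sqrt(2),
    (1, 1): lambda t: S3 / 2 * math.sin(t),
    (1, 0): lambda t: math.sqrt(1.5) * math.cos(t),
    (1, -1): lambda t: -S3 / 2 * math.sin(t),
    (2, 2): lambda t: S15 / 4 * math.sin(t) ** 2,
    (2, 1): lambda t: S15 / 2 * math.sin(t) * math.cos(t),
    (2, 0): lambda t: S5 / (2 * math.sqrt(2)) * (3 * math.cos(t) ** 2 - 1),
    (2, -1): lambda t: -S15 / 2 * math.sin(t) * math.cos(t),
    (2, -2): lambda t: S15 / 4 * math.sin(t) ** 2,
}

def lowered(l, m, t, h=1e-5):
    """Right-hand side of (3.57) applied to Theta_{l,m}, evaluated at angle t."""
    f = THETA[(l, m)]
    derivative = (f(t + h) - f(t - h)) / (2 * h)  # central difference
    norm = math.sqrt(l * (l + 1) - m * (m - 1))
    return (derivative + m * f(t) / math.tan(t)) / norm

def norm_integral(l, m, n=4000):
    """Midpoint estimate of the integral of Theta_{l,m}^2 sin(theta) over (0, pi)."""
    dt = math.pi / n
    f = THETA[(l, m)]
    return sum(f((k + 0.5) * dt) ** 2 * math.sin((k + 0.5) * dt) for k in range(n)) * dt
```

For example, `lowered(2, 1, 0.9)` should agree with `THETA[(2, 0)](0.9)` to within the finite-difference error, and each `norm_integral(l, m)` should be close to unity.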




4. Spherical Harmonics 


The normalized solutions to the angular equation (3.32) with $\lambda = l(l+1)$
are known as the "spherical harmonics" of order m, and are
designated $Y_{lm}(\theta, \varphi)$. They are obtained by multiplying each of the functions
$\Theta_{lm}$ by the normalized g-function, namely,

$$\frac{1}{\sqrt{2\pi}}\, e^{im\varphi}$$

The θ-functions, with a somewhat different normalization which we discuss
later, are the so-called "associated Legendre functions." Equation
(3.35) with $\lambda = l(l+1)$ is the "associated Legendre equation."


5. Radial Solution and the 
General Solution of 
Laplace’s Equation 


If we put $\lambda = l(l+1)$ in the radial equation (3.31), it may be immediately
solved (since it is an equation of the homogeneous type) to give

$$R = A_l\, r^l + B_l\, r^{-l-1} \tag{3.59}$$

The general separated solution then has the form

$$\Phi = \sum_{l=0}^{\infty}\sum_{m=-l}^{l}\left(A_{lm}\, r^l + \frac{B_{lm}}{r^{l+1}}\right) Y_{lm}(\theta, \varphi)$$

$$= \frac{1}{\sqrt{2\pi}}\sum_{l=0}^{\infty}\sum_{m=-l}^{l}\left(A_{lm}\, r^l + \frac{B_{lm}}{r^{l+1}}\right)\Theta_{lm}(\theta)\, e^{im\varphi} \tag{3.60}$$

This is a general form which must be satisfied by all potentials in free
space.

The simplest solution for $\Phi$, which represents the potential of a unit
point charge, is

$$\Phi = \frac{1}{r} \tag{3.61}$$
where the origin is taken at the charge. This is, of course, a particular
case of (3.60) with all A's equal to zero, as well as all B's except $B_{00}$, which
is unity.
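As a numerical illustration of (3.59) and (3.60), one may check that the axially symmetric terms $r^l P_l(\cos\theta)$ and $r^{-l-1} P_l(\cos\theta)$ have a vanishing Laplacian away from the origin. The sketch below is only a spot check under stated assumptions: the Legendre polynomials are generated by the standard three-term recurrence, the Laplacian is a second-order finite difference in Cartesian coordinates, and all function names are illustrative.

```python
import math

def legendre(l, mu):
    """P_l(mu) from the standard three-term recurrence."""
    p_prev, p = 1.0, mu
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * mu * p - k * p_prev) / (k + 1)
    return p

def separated_solution(l, x, y, z, interior=True):
    """r^l P_l(cos theta) if interior, else r^-(l+1) P_l(cos theta)."""
    r = math.sqrt(x * x + y * y + z * z)
    radial = r ** l if interior else r ** (-(l + 1))
    return radial * legendre(l, z / r)

def laplacian(f, x, y, z, h=1e-3):
    """Second-order central-difference Laplacian in Cartesian coordinates."""
    total = (f(x + h, y, z) + f(x - h, y, z)
             + f(x, y + h, z) + f(x, y - h, z)
             + f(x, y, z + h) + f(x, y, z - h) - 6.0 * f(x, y, z))
    return total / (h * h)
```

At a point such as (0.7, −0.4, 1.1) the computed Laplacian of either radial form is zero to within the discretization error, whereas for a non-harmonic control such as $r^2$ it is close to 6.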

Potential problems are, in general, of two types: (1) those in which
we know a charge distribution, and the aim is to find the resulting potential
(these are problems in the solution of the Poisson equation); and (2)
those in which a potential must be determined which satisfies given boundary
conditions (e.g., $\Phi$ is a constant on conducting surfaces). The general
solution of problems of the latter sort depends on the geometry of the
surfaces on which the boundary conditions are given; to get analytic
solutions, it is necessary to find a system of coordinates in which the
equation "separates" (that is, in which a solution may be found which is
a product of functions of the separate variables), and such that the boundary
surfaces are parametric surfaces, that is, surfaces on which one of
the coordinates is constant. Spherical polar coordinates are therefore
suitable for problems involving spheres (r = constant), planes ($\theta = \pi/2$ or
$\varphi$ = constant), or cones ($\theta$ = constant). The subsequent sections are
devoted to the exploration of problems which can be solved in these
coordinates.


6. Legendre Polynomials 


In all potential problems in which there is axial symmetry, we may
choose the polar axis in the direction of the symmetry axis; the potentials
will then be independent of the angle $\varphi$. The angular dependence of the
solution for a given l [see (3.52)] is $\Theta_{l0}$. These functions are polynomials
in $\cos\theta$; with an appropriate normalization, they are the "Legendre
polynomials." The preceding section does not provide a very convenient
way of studying these polynomials. We therefore proceed in a different
way, which permits a straightforward derivation of some of their properties.

Consider the potential of a unit charge located, not at the origin, but
at a point S at a distance $z_0$ above the origin on the z axis. (See Fig. 3.2.)
The potential at P is the inverse of the distance SP, which is

$$\Phi = \frac{1}{\sqrt{r^2 - 2 r z_0\cos\theta + z_0^2}} \tag{3.62}$$
We henceforth write
$$\mu = \cos\theta \tag{3.63}$$

If $r > z_0$, we write
$$\Phi = \frac{1}{r\sqrt{1 - 2(z_0/r)\mu + (z_0/r)^2}} \tag{3.64}$$
and expand the inverse square root in powers of $z_0/r$. The result is of the
form
$$\Phi = \sum_{l=0}^{\infty}\frac{z_0^l}{r^{l+1}}\, P_l(\mu) \tag{3.65}$$
the $P_l$ being a polynomial of degree l.
Equation (3.65) is, of course, an axially symmetric potential. Comparing
it with (3.60) in the special case in which $A_{lm} = B_{lm} = 0$ for $m \ne 0$
we see that, to within a constant,

$$P_l(\mu) = \Theta_{l0}(\theta) \tag{3.66}$$



Figure 3.2 


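The expansion therefore gives us an alternative way of calculating
Legendre polynomials. It simplifies matters a little to write the relation
determining the Legendre polynomials in the form

$$\frac{1}{\sqrt{1 - 2t\mu + t^2}} = \sum_{l=0}^{\infty} t^l P_l(\mu) \tag{3.67}$$
where t < 1.

It is easy to determine the Legendre polynomials of low order by the
expansion:
$$\frac{1}{\sqrt{1 - 2t\mu + t^2}} = 1 + \tfrac{1}{2}(2t\mu - t^2) + \tfrac{3}{8}(2t\mu - t^2)^2 + \tfrac{5}{16}(2t\mu - t^2)^3 + \tfrac{35}{128}(2t\mu - t^2)^4 + \cdots$$

Picking out coefficients of the powers of t, we obtain the following formulas:

$$P_0(\mu) = 1, \qquad P_1(\mu) = \mu$$
$$P_2(\mu) = \tfrac{3}{2}\mu^2 - \tfrac{1}{2}, \qquad P_3(\mu) = \tfrac{5}{2}\mu^3 - \tfrac{3}{2}\mu \tag{3.68}$$
$$P_4(\mu) = \tfrac{35}{8}\mu^4 - \tfrac{15}{4}\mu^2 + \tfrac{3}{8}$$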
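The bookkeeping of picking out the coefficients of the powers of t in (3.67) can be mechanized. The sketch below expands $(1 - x)^{-1/2}$ with $x = 2t\mu - t^2$ in exact rational arithmetic and collects powers of t; the function name and the dictionary representation of polynomials in (t, μ) are choices made for this illustration only.

```python
from fractions import Fraction

def legendre_from_generating_function(max_degree):
    """Return {l: coefficients of P_l in ascending powers of mu} for l <= max_degree.

    Polynomials in (t, mu) are dicts mapping (power of t, power of mu) -> Fraction.
    """
    x = {(1, 1): Fraction(2), (2, 0): Fraction(-1)}   # x = 2*t*mu - t^2

    def multiply(a, b):
        out = {}
        for (ta, ma), ca in a.items():
            for (tb, mb), cb in b.items():
                if ta + tb <= max_degree:             # higher powers of t are not needed
                    key = (ta + tb, ma + mb)
                    out[key] = out.get(key, 0) + ca * cb
        return out

    series = {}
    x_power = {(0, 0): Fraction(1)}                   # x^k, starting with k = 0
    coefficient = Fraction(1)                         # coefficient of x^k in (1 - x)^(-1/2)
    for k in range(max_degree + 1):
        for key, c in x_power.items():
            series[key] = series.get(key, 0) + coefficient * c
        x_power = multiply(x_power, x)
        coefficient *= Fraction(2 * k + 1, 2 * k + 2)  # 1, 1/2, 3/8, 5/16, 35/128, ...

    return {l: [series.get((l, j), Fraction(0)) for j in range(l + 1)]
            for l in range(max_degree + 1)}
```

Calling `legendre_from_generating_function(4)[4]` returns the coefficients 3/8, 0, −15/4, 0, 35/8 in ascending powers of μ, in agreement with (3.68).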




Equation (3.67) gives us a straightforward way to prove a number of
properties of Legendre polynomials.


(I) $P_l(1) = 1$
Putting $\mu = 1$, (3.67) gives

$$\frac{1}{\sqrt{1 - 2t + t^2}} = \frac{1}{1 - t} = \sum_{l=0}^{\infty} t^l = \sum_{l=0}^{\infty} t^l P_l(1)$$
which proves the theorem.

(II) EVALUATION OF $\int_{-1}^{1} P_l(\mu) P_{l'}(\mu)\, d\mu$


Because the $P_l$'s are solutions of a Sturm-Liouville problem they
are mutually orthogonal; that is, the integral is zero if $l \ne l'$.
If we square Eq. (3.67) and integrate over $\mu$, we find that

$$\int_{-1}^{1}\frac{d\mu}{1 - 2t\mu + t^2} = \frac{1}{t}\ln\frac{1+t}{1-t} = \sum_{l=0}^{\infty} t^{2l}\int_{-1}^{1} P_l^2(\mu)\, d\mu$$
Now the left-hand side may be expanded in powers of t, to give

$$\frac{1}{t}\ln\frac{1+t}{1-t} = 2 + \frac{2}{3}t^2 + \frac{2}{5}t^4 + \cdots + \frac{2}{2l+1}t^{2l} + \cdots$$
Equating coefficients, we see that the integrals are

$$\int_{-1}^{1} P_l^2(\mu)\, d\mu = \frac{2}{2l+1} \tag{3.69}$$

Since the $\Theta_{l0}$'s are normalized to unity and the $P_l$'s to 2/(2l + 1),
$$P_l(\mu) = \sqrt{\frac{2}{2l+1}}\;\Theta_{l0} \tag{3.70}$$


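(III) EVALUATION OF $\int_0^1 P_l(\mu)\, d\mu$


This integral will appear in Problem (3.15) below. It may be evaluated
by integrating (3.67) as it stands. The integral of the left-hand side is

$$\left[-\frac{1}{t}\sqrt{1 - 2t\mu + t^2}\right]_{\mu=0}^{\mu=1} = \frac{1}{t}\left[\sqrt{1 + t^2} - (1 - t)\right]$$

which may be expanded in a power series to give

$$1 + \sum_{n=1}^{\infty}\binom{1/2}{n}\, t^{2n-1}$$

where $\binom{1/2}{n}$ is the coefficient of $z^n$ in the expansion of $(1+z)^{1/2}$.

Therefore

$$\int_0^1 P_l(\mu)\, d\mu = \begin{cases} 1, & l = 0 \\ 0, & l \text{ even but} \ne 0 \\ \binom{1/2}{n}, & l = 2n - 1 \end{cases} \tag{3.71}$$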
(IV) A RECURSION RELATION


Differentiating with respect to t both sides of Eq. (3.67), we obtain
the equation

$$\frac{\mu - t}{(1 - 2t\mu + t^2)^{3/2}} = \sum_l l\, t^{l-1} P_l(\mu) \tag{3.72}$$
which may be written
$$\frac{\mu - t}{\sqrt{1 - 2t\mu + t^2}} = (1 - 2t\mu + t^2)\sum_l l\, t^{l-1} P_l(\mu)$$
or

$$(\mu - t)\sum_l t^l P_l(\mu) = (1 - 2\mu t + t^2)\sum_l l\, t^{l-1} P_l(\mu) \tag{3.73}$$

If we now equate, on the two sides of this equation, the coefficients of $t^l$, we
find that

$$-P_{l-1} + \mu P_l = (l+1) P_{l+1} - 2\mu l P_l + (l-1) P_{l-1}$$
which may be rearranged to give
$$(2l+1)\,\mu P_l = (l+1) P_{l+1} + l P_{l-1} \tag{3.74}$$

This recursion relation, considered as an equation for $P_{l+1}$ in terms of
$P_l$ and $P_{l-1}$, provides an alternative means of calculating the successive
polynomials.
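The recursion (3.74) lends itself to direct computation. The sketch below builds the polynomials as exact coefficient lists and provides helpers with which properties (I) to (III) can be checked; the representation (ascending powers of μ) and the function names are assumptions of this illustration.

```python
from fractions import Fraction

def legendre_polys(max_degree):
    """Coefficient lists (ascending powers of mu) of P_0 .. P_max_degree via (3.74)."""
    polys = [[Fraction(1)], [Fraction(0), Fraction(1)]]
    for l in range(1, max_degree):
        mu_p = [Fraction(0)] + polys[l]                      # mu * P_l
        prev = polys[l - 1] + [Fraction(0), Fraction(0)]     # P_{l-1}, padded to equal length
        polys.append([((2 * l + 1) * a - l * b) / (l + 1) for a, b in zip(mu_p, prev)])
    return polys[: max_degree + 1]

def evaluate(poly, mu):
    """Value of the polynomial at mu."""
    return sum(c * mu ** k for k, c in enumerate(poly))

def integrate_product(p, q, lower=-1, upper=1):
    """Exact integral of p(mu) * q(mu) between the given limits."""
    total = Fraction(0)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            n = i + j + 1
            total += a * b * (Fraction(upper) ** n - Fraction(lower) ** n) / n
    return total
```

With `polys = legendre_polys(6)`, one finds `evaluate(polys[l], 1) == 1` for every l, `integrate_product(polys[l], polys[l]) == Fraction(2, 2 * l + 1)` as in (3.69), and `integrate_product(polys[3], [Fraction(1)], 0, 1) == Fraction(-1, 8)`, which is the binomial coefficient of $z^2$ in $(1+z)^{1/2}$ required by (3.71).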


(V) A DERIVATIVE FORMULA


There are also many different relationships between Legendre polynomials
and their derivatives. To obtain such relations we first differentiate
(3.67) with respect to $\mu$:
$$\frac{t}{(1 - 2t\mu + t^2)^{3/2}} = \sum_l t^l P_l'(\mu)$$
or
$$t\sum_l t^l P_l = (1 - 2t\mu + t^2)\sum_l t^l P_l'$$
Equating coefficients of each power of t gives

$$P_l = P_{l+1}' - 2\mu P_l' + P_{l-1}' \tag{3.75}$$
Differentiating (3.74) gives a second relation
$$P_l + \mu P_l' = \frac{l+1}{2l+1} P_{l+1}' + \frac{l}{2l+1} P_{l-1}' \tag{3.76}$$

Solving for $P_{l+1}'$, $P_{l-1}'$ in terms of $P_l$ and $P_l'$:

$$P_{l-1}' = -l P_l + \mu P_l' \tag{3.77}$$
and
$$P_{l+1}' = (l+1) P_l + \mu P_l' \tag{3.78}$$
from which the relation
$$(2l+1) P_l = P_{l+1}' - P_{l-1}' \tag{3.79}$$
follows.
Another relation, which expresses $P_l'$ in terms of Legendre functions,
is obtained as follows: Multiply (3.77) by $\mu$ to get

$$\mu^2 P_l' = \mu P_{l-1}' + l\mu P_l$$
Substituting for $\mu P_{l-1}'$ from (3.78), with l replaced by l − 1:

$$\mu^2 P_l' = P_l' - l P_{l-1} + l\mu P_l$$
Using (3.74),

$$(\mu^2 - 1) P_l' = \frac{l(l+1)}{2l+1}\,(P_{l+1} - P_{l-1}) \tag{3.80}$$

On differentiating (3.80) and using (3.79) we obtain directly the differential
equation for $P_l$:

$$\frac{d}{d\mu}\left[(1 - \mu^2) P_l'\right] + l(l+1) P_l = 0$$

By combining the relations already derived, numerous other equations
involving the Legendre polynomials and their derivatives may be
derived; however, (3.74), (3.79), and (3.80) are the most commonly
used ones.
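Relations such as (3.79) and the Legendre differential equation are polynomial identities, so they can be verified exactly for any particular l. The following sketch does this with rational coefficient lists; it regenerates the polynomials from the recursion (3.74), and the helper names are illustrative rather than taken from the text.

```python
from fractions import Fraction

def legendre_polys(n):
    """P_0 .. P_n as ascending coefficient lists, built with the recursion (3.74)."""
    polys = [[Fraction(1)], [Fraction(0), Fraction(1)]]
    for l in range(1, n):
        mu_p = [Fraction(0)] + polys[l]
        prev = polys[l - 1] + [Fraction(0), Fraction(0)]
        polys.append([((2 * l + 1) * a - l * b) / (l + 1) for a, b in zip(mu_p, prev)])
    return polys[: n + 1]

def derivative(p):
    """Coefficient list of dp/dmu."""
    return [k * c for k, c in enumerate(p)][1:] or [Fraction(0)]

def add(p, q, sign=1):
    """p + sign * q, padding the shorter list with zeros."""
    n = max(len(p), len(q))
    p = p + [Fraction(0)] * (n - len(p))
    q = q + [Fraction(0)] * (n - len(q))
    return [a + sign * b for a, b in zip(p, q)]

def is_zero(p):
    return all(c == 0 for c in p)

def legendre_ode_residual(l, polys):
    """d/dmu[(1 - mu^2) P_l'] + l(l+1) P_l, which should vanish identically."""
    dp = derivative(polys[l])
    one_minus_mu2_dp = add(dp, [Fraction(0), Fraction(0)] + dp, sign=-1)
    return add(derivative(one_minus_mu2_dp), [l * (l + 1) * c for c in polys[l]])
```

For each l tested, `is_zero(legendre_ode_residual(l, polys))` holds, and the same helpers confirm (3.79) by comparing `(2l + 1) P_l` with the difference of the derivatives of the neighboring polynomials.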


7. An Alternative Derivation of 
Legendre Polynomials: 
Multipoles 


The potential at a point P of a unit point charge at a point on the z axis
at a distance $z_0$ from the origin may be written in terms of the Cartesian
coordinates (x, y, z) of P; it is

$$\Phi = \frac{1}{\sqrt{x^2 + y^2 + (z - z_0)^2}} \tag{3.81}$$

This may be expanded in a Taylor's series about the value $z_0 = 0$:

$$\Phi = \sum_{n=0}^{\infty}\frac{z_0^n}{n!}\left[\frac{\partial^n}{\partial z_0^n}\,\frac{1}{\sqrt{x^2 + y^2 + (z - z_0)^2}}\right]_{z_0 = 0} \tag{3.82}$$

A derivative with respect to $z_0$ of a function of $z - z_0$ is the negative of its
derivative with respect to z. Therefore,

$$\Phi = \sum_{n=0}^{\infty}\frac{(-1)^n z_0^n}{n!}\,\frac{\partial^n}{\partial z^n}\left(\frac{1}{r}\right) \tag{3.83}$$
where
$$r = \sqrt{x^2 + y^2 + z^2} \tag{3.84}$$

If this equation is compared with (3.65), it is seen that

$$\frac{1}{r^{l+1}}\, P_l(\mu) = \frac{(-1)^l}{l!}\,\frac{\partial^l}{\partial z^l}\left(\frac{1}{r}\right) \tag{3.85}$$
or
$$P_l(\mu) = \frac{(-1)^l\, r^{l+1}}{l!}\,\frac{\partial^l}{\partial z^l}\left(\frac{1}{r}\right) \tag{3.86}$$

Thus, the Legendre polynomials may be derived by successive differentiation
with respect to z of 1/r.

It is, of course, obvious that the derivatives of 1/r are solutions of
Laplace's equation. It is possible to interpret the various terms as being
the potentials of multipoles of successive orders.

The first derivative represents a dipole potential. If we take a charge
−e at z = −ε and a charge e at z = ε, the potential is

$$\Phi_1 = -\frac{e}{\sqrt{x^2 + y^2 + (z + \epsilon)^2}} + \frac{e}{\sqrt{x^2 + y^2 + (z - \epsilon)^2}} \tag{3.87}$$

The "dipole" is obtained by letting $e \to \infty$ and $\epsilon \to 0$ in such a way that
$2e\epsilon \to \mu_1$, the "dipole moment." The potential becomes, in this limit,

$$\lim \Phi_1 = -\mu_1\,\frac{\partial}{\partial z}\left(\frac{1}{r}\right) \tag{3.88}$$

This dipole has, of course, a direction associated with it, the z axis, along
which lies the line joining the two charges.
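The limiting process leading to (3.88) can be followed numerically: holding the product 2eε fixed at the dipole moment while ε shrinks, the two-charge potential (3.87) approaches $-\mu_1\,\partial(1/r)/\partial z = \mu_1 z / r^3$. The sketch below is illustrative only; the function and parameter names are not from the text.

```python
import math

def pair_potential(x, y, z, moment, eps):
    """Potential (3.87) of -e at z = -eps and +e at z = +eps, with 2 * e * eps = moment."""
    e = moment / (2 * eps)
    rho2 = x * x + y * y
    return -e / math.sqrt(rho2 + (z + eps) ** 2) + e / math.sqrt(rho2 + (z - eps) ** 2)

def dipole_potential(x, y, z, moment):
    """Limit (3.88): -moment * d/dz (1/r) = moment * z / r^3 = moment * cos(theta) / r^2."""
    r = math.sqrt(x * x + y * y + z * z)
    return moment * z / r ** 3
```

At a fixed field point the difference between the two functions falls roughly as ε², the leading correction being an octupole-like term.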

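A pair of dipoles of opposite moment ($\mu_1$ at z = ε, $-\mu_1$ at z = −ε)
constitute a "quadrupole" when $\epsilon \to 0$, $\mu_1 \to \infty$ in such a way that
$2\epsilon\mu_1 = \mu_2$, the "quadrupole moment." The quadrupole potential is

$$\Phi_2 = 2\epsilon\mu_1\,\frac{\partial^2}{\partial z^2}\left(\frac{1}{r}\right) = \mu_2\,\frac{\partial^2}{\partial z^2}\left(\frac{1}{r}\right) \tag{3.89}$$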



The procedure may be extended a step at a time, an "octupole" being
the limit of two quadrupoles of opposite moment, and so forth. In fact, an
inductive argument permits generalization to the statement that the
potential of an n-pole is

$$\Phi_n = (-1)^n\,\mu_n\,\frac{\partial^n}{\partial z^n}\left(\frac{1}{r}\right) \tag{3.90}$$

Using (3.86) this may be written

$$\Phi_n = n!\,\mu_n\,\frac{P_n(\cos\theta)}{r^{n+1}} \tag{3.91}$$


Problem 3-1: Calculate the potential and multipole
moments of a uniform linear charge distribution extending from −l to l.


8. Multipoles without Axial 
Symmetry and Associated 
Legendre Functions 


It is, of course, true that not only derivatives of 1/r with respect to z
are solutions of Laplace's equation, but also derivatives with respect to x
and y, or any combination of derivatives with respect to the three coordinates.
If l derivatives are taken altogether, and the result expressed in
polar coordinates, the r dependence will be $1/r^{l+1}$. Comparing with (3.60),
it is evident that this solution is, then, some combination of spherical
harmonics of order l. Conversely, each spherical harmonic must be some
combination of derivatives of order l. To express the spherical harmonics
in terms of derivatives, it is useful to introduce the operators

$$\partial_\pm = \frac{\partial}{\partial x} \pm i\,\frac{\partial}{\partial y} \tag{3.92}$$

and to consider as our three basic derivative operators not ∂/∂x, ∂/∂y, and
∂/∂z but $\partial_+$, $\partial_-$, and $\partial_z = \partial/\partial z$.
A general solution of order l is

$$\partial_+^{\,p}\,\partial_-^{\,s}\,\partial_z^{\,l-p-s}\left(\frac{1}{r}\right) \tag{3.93}$$

where $p + s \le l$. If, however, use is made of the fact that 1/r satisfies
Laplace's equation (except, of course, at r = 0), in calculating these
derivatives we may use the relation

$$\partial_+\partial_- = -\partial_z^{\,2} \tag{3.94}$$




This may be used to eliminate entirely either the $\partial_+$ or $\partial_-$ operators,
depending on which occurs less frequently. Suppose, for instance, that
s < p. If we put

$$p = s + m \tag{3.95}$$

we see that the solution is

$$f_{lm} = \partial_+^{\,m}\,\partial_z^{\,l-m}\left(\frac{1}{r}\right) \tag{3.96}$$

where we have omitted the signature $(-1)^s$ in front. If we had had
s = p + m > p we should have had, in place of (3.96), the solution

$$f_{l,-m} = \partial_-^{\,m}\,\partial_z^{\,l-m}\left(\frac{1}{r}\right) \tag{3.97}$$

It may now be shown that, aside from arbitrary multiplicative constants,
$f_{lm}$ is the same as $Y_{lm}$, and $f_{l,-m}$ as $Y_{l,-m}$. This may be most simply
verified by showing that the dependence of the solutions on the polar
angle $\varphi$ is $e^{im\varphi}$ or $e^{-im\varphi}$, respectively.

It is not too difficult to show by induction that $\partial_\pm^{\,m}(1/r)$ may be
written

$$\partial_\pm^{\,m}\left(\frac{1}{r}\right) = (x \pm iy)^m\, p_m(r) \tag{3.98}$$
where $p_m(r)$ is a function only of the radial coordinate, which will be
determined later. For suppose it to be true for a given m. Let us then
operate once more with $\partial_\pm$. Clearly

$$\partial_\pm\,(x \pm iy)^m = 0 \tag{3.99}$$
whereas
$$\partial_\pm\, p_m(r) = \frac{x \pm iy}{r}\, p_m'(r) \tag{3.100}$$
Therefore,
$$\partial_\pm^{\,m+1}\left(\frac{1}{r}\right) = (x \pm iy)^{m+1}\,\frac{1}{r}\, p_m'(r) = (x \pm iy)^{m+1}\, p_{m+1}(r) \tag{3.101}$$
where
$$p_{m+1} = \frac{1}{r}\, p_m' \tag{3.102}$$

Since (3.98) is clearly true for m = 0, with $p_0 = 1/r$, the result is valid for
all m. Moreover, using (3.102), it is easily shown that

$$p_m(r) = \left(-\frac{1}{2}\right)^m\frac{(2m)!}{m!}\,\frac{1}{r^{2m+1}} \tag{3.103}$$




Once (3.98) is proven, the dependence of (3.97) on $\varphi$ follows immediately.
For

$$\partial_\pm^{\,m}\left(\frac{1}{r}\right) = r^m\sin^m\theta\; e^{\pm im\varphi}\, p_m(r) \tag{3.104}$$

But if (3.98) is now differentiated (l − m) times with respect to z, the
derivatives pass through $(x \pm iy)^m$ and operate only on $p_m(r)$; this yields a
function only of z and r, that is, of r and $\cos\theta$. Therefore, these further
differentiations do not affect the $\varphi$ dependence of the solution.

Looking now at $f_{lm}$ from (3.96), we can conclude, because of its r and
$\varphi$ dependence, that

$$\partial_+^{\,m}\,\partial_z^{\,l-m}\left(\frac{1}{r}\right) = C_{lm}\,\frac{1}{r^{l+1}}\,\Theta_{lm}\, e^{im\varphi} \tag{3.105}$$
$$\partial_-^{\,m}\,\partial_z^{\,l-m}\left(\frac{1}{r}\right) = C_{lm}\,\frac{1}{r^{l+1}}\,\Theta_{lm}\, e^{-im\varphi} \tag{3.106}$$

That $\Theta$ is unchanged when m is changed into −m in the $\varphi$-dependent term
is clear from the fact that the equation satisfied by $\Theta_{lm}$ depends only on $m^2$.

These $\Theta_{lm}$'s were defined to be normalized to unity. The "associated
Legendre functions," on the other hand, are differently normalized. They
are in fact defined by the relations

$$\partial_\pm^{\,|m|}\,\partial_z^{\,l-|m|}\left(\frac{1}{r}\right) = (-1)^l\,(l - |m|)!\;\frac{1}{r^{l+1}}\, P_l^{|m|}(\cos\theta)\, e^{\pm i|m|\varphi} \tag{3.107}$$

It will be noted that, with this definition, $P_l^{|m|}(\cos\theta)$ reduces to the
Legendre polynomial [Eq. (3.85)] for m = 0.

To determine the precise numerical relationship between $\Theta_{lm}$ and
$P_l^{|m|}$ it is necessary to obtain the normalization of $P_l^{|m|}$. It is shown in
Appendix 3A that the following recursion relation holds:

$$(2l+1)\,\mu P_l^m = (l - m + 1)\, P_{l+1}^m + (l + m)\, P_{l-1}^m \tag{3.108}$$

This may be used to evaluate the normalization integral as follows:
multiply (3.108) by $P_{l-1}^m$ and integrate over $\mu = \cos\theta$ from −1 to 1:

$$(2l+1)\int_{-1}^{1}\mu\, P_l^m P_{l-1}^m\, d\mu = (l + m)\int_{-1}^{1}\left(P_{l-1}^m\right)^2 d\mu \tag{3.109}$$
We have used the fact that
$$\int_{-1}^{1} P_l^m P_{l'}^m\, d\mu = 0 \tag{3.110}$$

for $l \ne l'$, which follows from treating the differential equation (3.35) as a
Sturm-Liouville problem, on the interval (−1, 1).




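If, now, on the left-hand side of (3.109) we substitute for $\mu P_{l-1}^m$ from
(3.108), with l replaced by l − 1, we get

$$(l - m)\,\frac{2l+1}{2l-1}\int_{-1}^{1}\left[P_l^m(\mu)\right]^2 d\mu = (l + m)\int_{-1}^{1}\left[P_{l-1}^m(\mu)\right]^2 d\mu \tag{3.111}$$

We may use this recursion relation repeatedly until l is reduced to the
value m:

$$\int_{-1}^{1}\left[P_l^m(\mu)\right]^2 d\mu = \frac{2m+1}{2l+1}\,\frac{(l+m)!}{(2m)!\,(l-m)!}\int_{-1}^{1}\left[P_m^m(\mu)\right]^2 d\mu \tag{3.112}$$

Now, from (3.107) in the case l = m, using (3.103) and (3.104), we see
that

$$(-1)^m\,\frac{1}{r^{m+1}}\, P_m^m(\mu)\, e^{\pm im\varphi} = r^m\sin^m\theta\; e^{\pm im\varphi}\left(-\frac{1}{2}\right)^m\frac{(2m)!}{m!}\,\frac{1}{r^{2m+1}}$$
Therefore,
$$P_m^m(\mu) = \frac{(2m)!}{2^m\, m!}\,\sin^m\theta \tag{3.113}$$
But
$$\int_0^\pi \sin^{2m+1}\theta\, d\theta = \frac{2^{2m+1}\,(m!)^2}{(2m+1)!} \tag{3.114}$$

We now have all that is required to evaluate our normalization integral:

$$\int_{-1}^{1}\left[P_l^m(\mu)\right]^2 d\mu = \frac{2}{2l+1}\,\frac{(l+m)!}{(l-m)!} \tag{3.115}$$

Since the $\Theta_{lm}$ are normalized to unity, it follows that, apart from sign,

$$\Theta_{lm} = \sqrt{\frac{2l+1}{2}\,\frac{(l-m)!}{(l+m)!}}\; P_l^m(\cos\theta) \tag{3.116}$$

Finally, the normalized spherical harmonics can be expressed in terms of
the associated Legendre functions:

$$Y_l^m(\theta, \varphi) = \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}}\; P_l^m(\cos\theta)\, e^{im\varphi} \tag{3.117}$$

It is easily seen that $Y_l^m$ and $Y_{l'}^{m'}$ are orthogonal, in the sense that the
integral over solid angle of the product of one with the complex conjugate
of the other is zero. For

$$\int Y_l^m(\theta, \varphi)\, Y_{l'}^{m'*}(\theta, \varphi)\,\sin\theta\, d\theta\, d\varphi = \sqrt{\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}\;\frac{2l'+1}{4\pi}\,\frac{(l'-m')!}{(l'+m')!}}\;\int_{-1}^{1} P_l^m(\mu)\, P_{l'}^{m'}(\mu)\, d\mu \times \int_0^{2\pi} e^{i(m-m')\varphi}\, d\varphi$$
where $\mu = \cos\theta$. The $\varphi$ integral gives $2\pi\delta_{mm'}$. The $\mu$ integral becomes

$$\int_{-1}^{1} P_l^m(\mu)\, P_{l'}^m(\mu)\, d\mu$$

which is zero unless l = l' because the $P_l^m$'s are solutions of a Sturm-Liouville
problem for a given m. But when l = l' we can evaluate the
integral from (3.115). Putting in the value of the integral given there we
get altogether

$$\frac{2l+1}{4\pi}\,\frac{(l-m)!}{(l+m)!}\cdot 2\pi\cdot\frac{2}{2l+1}\,\frac{(l+m)!}{(l-m)!} = 1$$

It follows then that the $Y_l^m$'s are orthonormal, that is, that
$$\int Y_l^m(\theta, \varphi)\, Y_{l'}^{m'*}(\theta, \varphi)\, d\Omega = \delta_{ll'}\,\delta_{mm'} \tag{3.118}$$

where the element of solid angle $d\Omega$ is equal to $\sin\theta\, d\theta\, d\varphi$.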
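The normalization integral (3.115) can be confirmed numerically by generating $P_l^m$ from the starting value (3.113) and stepping up in l with the recursion (3.108). The sketch below assumes m ≥ 0 and takes $P_{m-1}^m$ to vanish (the usual convention, which (3.108) with l = m requires); Simpson's rule is used for the integral, and all names are illustrative.

```python
import math

def assoc_legendre(l, m, mu):
    """P_l^m(mu) for 0 <= m <= l, from (3.113) and the recursion (3.108)."""
    sin_theta = math.sqrt(max(0.0, 1.0 - mu * mu))
    p = math.factorial(2 * m) / (2 ** m * math.factorial(m)) * sin_theta ** m
    p_prev = 0.0                      # P_{m-1}^m is taken to be zero (assumed convention)
    for k in range(m, l):
        p_prev, p = p, ((2 * k + 1) * mu * p - (k + m) * p_prev) / (k - m + 1)
    return p

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3

def norm_integral(l, m):
    """Left-hand side of (3.115), evaluated numerically."""
    return simpson(lambda mu: assoc_legendre(l, m, mu) ** 2, -1.0, 1.0)

def expected_norm(l, m):
    """Right-hand side of (3.115)."""
    return 2 / (2 * l + 1) * math.factorial(l + m) / math.factorial(l - m)
```

The ratio `norm_integral(l, m) / expected_norm(l, m)` should be unity to within quadrature error, and the integral of a product such as $P_2^1 P_4^1$ should vanish, as (3.110) requires.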


9. An Addition Theorem for 
Spherical Harmonics 


In Fig. 3.3 let r and r' be the position vectors of points with polar
angles $(\theta, \varphi)$ and $(\theta', \varphi')$, respectively, and $\alpha$ the angle between these
vectors.

Consider the expansion of a function of direction $F(\Omega) = F(\theta, \varphi)$ in
terms of spherical harmonics:

$$F(\theta, \varphi) = \sum_{l,m} A_{lm}\, Y_l^m(\theta, \varphi) \tag{3.119}$$

By the orthonormality theorem (3.118), we can find $A_{lm}$ by multiplying by
one of the $Y_l^{m*}$'s and integrating:

$$A_{lm} = \int F(\theta', \varphi')\, Y_l^{m*}(\theta', \varphi')\, d\Omega' \tag{3.120}$$
Substituting in (3.119), we obtain the equation

$$F(\theta, \varphi) = \int F(\theta', \varphi')\sum_{l,m} Y_l^m(\theta, \varphi)\, Y_l^{m*}(\theta', \varphi')\, d\Omega' \tag{3.121}$$




Figure 3.3 


This shows that the sum
$$\sum_{l,m} Y_l^m(\theta, \varphi)\, Y_l^{m*}(\theta', \varphi')$$

is a δ-function, which we may designate $\delta(\Omega - \Omega')$, which, when integrated
over solid angles, gives unity. Equation (3.121) becomes

$$F(\Omega) = \int F(\Omega')\,\delta(\Omega - \Omega')\, d\Omega' \tag{3.122}$$

Consider this equation now in terms of a new set of coordinates, in
which the polar axis is chosen along the direction $(\theta, \varphi)$. Then $Y_l^m(\theta, \varphi)$
becomes $Y_l^m(0, \varphi) = \sqrt{(2l+1)/4\pi}\;\delta_{m0}$ by virtue of (3.117). Then the sum

$$\sum_{l,m} Y_l^m(\theta, \varphi)\, Y_l^{m*}(\theta', \varphi')$$
becomes

$$\sum_l \sqrt{\frac{2l+1}{4\pi}}\; Y_l^{0*}(\alpha, \gamma)$$
where $(\alpha, \gamma)$ are the polar angles of the direction $(\theta', \varphi')$ relative to the
direction $\Omega$ as polar axis. Using (3.117) again, the sum becomes

$$\sum_l \frac{2l+1}{4\pi}\, P_l(\cos\alpha)$$

Thus, in these coordinates,

$$F(\Omega) = \sum_l \frac{2l+1}{4\pi}\int F(\Omega')\, P_l(\cos\alpha)\, d\Omega' \tag{3.123}$$

Since the right-hand sides of (3.121) and (3.123) are equal for all functions
$F(\Omega')$, it follows that we must have

$$\sum_l \frac{2l+1}{4\pi}\, P_l(\cos\alpha) = \sum_{l,m} Y_l^m(\theta, \varphi)\, Y_l^{m*}(\theta', \varphi') \tag{3.124}$$

By virtue of the orthogonality of the spherical harmonics, it follows that
$$Y_l^m(\theta, \varphi) = \frac{2l+1}{4\pi}\int P_l(\cos\alpha)\, Y_l^m(\theta', \varphi')\, d\Omega' \tag{3.125}$$
and also, of course, that
$$Y_l^{m*}(\theta, \varphi) = \frac{2l+1}{4\pi}\int P_l(\cos\alpha)\, Y_l^{m*}(\theta', \varphi')\, d\Omega'$$

If we now make an expansion of $P_l(\cos\alpha)$ in the harmonics $Y_l^m(\theta', \varphi')$
with $(\theta, \varphi)$ fixed, the coefficients are

$$\int P_l(\cos\alpha)\, Y_l^{m*}(\theta', \varphi')\, d\Omega' = \frac{4\pi}{2l+1}\, Y_l^{m*}(\theta, \varphi)$$
Thus, we have the "addition theorem"
$$P_l(\cos\alpha) = \frac{4\pi}{2l+1}\sum_m Y_l^m(\theta', \varphi')\, Y_l^{m*}(\theta, \varphi) \tag{3.126}$$

This merely says that in (3.124) the harmonics of each order may be
equated.

If we substitute for the Y's in terms of associated Legendre functions,
(3.126) becomes
$$P_l(\cos\alpha) = P_l(\cos\theta)\, P_l(\cos\theta') + 2\sum_{m=1}^{l}\frac{(l-m)!}{(l+m)!}\, P_l^m(\cos\theta)\, P_l^m(\cos\theta')\,\cos m(\varphi - \varphi') \tag{3.127}$$



Problem 3-2: A general change of axes can be described
in terms of "Euler's angles," which define the transformation from
old to new axes in three steps:

(a) rotation through angle $\Phi$ about the z axis
(b) rotation through angle $\Theta$ about the new x axis
(c) rotation through angle $\Psi$ about the position of the z axis after (b).


(i) Show that the first rotation results in the transformation
$$\varphi' = \varphi - \Phi$$
with $\theta$ fixed, so that
$$Y_l^m(\theta, \varphi) = e^{im\Phi}\, Y_l^m(\theta, \varphi')$$


(ii) Show that the second rotation makes the transformation between
derivatives in the primed axes and those in the new (double-primed) axes
as follows:

$$\partial_+' = \tfrac{1}{2}(1 + \cos\Theta)\,\partial_+'' + \tfrac{1}{2}(1 - \cos\Theta)\,\partial_-'' - i\sin\Theta\,\partial_z''$$
$$\partial_-' = \tfrac{1}{2}(1 - \cos\Theta)\,\partial_+'' + \tfrac{1}{2}(1 + \cos\Theta)\,\partial_-'' + i\sin\Theta\,\partial_z''$$
$$\partial_z' = \frac{1}{2i}\sin\Theta\,(\partial_+'' - \partial_-'') + \cos\Theta\,\partial_z''$$
(iii) Use the above results to prove the following transformation:

$$P_2(\cos\theta) = \frac{3\cos^2\Theta - 1}{2}\, P_2(\cos\theta'') + \sin\Theta\cos\Theta\, P_2^1(\cos\theta'')\,\sin(\varphi'' + \Psi) - \tfrac{1}{4}\sin^2\Theta\, P_2^2(\cos\theta'')\,\cos 2(\varphi'' + \Psi)$$

where $(\theta'', \varphi'')$ are the polar coordinates relative to the final set of axes.


10. Potential of a Given Charge 
Distribution 
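Consider an arbitrary charge distribution $\rho(\mathbf{r})$; let us try to calculate
the potential of the electrostatic field which it produces. A formula may be
written down for this potential making use of a physical argument. For the
charge distribution may be considered to be made up of elementary parts,
the potentials from which may be added to give the required potential.
Since the distribution is continuous, the summation is a continuous one,
that is to say, an integral. It is, in fact,

$$\Phi(\mathbf{r}) = \int\frac{\rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\, d^3r' \tag{3.128}$$

We may obtain the expansion of this potential in spherical harmonics by
first writing

$$\frac{1}{|\mathbf{r} - \mathbf{r}'|} = \sum_{l=0}^{\infty}\frac{r_<^{\,l}}{r_>^{\,l+1}}\, P_l(\cos\alpha)$$

where $r_<$ and $r_>$ are the smaller and the larger of r and r', and $\alpha$ is the
angle between r and r', and then using the addition theorem
(3.126):
$$\Phi = \sum_{l,m}\frac{4\pi}{2l+1}\int\rho(\mathbf{r}')\,\frac{r_<^{\,l}}{r_>^{\,l+1}}\, Y_l^{m*}(\theta', \varphi')\, r'^2\, dr'\, d\Omega'\; Y_l^m(\theta, \varphi)$$

$$= \sum_{l,m}\frac{4\pi}{2l+1}\left[\frac{1}{r^{l+1}}\int_{r' < r}\rho(\mathbf{r}')\, r'^{\,l}\, Y_l^{m*}(\theta', \varphi')\, r'^2\, dr'\, d\Omega' + r^l\int_{r' > r}\rho(\mathbf{r}')\,\frac{1}{r'^{\,l+1}}\, Y_l^{m*}(\theta', \varphi')\, r'^2\, dr'\, d\Omega'\right] Y_l^m(\theta, \varphi) \tag{3.129}$$

It should be observed that this has the general form (3.60).
If $\rho(\mathbf{r}') = 0$ for |r'| > R, then for r > R we have

$$\Phi = \sum_{l,m}\frac{4\pi}{2l+1}\left[\int\rho(\mathbf{r}')\, r'^{\,l}\, Y_l^{m*}(\theta', \varphi')\, d^3r'\right]\frac{Y_l^m(\theta, \varphi)}{r^{l+1}} \tag{3.130}$$

The integrals are defined as the "multipole moments" $Q_l^{m*}$ of the distribution:

$$\Phi = \sum_{l,m}\sqrt{\frac{4\pi}{2l+1}}\; Q_l^{m*}\,\frac{Y_l^m(\theta, \varphi)}{r^{l+1}} \tag{3.131}$$
where
$$Q_l^m = \sqrt{\frac{4\pi}{2l+1}}\int\rho(\mathbf{r}')\, r'^{\,l}\, Y_l^m(\theta', \varphi')\, d^3r' \tag{3.132}$$

As we have seen earlier, the various terms in (3.131) are all potentials of
multipole distributions.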


Problem 3-3: Calculate the potential of a hydrogen
atom in its ground state, which consists of a nucleus of charge +e and
an electron charge distribution of density

$$\rho(r) = -\frac{e}{\pi a_0^3}\, e^{-2r/a_0}$$


Problem 3-4: Calculate the potential of a hydrogen
atom in an excited 2p state, in which the electron distribution is

$$\rho(\mathbf{r}) = -\frac{e}{32\pi a_0^5}\, r^2\cos^2\theta\; e^{-r/a_0}$$


Problem 3-5: Calculate the multipole moments of a
distribution of four charges of alternating sign (−e, +e) at the corners of a
square.




Problem 3-6: The electrons in inert gas atoms have
charge distributions

$$\rho = f_l(r)\,|Y_l^m(\theta, \varphi)|^2$$

there being two electrons for each (l, m). Verify that the total charge
density is
$$2\sum_l f_l(r)$$


11. Potential of an Axially 
Symmetric Charge Distribution 


If the charge distribution has an axis of symmetry, and we choose the
axis of symmetry as the polar axis, it will be possible to expand the charge
distribution in Legendre polynomials:

$$\rho(\mathbf{r}') = \sum_{l'}\rho_{l'}(r')\, P_{l'}(\cos\theta') \tag{3.133}$$

where

$$\rho_{l'}(r') = \frac{2l'+1}{4\pi}\int\rho(\mathbf{r}')\, P_{l'}(\cos\theta')\, d\Omega' \tag{3.134}$$

Noting that $\sqrt{(2l+1)/4\pi}\; P_l(\cos\theta') = Y_l^0(\theta', \varphi')$, substituting (3.133)
into (3.132), and using the orthogonality of the Y's, we see that

$$Q_l^m = 0, \qquad m \ne 0$$
In fact, (3.131) and (3.132) then become

$$\Phi = \sum_l Q_l\,\frac{P_l(\cos\theta)}{r^{l+1}} \tag{3.135}$$

where
$$Q_l = \int\rho(\mathbf{r}')\, r'^{\,l}\, P_l(\cos\theta')\, d^3r' \tag{3.136}$$

Suppose now that by some direct means it is possible to calculate the
potential on the axis of symmetry, that is, for $\cos\theta = 1$. We have seen
earlier that $P_l(1) = 1$, so

$$\Phi_{\text{axis}} = \sum_l \frac{Q_l}{r^{l+1}} \tag{3.137}$$

Thus, if the potential on the axis of symmetry is known, expansion in 1/r
gives the multipole moments. Then, from (3.135), the potential is known
everywhere; one has only to multiply each term in the expansion of $\Phi_{\text{axis}}$
in powers of 1/r by the appropriate Legendre polynomial.




If we designate the density of the axially symmetric distribution as
$\rho(z', \xi)$, $\xi$ being distance measured perpendicularly from the symmetry
axis, it is clear from first principles that

$$\Phi_{\text{axis}} = 2\pi\int\!\!\int\frac{\rho(z', \xi)\,\xi\, d\xi\, dz'}{\sqrt{(z - z')^2 + \xi^2}} \tag{3.138}$$

where, on the axis, z = r. (See Fig. 3.4.)


Figure 3.4


Consider a ring of radius a with a total charge e; the charge per unit
length is then $e/2\pi a$. If we take the origin at its center, its potential at a
point distance z from the origin on the axis of symmetry is

$$\Phi = \frac{e}{\sqrt{a^2 + z^2}} \tag{3.139}$$
If z > a, we may expand

$$\Phi = \frac{e}{z}\sum_{n=0}^{\infty}\binom{-1/2}{n}\left(\frac{a}{z}\right)^{2n} \tag{3.140}$$

where $\binom{-1/2}{n}$ are the binomial coefficients in the expansion of $(1 + x)^{-1/2}$.
The potential is then

$$\Phi = e\sum_{n=0}^{\infty}\binom{-1/2}{n}\frac{a^{2n}}{r^{2n+1}}\, P_{2n}(\cos\theta) \tag{3.141}$$
If, on the other hand, z < a, the potential becomes

$$\Phi = e\sum_{n=0}^{\infty}\binom{-1/2}{n}\frac{r^{2n}}{a^{2n+1}}\, P_{2n}(\cos\theta) \tag{3.142}$$
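The series (3.141) and (3.142) may be compared with a direct summation over the ring at a point off the axis. The sketch below is illustrative only: the ring is replaced by many equal point charges, the Legendre polynomials come from the standard three-term recurrence, and the function names and default parameters are assumptions made for the example.

```python
import math

def legendre(l, mu):
    """P_l(mu) from the standard three-term recurrence."""
    p_prev, p = 1.0, mu
    if l == 0:
        return p_prev
    for k in range(1, l):
        p_prev, p = p, ((2 * k + 1) * mu * p - k * p_prev) / (k + 1)
    return p

def ring_potential_direct(r, theta, a=1.0, charge=1.0, n=4000):
    """Potential of n equal point charges spread evenly round the ring of radius a."""
    x, z = r * math.sin(theta), r * math.cos(theta)    # field point in the x-z plane
    total = 0.0
    for k in range(n):
        psi = 2 * math.pi * (k + 0.5) / n
        dx, dy = x - a * math.cos(psi), -a * math.sin(psi)
        total += 1.0 / math.sqrt(dx * dx + dy * dy + z * z)
    return charge * total / n

def ring_potential_series(r, theta, a=1.0, charge=1.0, terms=60):
    """Series (3.141) for r > a, or (3.142) for r < a."""
    mu = math.cos(theta)
    small, large = (a, r) if r > a else (r, a)
    total, coeff = 0.0, 1.0                            # coeff = binomial coefficient (-1/2 over n)
    for n in range(terms):
        total += coeff * small ** (2 * n) / large ** (2 * n + 1) * legendre(2 * n, mu)
        coeff *= -(2 * n + 1) / (2 * n + 2)
    return charge * total
```

Away from the sphere r = a, where the series converges slowly, the two evaluations agree to many figures; on the axis the series reduces to (3.139).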
As another example, consider the potential of the charge distribution 
on a conducting disk of radius a and containing a charge e. The charge per 
unit area on the surface is known to be 


p(g) = (3.143) 


11. Potential of an Axially Symmetric Charge Distribution 129 


The potential on the axis is, if the origin is taken at the center of the disk, 


d 144 
o=>5, | =a veut (3.144) 


Putting a? — €? = u? we get 


] 
@=£( ————d 
a ii Vae+22—u? 
a 


Vaet+2? 


€. 
=—sin 
a 


6 pg a 
= —tan™' — 
a 


|z| 
Forz<a 


= me eqn Zl te 
0= FS CO Ter 
and for z >a 


: 1. 
v=o 2 (—1) oe es 


Putting in the appropriate mea polynomials, 


(— 1 aie penti 


D = aa a “n+l gente gente Pon+1 (cos 0) (3. 145) 
whenr < a, and 
=> ee Fr Pan (COS 8) (3.146) 


whenr > a. 


A word of caution about (3.145): although the potential is clearly
symmetric on opposite sides of the disk, that is, symmetric in the
transformation $\theta \to \pi - \theta$, this does not seem to be so for (3.145). Going back
to the formula for the potential on the axis, however, we see that when
$z \to -z$ all terms in the sum change sign. This, then, shows that, for
$\theta > \pi/2$, the sign of the sum in (3.145) should be changed, so that the
potential retains its symmetry.
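The on-axis result for the disk can be cross-checked in a few lines. The sketch below is illustrative only: the function names are hypothetical, and it assumes the surface density $e/\left(2\pi a\sqrt{a^{2}-\xi^{2}}\right)$, which integrates to the total charge $e$. It compares the closed form $(e/a)\tan^{-1}(a/|z|)$ with the two power series and with a direct quadrature in which the substitution $\xi = a\sin t$ removes the edge singularity.

```python
import math

def disk_axis_closed(z, a=1.0, e=1.0):
    """Closed form (e/a) * arctan(a/|z|)."""
    return (e / a) * math.atan2(a, abs(z))

def disk_axis_series(z, a=1.0, e=1.0, terms=200):
    """Power series in |z|/a (inside) or a/|z| (outside)."""
    z = abs(z)
    if z < a:
        s = sum((-1) ** n / (2 * n + 1) * (z / a) ** (2 * n + 1) for n in range(terms))
        return (e / a) * (math.pi / 2.0 - s)
    s = sum((-1) ** n / (2 * n + 1) * (a / z) ** (2 * n + 1) for n in range(terms))
    return (e / a) * s

def disk_axis_quadrature(z, a=1.0, e=1.0, steps=20000):
    """Integral of the assumed density over the disk, with xi = a*sin(t)."""
    h = (math.pi / 2.0) / steps
    total = 0.0
    for k in range(steps):
        t = (k + 0.5) * h
        total += math.sin(t) / math.sqrt((a * math.sin(t)) ** 2 + z * z)
    return e * h * total
```

The series in $|z|/a$ depends on $|z|$ only, which is the numerical counterpart of the remark about changing the sign of the sum for $\theta > \pi/2$.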


Problem 3-7: Calculate the gravitational potential of
a thin disk of uniform mass $\sigma$ per unit area.


Problem 3-8: Calculate the gravitational potential of 
a homogeneous hemisphere of density (mass per unit volume) 6. 




12. Potentials of Charge 
Distributions under Various 
Boundary Conditions — Green’s. 
Functions 


In (3.128) the potential of a charge distribution of density $\rho(\mathbf{r})$ was
given as

$$\Phi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\,d^{3}r'}{|\mathbf{r}-\mathbf{r}'|}$$

This was valid in the absence of any boundary conditions; that is, for a
charge distribution in an infinite empty space. The function

$$G(\mathbf{r}-\mathbf{r}') = \frac{1}{|\mathbf{r}-\mathbf{r}'|} \tag{3.147}$$

is known as the Green's function for this problem. The Green's function
is simply the potential of a point source under the boundary conditions (or
lack of them) applying to the problem.

Let us consider now the potentials produced by charges when these 
potentials must satisfy various simple boundary conditions. 


(I) PLANE BOUNDARY CONDITIONS 

Suppose that we have an infinite plane conductor at $z = 0$. The
potential is then subject to the boundary condition that $\Phi = 0$ on $z = 0$.
Consider now a charge at $\mathbf{r}' = (x', y', z')$ where $z' > 0$. If there were an
equal and opposite charge at $(x', y', -z')$, which we call the "image point,"
then the potential would in any case be zero on the plane $z = 0$; if, then,
we took the conducting plane away, there would be no change of potential.
Therefore, the potential is

$$\Phi = \frac{1}{|\mathbf{r}-\mathbf{r}'|} - \frac{1}{|\mathbf{r}-\mathbf{r}''|} \tag{3.148}$$

where $\mathbf{r}''$ is the "image" of $\mathbf{r}'$ in $z = 0$. This is the "Green's function" for
the boundary conditions of the problem.

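If there is a charge distribution $\rho(\mathbf{r}')$ (lying entirely in the region
$z > 0$) its potential will then be

$$\Phi = \int \rho(\mathbf{r}')\left[\frac{1}{|\mathbf{r}-\mathbf{r}'|} - \frac{1}{|\mathbf{r}-\mathbf{r}''|}\right]d^{3}r' \tag{3.149}$$

If, in the second term, we introduce $-z'$ for $z'$ as a variable, (3.149)
becomes

$$\Phi = \int \left[\rho(\mathbf{r}') - \rho(\mathbf{r}'')\right]\frac{1}{|\mathbf{r}-\mathbf{r}'|}\,d^{3}r' \tag{3.150}$$

$\rho(\mathbf{r}'')$ is then an "image charge distribution."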




Consider next the case in which the charge is placed between two
parallel conducting planes, located, respectively, at $z = -a/2$ and $z = a/2$.

[Figure 3.5: the image charges F, D, B, the source at $z'$, and the image charges A, C, E along the $z$ axis, with the conducting planes at $z = -a/2$ and $z = a/2$.]

We may satisfy the boundary condition at $z = a/2$ by putting an image
charge at A, where $z = a - z'$. But if we now wish to satisfy the boundary
condition at $-a/2$, images both of the original charge and that at A must be
introduced. This means a negative charge at $-a - z'$ (point B) and a
positive one at D ($-2a + z'$). This then in turn requires further images in
$z = a/2$, positive at C ($2a + z'$), and negative at E ($3a - z'$), etc.
Continuing in this way, it is seen that the potential satisfying the boundary
conditions is that which has positive image charges at the points

$$\mathbf{r}_{n}^{+} = (x',\,y',\,z' + 2na), \qquad n = -\infty, \ldots, \infty$$

and negative ones at

$$\mathbf{r}_{n}^{-} = [x',\,y',\,-z' + (2n+1)a]$$

The Green's function is then

$$G = \sum_{n=-\infty}^{\infty}\left[\frac{1}{|\mathbf{r}-\mathbf{r}_{n}^{+}|} - \frac{1}{|\mathbf{r}-\mathbf{r}_{n}^{-}|}\right] \tag{3.151}$$

It should be noted, incidentally, that this potential shows a periodicity in
$z$ with period $2a$.
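A truncated version of this image sum is easy to evaluate. The following sketch is an illustration rather than a production routine: the function name, the default plate spacing $a = 1$ and the cutoff `nmax` are assumptions. Positive images sit at $z' + 2na$ and negative ones at $-z' + (2n+1)a$, and each pair is added together so that the conditionally convergent series is summed in a fixed order.

```python
import math

def plates_green(x, y, z, src, a=1.0, nmax=400):
    """Truncated image sum for a unit charge at src between planes z = -a/2 and z = a/2."""
    xs, ys, zs = src
    rho2 = (x - xs) ** 2 + (y - ys) ** 2
    total = 0.0
    for n in range(-nmax, nmax + 1):
        z_plus = zs + 2 * n * a            # positive image charges
        z_minus = -zs + (2 * n + 1) * a    # negative image charges
        total += 1.0 / math.sqrt(rho2 + (z - z_plus) ** 2)
        total -= 1.0 / math.sqrt(rho2 + (z - z_minus) ** 2)
    return total
```

With a symmetric cutoff the sum vanishes on $z = a/2$ up to rounding, because every positive image then has its mirror partner in the sum; on $z = -a/2$ one pair is left unmatched, so the residual only decays as the cutoff grows.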

It is not essentially more difficult to deal with the case of a charge in a
rectangular box, with boundaries $x = \pm a/2$, $y = \pm b/2$, $z = \pm c/2$. The image
system now forms a three-dimensional lattice. Taking the charge at the
point $(x', y', z')$ inside the box, we can, by analogy with the
one-dimensional case, take $2 \times 2 \times 2 = 8$ charges, four of each sign, in a cell of
dimensions $2a \times 2b \times 2c$, and then repeat each cell by periodicity, the
periods in the $x$, $y$, and $z$ directions being $2a$, $2b$, and $2c$. The eight basic
charges, with their signs, are

+ at $(x',\,y',\,z')$

− at $(-x'+a,\,y',\,z')$

− at $(x',\,-y'+b,\,z')$

− at $(x',\,y',\,-z'+c)$

+ at $(x',\,-y'+b,\,-z'+c)$

+ at $(-x'+a,\,y',\,-z'+c)$

+ at $(-x'+a,\,-y'+b,\,z')$

− at $(-x'+a,\,-y'+b,\,-z'+c)$

While these Green's functions have quite complicated analytical form,
they are not difficult to compute numerically on a digital computer.


(II) SPHERICAL BOUNDARY CONDITION

Suppose that a charge is placed outside a conducting sphere of radius
$a$. Let us first consider such a charge on the polar axis (or rather, choose the
polar axis through the charge).

Going back to (3.65), the potential of a unit charge at $r'$ ($> a$) at a
point $r$ ($< r'$ but $> a$) is

$$\Phi_{1} = \sum_{l}\frac{r^{l}}{r'^{\,l+1}}\,P_{l}(\cos\theta) \tag{3.152}$$

Suppose that we take an image charge of strength $-\alpha$ at the point $r''$ ($< a$).
Its potential is

$$\Phi_{2} = -\alpha\sum_{l}\frac{r''^{\,l}}{r^{l+1}}\,P_{l}(\cos\theta) \tag{3.153}$$

These can give a net potential zero for $r = a$ if, for all $l$'s,

$$\frac{a^{l}}{r'^{\,l+1}} = \alpha\,\frac{r''^{\,l}}{a^{l+1}} \tag{3.154}$$

This is clearly possible if $\alpha = a/r'$ and $r'' = a^{2}/r'$.
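The image prescription can be verified directly in closed form rather than term by term in $l$. The sketch below (hypothetical function name, unit source charge, Gaussian units) places the source on the polar axis at `r_src` and the image of strength $-a/r'$ at $a^{2}/r'$, and evaluates the sum of the two Coulomb potentials.

```python
import math

def grounded_sphere_potential(r, theta, r_src, a=1.0):
    """Potential of a unit charge at r_src > a on the polar axis plus its image."""
    alpha = a / r_src           # magnitude of the image charge
    r_img = a * a / r_src       # distance of the image from the center
    d_src = math.sqrt(r * r + r_src ** 2 - 2.0 * r * r_src * math.cos(theta))
    d_img = math.sqrt(r * r + r_img ** 2 - 2.0 * r * r_img * math.cos(theta))
    return 1.0 / d_src - alpha / d_img
```

Evaluating the function at $r = a$ for several angles gives zero to rounding error, which is the content of condition (3.154) summed over all $l$.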


Problem 3-9: Outside a grounded sphere of radius $a$,
there is placed a line containing charge $\sigma$ per unit length extending radially
from $r = b$ to $r = c$. What is the net potential (a) outside the sphere and
(b) inside the sphere?

Answer:

$$\sigma\left[\sinh^{-1}\frac{c - r\cos\theta}{r\sin\theta} - \sinh^{-1}\frac{b - r\cos\theta}{r\sin\theta} - \frac{a}{r}\sinh^{-1}\frac{cr - a^{2}\cos\theta}{a^{2}\sin\theta} + \frac{a}{r}\sinh^{-1}\frac{br - a^{2}\cos\theta}{a^{2}\sin\theta}\right]$$

outside and zero inside. This may be written alternatively as

$$\sigma\left[\ln\frac{c - r\cos\theta + \sqrt{c^{2}+r^{2}-2cr\cos\theta}}{b - r\cos\theta + \sqrt{b^{2}+r^{2}-2br\cos\theta}} - \frac{a}{r}\ln\frac{cr - a^{2}\cos\theta + \sqrt{c^{2}r^{2}+a^{4}-2cra^{2}\cos\theta}}{br - a^{2}\cos\theta + \sqrt{b^{2}r^{2}+a^{4}-2bra^{2}\cos\theta}}\right]$$

Observing now the physical content of this result, we see that the boundary
condition (zero potential on the sphere $r = a$) is satisfied if each
charge $e$ outside of the sphere at coordinates $(r', \theta')$ is matched by an
image charge $-(a/r')e$ inside with coordinates $(a^{2}/r', \theta')$. If, on the other
hand, the potential of the sphere is a constant $\Phi_{0}$, we must add to the
potential of the two charges a term $\Phi_{0}\,a/r$.

It is then possible to write down the "Green's function" corresponding
to these boundary conditions; it is

$$G_{a}(\mathbf{r},\mathbf{r}') = \frac{1}{|\mathbf{r}-\mathbf{r}'|} - \frac{a}{r'}\,\frac{1}{|\mathbf{r}-(a^{2}/r'^{2})\,\mathbf{r}'|} + \Phi_{0}\,\frac{a}{r} \tag{3.155}$$

The potential due to a charge distribution $\rho(\mathbf{r}')$ outside the sphere is then

$$\Phi(\mathbf{r}) = \int \rho(\mathbf{r}')\,G(\mathbf{r},\mathbf{r}')\,d^{3}r' \tag{3.156}$$

when $\mathbf{r}$ is also outside the sphere.


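From (3.128), we know that the potential due to a charge distribution
$\rho(\mathbf{r})$ can be written

$$\Phi(\mathbf{r}) = \sum_{l=0}^{\infty}\int \frac{r_{<}^{\,l}}{r_{>}^{\,l+1}}\,\rho(\mathbf{r}')\,P_{l}(\cos\alpha)\,d^{3}r' \tag{3.157}$$

$\alpha$ being the angle between the vectors $\mathbf{r}$ and $\mathbf{r}'$, and $r_{<}$, $r_{>}$ are, respectively,
the lesser and greater of $r$ and $r'$. Let us make, in this formula, the
substitutions

$$\mathbf{r} = \frac{a^{2}}{r_{1}^{2}}\,\mathbf{r}_{1}, \qquad \mathbf{r}' = \frac{a^{2}}{r_{1}'^{2}}\,\mathbf{r}_{1}' \tag{3.158}$$

that is, let us write (3.157) in terms of the image points of $\mathbf{r}$ and $\mathbf{r}'$ in the
sphere $r = a$. Since the angle between $\mathbf{r}_{1}$, $\mathbf{r}_{1}'$ is the same as that between $\mathbf{r}$
and $\mathbf{r}'$, $\cos\alpha$ can also relate to the former pair of vectors. The substitution
(3.158) into (3.157) gives

$$\Phi\!\left(\frac{a^{2}}{r_{1}},\Omega\right) = \frac{1}{a^{2}}\sum_{l=0}^{\infty}\int \frac{r_{1<}^{\,l+1}}{r_{1>}^{\,l}}\,\rho\!\left(\frac{a^{2}}{r_{1}'},\Omega'\right)P_{l}(\cos\alpha)\,d^{3}r' \tag{3.159}$$

where $\Omega$, $\Omega'$ stand for the polar angles of $\mathbf{r}$ and $\mathbf{r}'$ respectively. Since
$d^{3}r' = (a/r_{1}')^{6}\,d^{3}r_{1}'$, this may be written

$$\Phi\!\left(\frac{a^{2}}{r_{1}},\Omega\right) = \frac{r_{1}}{a}\sum_{l=0}^{\infty}\int \frac{r_{1<}^{\,l}}{r_{1>}^{\,l+1}}\left(\frac{a}{r_{1}'}\right)^{5}\rho\!\left(\frac{a^{2}}{r_{1}'},\Omega'\right)P_{l}(\cos\alpha)\,d^{3}r_{1}' \tag{3.160}$$

From this it follows that

$$\frac{a}{r_{1}}\,\Phi\!\left(\frac{a^{2}}{r_{1}},\Omega\right) = \Phi_{1}(\mathbf{r}_{1})$$

is the potential generated by the charge distribution

$$\rho_{1}(\mathbf{r}_{1}) = \left(\frac{a}{r_{1}}\right)^{5}\rho\!\left(\frac{a^{2}}{r_{1}},\Omega\right) \tag{3.161}$$

Thus, in general, if $\Phi(\mathbf{r})$ is a solution of the potential equation, so is

$$\Phi_{1}(\mathbf{r}) = \frac{a}{r}\,\Phi\!\left(\frac{a^{2}}{r^{2}}\,\mathbf{r}\right) \tag{3.162}$$

Given any potential, inversion in a sphere may then be used to generate
another potential; the relation between the corresponding charge
distributions is given by (3.161).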
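Inversion in a sphere can be tried out numerically. The sketch below is only an illustration: the helper names, the choice of a unit point charge at $(0, 0, 2)$ as the starting potential, and the finite-difference step are all assumptions. It builds $(a/r)\,\Phi(a^{2}\mathbf{r}/r^{2})$ from a given $\Phi$ and estimates its Laplacian with a seven-point stencil.

```python
import math

def source_potential(x, y, z):
    """Starting potential: unit point charge at (0, 0, 2)."""
    return 1.0 / math.sqrt(x * x + y * y + (z - 2.0) ** 2)

def inverted(phi, a=1.0):
    """Return the function (a/r) * phi(a^2 r / r^2), i.e. inversion in the sphere r = a."""
    def phi1(x, y, z):
        r2 = x * x + y * y + z * z
        s = a * a / r2
        return (a / math.sqrt(r2)) * phi(s * x, s * y, s * z)
    return phi1

def laplacian(f, x, y, z, h=1e-3):
    """Seven-point finite-difference estimate of the Laplacian of f."""
    return (f(x + h, y, z) + f(x - h, y, z) + f(x, y + h, z) + f(x, y - h, z)
            + f(x, y, z + h) + f(x, y, z - h) - 6.0 * f(x, y, z)) / (h * h)
```

For this starting potential the inverted function is the potential of a charge $a/2$ at the image point $(0, 0, a^{2}/2)$, which ties the inversion theorem back to the image charge found for the sphere.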


Problem 3-10: Show that the Green's function for the
region lying outside a hemisphere of radius $a$ is the sum of
infinite-medium Green's functions corresponding to sources at radius $r_{0}$ and at the
three image points: (i) the image of $r_{0}$ in the full sphere and (ii) the negative
images of the two preceding sources in the plane bounding the
hemisphere.


(III) CHARGE ON CONDUCTING SURFACES 
It is worth noting that, from a knowledge of the potential, it is easy to
find the charge distribution on the surface. For let us apply Gauss'
theorem

$$\int \nabla\cdot\mathbf{D}\,dV = \int \mathbf{D}\cdot\mathbf{n}\,dS \tag{3.163}$$

to a volume shaped like a "pillbox" bounding an area $dS$ on the surface,
and bounded by normals to the surface around the circumference of $dS$
and by surfaces parallel to $dS$ and a vanishingly small distance on each
side of it (see Fig. 3.6). The theorem converts the integral of $\nabla\cdot\mathbf{D}$ over the
enclosed volume into an integral over its surface. The edge contributions
can be made arbitrarily small. The surface integral is then

$$\langle\mathbf{D}\cdot\mathbf{n}\rangle_{+}\,dS - \langle\mathbf{D}\cdot\mathbf{n}\rangle_{-}\,dS$$

where $\langle\;\rangle$ designates an average over $dS$, and $+$ and $-$ denote the
surfaces in the direction $\mathbf{n}$ and opposite to it, respectively. But by (3.14)
the volume integral is $\langle\sigma\rangle\,dS$ where $\sigma$ is the charge per unit surface area.
Thus, letting $dS \to 0$,

$$\sigma = (\mathbf{D}\cdot\mathbf{n})_{+} - (\mathbf{D}\cdot\mathbf{n})_{-} \tag{3.164}$$

Since, by (3.18) and (3.22),

$$\mathbf{D} = -\frac{1}{4\pi}\nabla\Phi \tag{3.165}$$

it follows that

$$\sigma = -\frac{1}{4\pi}\left[\left(\frac{\partial\Phi}{\partial n}\right)_{+} - \left(\frac{\partial\Phi}{\partial n}\right)_{-}\right] \tag{3.166}$$

where $\partial\Phi/\partial n$ is the normal derivative $\mathbf{n}\cdot\nabla\Phi$.

Figure 3.6


Problem 3-11: A charge e is placed at a distance a 
from an infinite conducting plane kept at zero potential. Find the surface 
charge distribution on the plane. 


Problem 3-12: Using (3.153), show that, if a charge $e$
is placed at a point with $r = b > a$ and $\theta = 0$ outside a sphere of radius $a$,
the charge distribution on the surface is


on) 
4a la? b(a?+b?—2abp)?? 


where $\mu$ is the cosine of the polar angle $\theta$ of the point on the sphere.


(IV) SOME RELATIONS BETWEEN GREEN’S FUNCTIONS 

In this section we obtain relationships between the following Green’s 
functions: 

(1) That of a point source of unit strength. 


(2) That of a uniform laminar source of unit strength per unit area. 
(3) That of a spherically symmetric shell source of unit strength. 
These relationships are not restricted to potential problems, but apply
equally to all problems in which the contributions from different sources
are additive, that is, all linear problems.

Figure 3.7

It is clear that if the solution for a point source is known, that for a
uniform laminar source may be obtained by integration. Designating the
point source solution by $\Phi_{pt}(r)$ and that of a laminar source by $\Phi_{pl}(z)$
(the source being at $z = 0$), Fig. 3.7 shows that

$$\Phi_{pl}(z) = \int_{0}^{\infty}\Phi_{pt}\!\left(\sqrt{x^{2}+z^{2}}\right)2\pi x\,dx$$

Putting, in the integral, $r = \sqrt{x^{2}+z^{2}}$, the equation becomes

$$\Phi_{pl}(z) = \int_{z}^{\infty}\Phi_{pt}(r)\,2\pi r\,dr \tag{3.167}$$

or

$$\Phi_{pt}(r) = -\frac{1}{2\pi r}\,\frac{d\Phi_{pl}(r)}{dr} \tag{3.168}$$

In the form (3.168) the theorem is particularly useful, since it shows that
the solution of a one-dimensional problem provides the basis for solving
the most general three-dimensional problem.

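A trivial example is the potential problem, for there the
one-dimensional homogeneous equation is

$$\frac{d^{2}\Phi_{pl}}{dz^{2}} = 0$$

which gives $\Phi_{pl} = A + Bz$ for $z > 0$ (and, by symmetry, $A - Bz$ for $z < 0$).
Equation (3.166) gives a boundary condition for unit surface charge:

$$-\frac{1}{4\pi}\left[\left(\frac{d\Phi_{pl}}{dz}\right)_{+} - \left(\frac{d\Phi_{pl}}{dz}\right)_{-}\right] = 1$$

so that $B = -2\pi$. Equation (3.168) then gives

$$\Phi_{pt}(r) = \frac{1}{r}$$

as expected.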
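The passage from a laminar solution to a point-source solution is a single differentiation, so it can be sketched in a few lines. Everything named below is hypothetical: `point_from_laminar` applies $-\frac{1}{2\pi r}\,\frac{d}{dr}$ by a central difference, and the two laminar solutions used as inputs (the linear potential with slope $-2\pi$, and a decaying exponential of the Yukawa type with assumed constants $g$ and $\mu$) are supplied by the caller.

```python
import math

def point_from_laminar(phi_pl, r, h=1e-5):
    """Point-source solution from a laminar one: -(1 / (2 pi r)) d(phi_pl)/dr."""
    dphi = (phi_pl(r + h) - phi_pl(r - h)) / (2.0 * h)
    return -dphi / (2.0 * math.pi * r)

def coulomb_laminar(z, const=5.0):
    """Laminar potential A + B z with B = -2 pi (A is arbitrary)."""
    return const - 2.0 * math.pi * z

def yukawa_laminar(z, g=1.0, mu=1.3):
    """Assumed laminar solution (2 pi g / mu) exp(-mu z) of the one-dimensional Yukawa equation."""
    return (2.0 * math.pi * g / mu) * math.exp(-mu * z)
```

The first input reproduces $1/r$; the second gives $g\,e^{-\mu r}/r$, the familiar screened form, under the stated assumption about the laminar solution.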


It is further evident that the point source solution may be used to
generate the shell source one. Simple geometry shows, on reference to
Fig. 3.8, that

$$PQ = \sqrt{r^{2}+r'^{2}-2rx}$$

where $x$ is the distance OR. The area of the shaded strip is $2\pi r'\,dx$, and
its source strength is $(1/4\pi r'^{2})\,2\pi r'\,dx = (1/2r')\,dx$. Thus, the solution
due to the shell source is

$$\Phi_{sh}(r) = \frac{1}{2r'}\int_{-r'}^{r'}\Phi_{pt}\!\left(\sqrt{r^{2}+r'^{2}-2rx}\right)dx$$

Putting

$$r^{2}+r'^{2}-2rx = \xi^{2}$$

this may be written as

$$\Phi_{sh}(r) = \frac{1}{2rr'}\int_{|r-r'|}^{r+r'}\Phi_{pt}(\xi)\,\xi\,d\xi \tag{3.169}$$

Reference to (3.167) shows that this can be put in the form

$$\Phi_{sh}(r) = \frac{1}{4\pi rr'}\left[\Phi_{pl}(|r-r'|) - \Phi_{pl}(r+r')\right] \tag{3.170}$$

With this result, problems of spherical symmetry can also be obtained
from the one-dimensional Green's function.
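The two routes to the shell solution can be compared numerically: a quadrature of the point-source solution over $\xi$ from $|r-r'|$ to $r+r'$ with the prefactor $1/2rr'$, and the difference of two laminar values with the prefactor $1/4\pi rr'$. The helper names and the midpoint rule below are assumptions made for the sketch.

```python
import math

def shell_from_point(phi_pt, r, rp, steps=2000):
    """(1 / (2 r r')) * integral of phi_pt(xi) * xi over |r - r'| <= xi <= r + r'."""
    lo, hi = abs(r - rp), r + rp
    h = (hi - lo) / steps
    total = 0.0
    for k in range(steps):
        xi = lo + (k + 0.5) * h
        total += phi_pt(xi) * xi
    return total * h / (2.0 * r * rp)

def shell_from_laminar(phi_pl, r, rp):
    """(1 / (4 pi r r')) * [phi_pl(|r - r'|) - phi_pl(r + r')]."""
    return (phi_pl(abs(r - rp)) - phi_pl(r + rp)) / (4.0 * math.pi * r * rp)
```

For the Coulomb case both expressions reduce to $1/r_{>}$: constant inside the shell and $1/r$ outside, as expected for a uniformly charged shell of unit total strength.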


Figure 3.8




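Problem 3-13: The Yukawa equation is

$$\nabla^{2}\psi - \mu^{2}\psi = 0$$

If the source strength of a unit source at the origin is $-4\pi g\,\delta(\mathbf{r})$, show by
first solving the one-dimensional equation that

$$\psi_{pt}(r) = g\,\frac{e^{-\mu r}}{r}$$

$$\psi_{sh}(r) = g\,\frac{e^{-\mu|r-r'|} - e^{-\mu(r+r')}}{2\mu rr'}$$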


Problem 3-14: Show that the solution of the Yukawa 
potential for a uniform source of strength n per unit volume inside the 
sphere $r = a$ is


4 fea F cosh pa sinh pa| 
Ten —— |— saa 
npr Le pe? | 


forr > a. Find the form for r < a. 


13. Further Problems Involving 
the Potential Equation 


In this section we propose several boundary value problems involv- 
ing potentials or the potential equation, and indicate, without providing 
all the details of calculation, how they may be solved. The completion of 
the calculations will give the reader valuable practice in the use of the 
techniques of this chapter. 


Problem 3-15: Consider a conducting plate of large
extent (so that edge effects can be neglected), from which is cut a circular
hole of radius $a$. (See Fig. 3.9.) In this hole is attached a
hemispherical conductor, also of radius $a$. The plane is charged so that, far
from the sphere, the charge per unit area has a constant value $\sigma_{0}$. Find the
potential, and the charge density on and near the sphere.




Figure 3.9 


If we take the plate to lie in the plane $z = 0$, and the polar axis through
the center of the sphere and perpendicular to it, the solution can be
written, for $r > a$,

$$\Phi = \sum_{l}\left(A_{l}r^{l} + \frac{B_{l}}{r^{l+1}}\right)P_{l}(\cos\theta)$$

(a) If the condition is applied that the surface charge density has the
constant value $\sigma_{0}$ far from the hole, it is seen that the only nonzero $A$
coefficient is $A_{1}$, and that it must be

$$A_{1} = -2\pi\sigma_{0} \tag{3.171}$$

(b) From the condition that $\Phi = 0$ for $r = a$ it follows that

$$B_{1} = 2\pi\sigma_{0}a^{3}, \qquad B_{l} = 0, \quad l \neq 1 \tag{3.172}$$

Thus,

$$\Phi = -2\pi\sigma_{0}\left(r - \frac{a^{3}}{r^{2}}\right)\cos\theta$$

(c) The general form of the charge density on the plate is then

$$\sigma = \sigma_{0}\left(1 - \frac{a^{3}}{r^{3}}\right) \tag{3.173}$$


(d) The charge density on the sphere is

$$\sigma = 3\sigma_{0}\cos\theta \tag{3.174}$$




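Problem 3-16: Two hemispheres of radius $a$, separated
around their common circumference by a negligibly thin strip of
insulator, are kept at potentials $\pm V_{0}$, respectively. Find their potentials
and the charge distributions on them.

The potential may be written, for $r < a$,

$$\Phi_{1} = \sum_{l=0}^{\infty}A_{l}\,r^{l}\,P_{l}(\cos\theta) \tag{3.175}$$

and for $r > a$

$$\Phi_{2} = \sum_{l=0}^{\infty}\frac{B_{l}}{r^{l+1}}\,P_{l}(\cos\theta) \tag{3.176}$$

where $\theta$ is measured from the axis of symmetry. At $r = a$ they reduce to
the form

$$\Phi_{a} = \sum_{l}C_{l}\,P_{l}(\cos\theta) \tag{3.177}$$

where

$$A_{l} = \frac{C_{l}}{a^{l}}, \qquad B_{l} = C_{l}\,a^{l+1} \tag{3.178}$$

To find $C_{l}$, multiply (3.177) by $P_{l}(\cos\theta)\sin\theta$ and integrate over $\theta$ from 0 to $\pi$.

(a) Using Eqs. (3.69) and (3.71) the nonzero $C$'s are

$$C_{2n+1} = \frac{4n+3}{2n+2}\,V_{0}\binom{-1/2}{n} \tag{3.179}$$

A few values are

$$C_{1} = \tfrac{3}{2}V_{0}, \qquad C_{3} = -\tfrac{7}{8}V_{0}, \qquad C_{5} = \tfrac{11}{16}V_{0}, \qquad C_{7} = -\tfrac{75}{128}V_{0}$$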
(b) The charge density on the surface is 


o=- ina (+1) 41+3)(17 \Pa(e) ——B.180) 
(c) The field at the center ($r = 0$) is

$$E = -\frac{3}{2}\,\frac{V_{0}}{a} \tag{3.181}$$

(d) The dipole moment of the system is

$$Q_{1}^{0} = \tfrac{3}{2}V_{0}a^{2} \tag{3.182}$$
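The expansion coefficients for the two hemispheres can be generated exactly with rational arithmetic. The sketch below assumes the hemispheres are held at $\pm V_{0}$, so that $C_{l}/V_{0} = (2l+1)\int_{0}^{1}P_{l}(\mu)\,d\mu$ for odd $l$; the function names are hypothetical. The Legendre polynomials are built as coefficient lists from the standard three-term recursion and integrated term by term, and the result is compared with the closed form $(-1)^{n}(2n)!/\left[2^{2n+1}\,n!\,(n+1)!\right]$ for the integral.

```python
from fractions import Fraction
from math import factorial

def legendre_coeffs(n):
    """Coefficients c[k] of x^k in P_n(x), as exact fractions."""
    prev, cur = [Fraction(1)], [Fraction(0), Fraction(1)]
    if n == 0:
        return prev
    for k in range(1, n):
        nxt = [Fraction(0)] * (k + 2)
        for i, c in enumerate(cur):      # (2k+1) x P_k
            nxt[i + 1] += (2 * k + 1) * c
        for i, c in enumerate(prev):     # - k P_{k-1}
            nxt[i] -= k * c
        prev, cur = cur, [c / (k + 1) for c in nxt]
    return cur

def integral_0_1(coeffs):
    """Exact integral over 0 <= x <= 1 of the polynomial with these coefficients."""
    return sum(c / (i + 1) for i, c in enumerate(coeffs))

def c_over_v0(n):
    """C_{2n+1} / V0 for hemispheres at +V0 and -V0 (assumed boundary values)."""
    return (4 * n + 3) * integral_0_1(legendre_coeffs(2 * n + 1))

def closed_form(n):
    """(4n+3) * (-1)^n (2n)! / (2^(2n+1) n! (n+1)!)."""
    return (4 * n + 3) * Fraction((-1) ** n * factorial(2 * n),
                                  2 ** (2 * n + 1) * factorial(n) * factorial(n + 1))
```

Calling `c_over_v0(n)` for `n = 0, 1, 2, 3` returns the fractions 3/2, -7/8, 11/16 and -75/128.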
Problem 3-17: In a conducting material there is a
spherical hole of radius $a$. Far from the hole a uniform current $J_{0}$ is flowing
in the $z$ direction. Find the current at each point.

The current is

$$\mathbf{j} = \sigma\mathbf{E} = -\sigma\nabla\Phi \tag{3.183}$$

where $\sigma$ is the conductivity of the material. The boundary condition of the
problem is

$$\frac{\partial\Phi}{\partial r} = 0 \quad \text{at } r = a \tag{3.184}$$

(a) At large distances

$$\Phi \to -\frac{J_{0}}{\sigma}\,r\cos\theta \tag{3.185}$$

The potential must therefore have the form

$$\Phi = -\frac{J_{0}}{\sigma}\left(r\cos\theta + \frac{B\cos\theta}{r^{2}}\right) \tag{3.186}$$

Applying the boundary condition it is found that

$$\Phi = -\frac{J_{0}}{\sigma}\left(r + \frac{a^{3}}{2r^{2}}\right)\cos\theta \tag{3.187}$$

(b) The current in general has components as follows:

Radial

$$j_{r} = J_{0}\cos\theta\left(1 - \frac{a^{3}}{r^{3}}\right) \tag{3.188}$$

Transverse

$$j_{\theta} = -J_{0}\sin\theta\left(1 + \frac{a^{3}}{2r^{3}}\right) \tag{3.189}$$

(c) The electric field in the hole is constant and equal to

$$E = \frac{3J_{0}}{2\sigma} \tag{3.190}$$

(d) A further problem which may be solved is to find the magnetic
field $\mathbf{B}$ generated by the currents (3.188) and (3.189). Since the leading
terms in these equations simply describe a constant current in the $z$
direction, by calculating with the second terms only we can calculate the
perturbation by the hole of the field of a constant current.

The equation for the vector potential of the field is, by (2.25),

$$\nabla^{2}\mathbf{A} = -\mu\,\mathbf{j}$$

This equation is readily solved for

$$j_{r} = -J_{0}\cos\theta\,\frac{a^{3}}{r^{3}}, \qquad j_{\theta} = -J_{0}\sin\theta\,\frac{a^{3}}{2r^{3}}$$

for $r > a$, and $\mathbf{j} = 0$ for $r < a$.




The potentials come out to be: forr < a 


A, =—épJ ar cos 8 (3.191) 
Ag=—KJoar sin 8 (3.192) 
andforr>a 
3 
A, = — pJy nz cos 8 (dr —4a) (3.193) 
rae 
Ag = — pJo = sin 6(4r—#a) (3.194) 
From the equation

$$\mathbf{B} = \nabla\times\mathbf{A}$$

it is now possible to calculate $\mathbf{B}$. The only nonzero component is $B_{\phi}$, and
it is found to be, for $r < a$


Bs =3peJ,a sin 6 (3.195) 
andforr >a 
3 
Bs = tuSo (r—4a) sin@ (3.196) 


Problem 3-18: Calculate the potential of a charge
distribution uniformly distributed over the interior of the spheroid

$$\frac{x^{2}+y^{2}}{a^{2}} + \frac{z^{2}}{b^{2}} = 1 \tag{3.197}$$

when the eccentricity is very small. To a good approximation, in this case

$$b^{2} = a^{2}(1 \pm \epsilon^{2})$$

according as the ellipsoid is prolate or oblate.


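The starting point is the formula

$$\Phi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')\,d^{3}r'}{|\mathbf{r}-\mathbf{r}'|} = \rho_{0}\int \frac{d^{3}r'}{|\mathbf{r}-\mathbf{r}'|} \tag{3.198}$$

It is useful to introduce the substitutions:

$$x' = R'a\sin\theta'\cos\phi', \qquad y' = R'a\sin\theta'\sin\phi', \qquad z' = R'b\cos\theta' \tag{3.199}$$

and similar ones, dropping the primes, for $\mathbf{r}$.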
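(a) It may be shown that, to the first order in $\epsilon^{2}$,

$$\Phi = \rho_{0}ab\int_{R'<1}\left[\frac{1}{|\mathbf{R}-\mathbf{R}'|} \mp \frac{\epsilon^{2}}{2}\,\frac{(z-z')^{2}}{|\mathbf{R}-\mathbf{R}'|^{3}}\right]d^{3}R' \tag{3.200}$$

where

$$z = R\cos\theta, \qquad z' = R'\cos\theta' \tag{3.201}$$

From a straightforward calculation, the integral of the first term is found
to be, for $R < 1$ (inside the spheroid)

$$2\pi\left(1 - \tfrac{1}{3}R^{2}\right) \tag{3.202}$$

and for $R > 1$ (outside the spheroid)

$$\frac{4\pi}{3R} \tag{3.203}$$

Consider next the second term; omitting the factor $\mp\tfrac{1}{2}\epsilon^{2}$ the integral is

$$\int_{R'<1}\frac{(z-z')^{2}}{|\mathbf{R}-\mathbf{R}'|^{3}}\,d^{3}R' = \int_{R'<1}\left[\frac{\partial}{\partial z'}\,\frac{z-z'}{|\mathbf{R}-\mathbf{R}'|} + \frac{1}{|\mathbf{R}-\mathbf{R}'|}\right]d^{3}R'$$

The second term in this integral combines with the one already calculated,
so that it may be taken into account by multiplying the results above by
$(1 \mp \tfrac{1}{2}\epsilon^{2})$. The first term may be changed into a surface integral over
$R' = 1$:

$$\int \frac{z-z'}{|\mathbf{R}-\mathbf{R}'|}\,\frac{\partial z'}{\partial R'}\,dS' = \int \cos\theta'\,(R\cos\theta - \cos\theta')\sum_{l}\sum_{m}\frac{4\pi}{2l+1}\,\frac{R_{<}^{\,l}}{R_{>}^{\,l+1}}\,Y_{l}^{m}(\theta',\phi')\,Y_{l}^{m*}(\theta,\phi)\,d\Omega' \tag{3.204}$$

where the integral is over the solid angles of $\mathbf{R}'$. The $m \neq 0$ terms drop
out, and, from (3.117),

$$\frac{4\pi}{2l+1}\,Y_{l}^{0}(\theta',\phi')\,Y_{l}^{0*}(\theta,\phi) = P_{l}(\cos\theta)\,P_{l}(\cos\theta') \tag{3.205}$$

Using the orthogonality of Legendre polynomials it is now possible to
evaluate (3.204). Only the $l = 0$, 1, and 2 terms in the sum contribute.
After some calculation the results may be written:

(b) For $R < 1$

$$\frac{4\pi}{3}\left[\frac{4}{15}R^{2}P_{2}(\cos\theta) + \frac{1}{3}R^{2} - 1\right] \tag{3.206}$$

and for $R > 1$

$$\frac{4\pi}{3R}\left[\left(\frac{2}{3} - \frac{2}{5R^{2}}\right)P_{2}(\cos\theta) - \frac{2}{3}\right] \tag{3.207}$$

(c) The various terms may now be combined. The total charge may
be written

$$q = \tfrac{4}{3}\pi\rho_{0}a^{2}b \tag{3.208}$$

The potentials come out to be, for $R < 1$

$$\Phi = \frac{3q}{2a}\left[\left(1 - \frac{1}{3}R^{2}\right) \mp \frac{\epsilon^{2}}{6}\left(1 - \frac{1}{3}R^{2} + \frac{8}{15}R^{2}P_{2}(\cos\theta)\right)\right] \tag{3.209}$$

and for $R > 1$

$$\Phi = \frac{q}{aR}\left[1 \mp \frac{\epsilon^{2}}{6} \mp \epsilon^{2}\left(\frac{1}{3} - \frac{1}{5R^{2}}\right)P_{2}(\cos\theta)\right] \tag{3.210}$$

where

$$R^{2} = \frac{x^{2}+y^{2}}{a^{2}} + \frac{z^{2}}{b^{2}} \tag{3.211}$$

In the terms of order $\epsilon^{2}$, we may put $R = r/a$ without changing the result
to order $\epsilon^{2}$.

(d) The quadrupole moment of the distribution is

$$Q_{2}^{0} = \pm\tfrac{1}{5}q\epsilon^{2}a^{2} = \tfrac{1}{5}q\,(b^{2}-a^{2}) \tag{3.212}$$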


Problem 3-19: Find the potential of a uniformly 
charged circular disk of radius a and negligible thickness, carrying total 
charge Q. What are its multipole moments? 


Problem 3-20: Consider two identical rings of radius 
a and of negligible cross section separated by a distance b, the line joining 
their centers being perpendicular to the plane of each. Find the potentials 
and multipole moments when they carry equal charges g, and when they 
carry equal and opposite charges q. 


Problem 3-21: Find the potential of a charge dis- 
tribution of quadrupole moment Q, distributed over the surface of a 
sphere of radius a, its angular variation being proportional to P, (cos 9), 
6 being measured from the symmetry axis of the distribution. 


Problem 3-22: Consider two disks like those described
in Problem 3-19, oppositely charged and at distance $b$, the line
joining their centers being perpendicular to each. Find the potential as
$q \to \infty$ and $b \to 0$ such that $\lim qb = \pi a^{2}\sigma$. (We call this a "dipole layer.")
Find its multipole moments.


Problem 3-23: Calculate the gravitational potential 
of a sphere of radius a with an eccentric spherical hole of radius b, whose 
center is at radius c. (b+c < a). 




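APPENDIX 3A: A Recursion Relation for $P_{l}^{m}(\cos\theta)$

We start with the recursion relation (2.84) for the Legendre polynomial
of order $(l-m)$:

$$(2l-2m+1)\,\mu P_{l-m} = (l-m+1)\,P_{l-m+1} + (l-m)\,P_{l-m-1}$$

Substituting the definitions of the $P$'s as derivatives of $1/r$ with respect to
$z$, and multiplying through by $(l-m)!\,(-1)^{l-m}/r^{l-m}$, we obtain the equation

$$-(2l-2m+1)\,z\,\partial_{z}^{\,l-m}\!\left(\frac{1}{r}\right) = r^{2}\,\partial_{z}^{\,l-m+1}\!\left(\frac{1}{r}\right) + (l-m)^{2}\,\partial_{z}^{\,l-m-1}\!\left(\frac{1}{r}\right)$$

We now operate on this with $\partial_{+}^{\,m}$ (where $\partial_{+} = \partial_{x} + i\partial_{y}$, so that $\partial_{+}r^{2} = 2(x+iy)$) to get

$$-(2l-2m+1)\,z\,\partial_{+}^{\,m}\partial_{z}^{\,l-m}\!\left(\frac{1}{r}\right) = (l-m)^{2}\,\partial_{+}^{\,m}\partial_{z}^{\,l-m-1}\!\left(\frac{1}{r}\right) + r^{2}\,\partial_{+}^{\,m}\partial_{z}^{\,l-m+1}\!\left(\frac{1}{r}\right) + 2m\,(x+iy)\,\partial_{+}^{\,m-1}\partial_{z}^{\,l-m+1}\!\left(\frac{1}{r}\right)$$

Now $(x+iy)$ passes through $\partial_{+}$ operators, and

$$(x+iy)\,\partial_{z}\!\left(\frac{1}{r}\right) = z\,\partial_{+}\!\left(\frac{1}{r}\right)$$

so that the last term becomes

$$2m\,\partial_{+}^{\,m}\partial_{z}^{\,l-m}\!\left(\frac{z}{r}\right) = 2mz\,\partial_{+}^{\,m}\partial_{z}^{\,l-m}\!\left(\frac{1}{r}\right) + 2m(l-m)\,\partial_{+}^{\,m}\partial_{z}^{\,l-m-1}\!\left(\frac{1}{r}\right)$$

on using Leibnitz's rule for differentiation of the product ($z$ times $1/r$).

Putting this back in the last term of the equation above, and using
(3.107) to write the derivatives in terms of the associated Legendre
functions, we get

$$(2l+1)\,z\,(-1)^{l}(l-m)!\,\frac{1}{r^{l+1}}\,P_{l}^{m} = r^{2}\,(-1)^{l}(l-m+1)!\,\frac{1}{r^{l+2}}\,P_{l+1}^{m} + (l^{2}-m^{2})\,(-1)^{l}(l-m-1)!\,\frac{1}{r^{l}}\,P_{l-1}^{m}$$

Dividing through by $(-1)^{l}(l-m)!/r^{l}$, we arrive at the required recursion
relation,

$$(2l+1)\,\mu P_{l}^{m} = (l-m+1)\,P_{l+1}^{m} + (l+m)\,P_{l-1}^{m}$$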


APPENDIX 3B: Review of Theory of Linear Differential Equations of 
the Second Order 
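Consider the equation

$$p_{0}(x)\,y'' + p_{1}(x)\,y' + p_{2}(x)\,y = 0 \tag{B.1}$$

THEOREM 1

If we know one solution, $y_{1}(x)$, we can find a second, $y_{2}$.
If we have two solutions, $y_{1}$ and $y_{2}$, then

$$p_{0}y_{1}'' + p_{1}y_{1}' + p_{2}y_{1} = 0 \tag{B.2}$$

$$p_{0}y_{2}'' + p_{1}y_{2}' + p_{2}y_{2} = 0 \tag{B.3}$$

Let us multiply the first equation by $y_{2}$, the second by $y_{1}$, and subtract.
We define the "Wronskian" $W$ of two solutions as

$$W = \begin{vmatrix} y_{1} & y_{2} \\ y_{1}' & y_{2}' \end{vmatrix} = y_{1}y_{2}' - y_{2}y_{1}' \tag{B.4}$$

Then we obtain

$$p_{0}W' + p_{1}W = 0 \tag{B.5}$$

so that

$$W = W_{0}\exp\left[-\int^{x}\frac{p_{1}(x')}{p_{0}(x')}\,dx'\right] \tag{B.6}$$

Once $W$ is known, $y_{2}$ can be readily expressed in terms of $y_{1}$. For if
we divide the equation

$$y_{1}y_{2}' - y_{2}y_{1}' = W$$

by $y_{1}^{2}$, we get

$$\frac{d}{dx}\left(\frac{y_{2}}{y_{1}}\right) = \frac{W}{y_{1}^{2}}$$

so that

$$y_{2} = y_{1}\int_{x_{0}}^{x}\frac{W(x')}{y_{1}^{2}(x')}\,dx' \tag{B.7}$$

$x_{0}$ being an arbitrary constant of integration.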
THEOREM 2

If we know the solution of the homogeneous equation (B.1) we can
find the solution of the inhomogeneous equation

$$p_{0}y'' + p_{1}y' + p_{2}y = r(x) \tag{B.8}$$

First, we take this equation along with (B.1)

$$p_{0}y_{1}'' + p_{1}y_{1}' + p_{2}y_{1} = 0$$

We multiply each by the solution of the other, and subtract to get

$$p_{0}(y_{1}y'' - yy_{1}'') + p_{1}(y_{1}y' - yy_{1}') = r(x)\,y_{1}(x) \tag{B.9}$$

or

$$p_{0}S' + p_{1}S = ry_{1} \tag{B.9a}$$

where $S = y_{1}y' - yy_{1}'$.

Next, we combine this equation with that for $W$, (B.5)

$$p_{0}W' + p_{1}W = 0$$

Again, we multiply each by the solution of the other and subtract:

$$p_{0}(WS' - SW') = ry_{1}W \tag{B.10}$$

Dividing by $p_{0}W^{2}$, this becomes

$$\frac{d}{dx}\left(\frac{S}{W}\right) = \frac{ry_{1}}{p_{0}W} \tag{B.11}$$

On integration, this leads to

$$S = W\int^{x}\frac{r(x')\,y_{1}(x')}{p_{0}(x')\,W(x')}\,dx'$$

Substituting for $S$ and dividing by $y_{1}^{2}$ gives

$$\frac{d}{dx}\left(\frac{y}{y_{1}}\right) = \frac{W}{y_{1}^{2}}\int^{x}\frac{r(x')\,y_{1}(x')}{p_{0}(x')\,W(x')}\,dx' = \frac{d}{dx}\left(\frac{y_{2}}{y_{1}}\right)\int^{x}\frac{r(x')\,y_{1}(x')}{p_{0}(x')\,W(x')}\,dx' \tag{B.12}$$

We now integrate this equation, integrating the right-hand side by parts

$$\frac{y}{y_{1}} = \frac{y_{2}}{y_{1}}\int^{x}\frac{r(x')\,y_{1}(x')}{p_{0}(x')\,W(x')}\,dx' - \int^{x}\frac{y_{2}(x')\,r(x')}{p_{0}(x')\,W(x')}\,dx'$$

$$y = \int^{x}\frac{r(x')\left[y_{2}(x)\,y_{1}(x') - y_{1}(x)\,y_{2}(x')\right]}{p_{0}(x')\,W(x')}\,dx' \tag{B.13}$$

The choice of lower limit in the integrals is of no importance, since the
contributions from the lower limit are simply multiples of $y_{2}$ and $y_{1}$,
respectively. But we know, of course, that we may add to any particular
solution of the inhomogeneous equation an arbitrary solution of the
homogeneous equation; that is, an arbitrary linear combination of $y_{1}$ and $y_{2}$.
The correct combination will be determined in a particular problem by the
given boundary conditions.

Equation (B.13) may be written

$$y(x) = \int^{x} r(x')\,G(x,x')\,dx' \tag{B.14}$$

where

$$G(x,x') = \frac{y_{2}(x)\,y_{1}(x') - y_{1}(x)\,y_{2}(x')}{p_{0}(x')\,W(x')} \tag{B.15}$$

is the "Green's function" of the problem.
In general, the choice of constants in the solutions $y_{1}$ and $y_{2}$ enables
us to find Green's functions satisfying any prescribed boundary conditions.
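The Green's function recipe can be exercised on an equation whose answer is known. The sketch below is a plain midpoint-rule evaluation of $\int_{x_{0}}^{x} r(x')\,G(x,x')\,dx'$ with $G$ built from two homogeneous solutions; the function name, the lower limit `x0` and the step count are assumptions. For $y'' + y = 1$ one may take $y_{1} = \cos x$, $y_{2} = \sin x$, $p_{0} = 1$, $W = 1$, so that $G = \sin(x - x')$ and the particular solution starting from $x_{0} = 0$ is $1 - \cos x$.

```python
import math

def particular_solution(r, y1, y2, p0, wronskian, x, x0=0.0, steps=4000):
    """Midpoint-rule estimate of the integral of r(x') G(x, x') from x0 to x,
    with G = [y2(x) y1(x') - y1(x) y2(x')] / [p0(x') W(x')]."""
    h = (x - x0) / steps
    total = 0.0
    for k in range(steps):
        xp = x0 + (k + 0.5) * h
        green = (y2(x) * y1(xp) - y1(x) * y2(xp)) / (p0(xp) * wronskian(xp))
        total += r(xp) * green
    return total * h

# Usage for y'' + y = 1 with y1 = cos, y2 = sin, W = y1*y2' - y2*y1' = 1.
y_at_2 = particular_solution(lambda t: 1.0, math.cos, math.sin,
                             lambda t: 1.0, lambda t: 1.0, 2.0)
```

Changing the lower limit `x0` only adds multiples of $y_{1}$ and $y_{2}$ to the result, in line with the remark about the choice of lower limit.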


PRELUDE TO CHAPTER 4 


This chapter is devoted to the very powerful methods 
of Fourier and Laplace transforms. 

The Fourier transform in one dimension (Fourier 
integral) is first introduced as a limit of a Fourier 
series, and some simple properties and theorems de- 
rived (Parseval’s theorem, the convolution theorem). 
As a simple example of the use of Fourier transforms, 
we solve the problem of the forced, damped, harmonic 
oscillator. It is shown that the solution is a convolution 
of the forcing function and a Green’s function (corre- 
sponding to a unit impulse). 

It is shown that, if there is no damping, permissible
mathematical solutions are time-reversal invariant, 
and so may violate the physical condition of causality. 
This difficulty is removed by any irreversible process 
(for example, damping). For causal functions, rela- 
tions are obtained between the real and imaginary 
parts of the solution (Kramers—Kronig dispersion 
relations). These relations have important applications 
in electrodynamics and elsewhere. 

The next topic considered is that of correlation 
functions in time-evolving phenomena (both cross 
correlation, that is, the correlation between different 
quantities at different times, and autocorrelation, that 
is, correlation between values of a given quantity at 
different times). Defining the “power spectrum” of a 
time function as the magnitude squared of the Fourier 
transform of the function, it is shown that the power 
spectrum and the autocorrelation function are Fourier 
transforms one of the other. This will be used in Chap- 
ter 7 in connection with random signals, which are not 
well-defined and whose power spectrum can therefore 
not be calculated directly, but for which the auto- 
correlation function is directly attainable. 

We next develop the theory of Fourier transforms 
in three dimensions. These will be used in a wide 
variety of problems in subsequent chapters. The basic 
properties needed for their use are derived. 




As an application, it is shown how Fourier trans- 
forms may be used to solve the Poisson equation of the 
previous chapter, and to determine Green’s functions. 

An interesting application of Fourier transforms 
is the proof of the Poisson summation formula, in both 
one and three dimensions. This formula makes possible 
a transformation from one infinite series involving a 
parameter to another. The relation between them is, 
that in the range of values of the parameter for which 
one series converges slowly, the other converges 
rapidly, and vice versa. In three dimensions the 
original series involves the periodicity of a lattice. The 
transformed form then involves the inverse lattice, 
exactly as defined in electron theory of solids. In fact, 
as in the solid-state problem what is involved is the 
Fourier transform of a function periodic in three 
dimensions. 

As an application of the Poisson formula, we 
calculate approximately the energy of a simple model 
of an ionic crystal, consisting of a periodic array of 
charges of alternating sign. This involves the calculation
of the so-called “Madelung constant.”

After a brief note on three-dimensional δ functions,
a further application is considered: the use of
Fourier transforms to evaluate certain two- and three- 
center integrals occurring in atomic and molecular 
physics. 

We next develop the basic theory of Laplace 
transforms, and fundamental theorems for such trans- 
forms—shifting theorem, convolution theorem. By 
calculating some simple transforms (of powers, ex- 
ponentials, and trigonometric functions) we see that 
we may determine inverse transforms of all rational 
functions of the transform variable. 

In order to calculate inverse transforms in general, 
we make the connection between Laplace and Fourier 
transforms. The Laplace inverse theorem may then be 
derived from its Fourier counterpart. 

There follows a discussion of the Laplace trans- 
form of periodic functions, and the treatment of 
resonances in the response of second-order linear 
systems. 

As a further exercise in the combined use of
Fourier and Laplace transforms, we use them to
re-solve the vibrating string problem of Chapter 1.

The last part of the chapter is devoted to the discussion
of special functions by transform methods.
Basic properties of Γ and B functions are derived, and
Stirling’s approximate formula for the Γ function of large
argument, which will be used extensively in problems
of probability in Chapter 7, is derived. This is done by
the “method of steepest descents,” the idea of which is
first introduced here but which will be used again in
Chapter 5 (Appendix).

It is then shown that Laplace transforms make 
possible the solution of the general linear differential 
equation with linear coefficients. The most important 
example of such an equation is the confluent hyper- 
geometric equation, the theory of which is treated in 
some detail. Particular functions which can be ex- 
pressed in terms of the confluent hypergeometric 
function are then discussed, for example, Laguerre 
functions and Hermite functions, which will arise in 
quantum problems in Chapter 9. A very general linear 
differential equation of second order, but with quad- 
ratic coefficients, is introduced, the solution of which 
can be obtained in terms of the confluent hyper- 
geometric function by substitutions. The Bessel 
equation is seen to be a particular example of such an 
equation. By expressing the solutions in terms of the 
confluent hypergeometric function we obtain deriva- 
tions of both integral and series forms for Bessel 
functions. The integral form is particularly useful for 
studying the general character of Bessel functions. 
The rest of the chapter is devoted to the theory of 
Bessel functions, including the following topics: the 
second (Neumann) solution of the Bessel equation; the 
distribution of zeros of the Bessel functions; the Han- 
kel functions of first and second kind, which will be 
used in scattering theory in Chapters 5 and 9; the 
recursion and derivative formulas; the generating 
function and a second integral form related thereto. 




REFERENCES 


Churchill, R. V., Fourier Series and Boundary Value Problems. New York: 
McGraw-Hill, 1951. 


Doetsch, G., Theorie und Anwendung der Laplace-Transformation. New York:
Dover, 1937.


Franklin, P., An Introduction to Fourier Methods and the Laplace Transfor- 
mation. New York: Dover, 1958. 


Hochstadt, H., Special Functions of Mathematical Physics. New York: Holt, 
Rinehart and Winston, 1961. 


Jaeger, J. C., An Introduction to the Laplace Transformation. London: Methuen, 
1949. 


Morse, P. M. and H. Feshbach, Methods of Theoretical Physics. New York: 
McGraw-Hill, 1953. 


Papoulis, A., The Fourier Integral and Its Applications. New York: McGraw- 
Hill, 1962. 


Sneddon, I. N., Fourier Transforms. New York: McGraw-Hill, 1951. 


Sommerfeld, A., Partial Differential Equations in Physics. New York: Academic 
Press, 1949. 


Tranter, C. J., Integral Transforms of Mathematical Physics. London: Methuen, 
1956. 


4


FOURIER AND 
LAPLACE 
TRANSFORMS 
AND THEIR 
APPLICATIONS 


“Everything has two handles,—one by which it 
can be borne: another by which it cannot” 
Epictetus 


1. Introduction: Fourier 
Transform in One Dimension 


The Sturm-Liouville problem 


d? 
Gr tly =0 (4.1) 


with the boundary condition of periodicity on the range (— a/2 = x < a/2): 


1(9)=1(-9 4 


27Nn 
a 9 


produces the eigenvalues 
A= n = integer (4.3) 


and the corresponding eigenfunctions, 


y=sin Pans (4.4) 
and 
y = cos aan (4.5) 


153 


154 Fourier and Laplace Transforms 


which form a degenerate pair. The expansion in these functions 


\[
  y = f(x) = \sum_{n=0}^{\infty}\left(a_n \sin\frac{2\pi n x}{a} + b_n \cos\frac{2\pi n x}{a}\right) \tag{4.6}
\]


is the well-known Fourier series expansion. 


This expansion may be put into a form which involves complex 
exponentials: 


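\[
  y = \sum_{n=-\infty}^{\infty} c_n e^{2\pi i n x/a} \tag{4.7}
\]
where
\[
  c_n = \tfrac{1}{2}(b_n - i a_n) \tag{4.8}
\]
and
\[
  c_{-n} = c_n^* = \tfrac{1}{2}(b_n + i a_n) \tag{4.9}
\]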


The expansion functions in (4.7) are mutually orthogonal, in the 
sense that 
\[
  \frac{1}{a}\int_{-a/2}^{a/2} e^{2\pi i n x/a}\, e^{-2\pi i m x/a}\, dx = \delta_{nm} \tag{4.10}
\]
We note that, for complex functions, the orthogonality integral involves 
the product of one function with the complex conjugate of the other. The 
expansion coefficients in (4.7) are, then 


\[
  c_n = \frac{1}{a}\int_{-a/2}^{a/2} f(x)\, e^{-2\pi i n x/a}\, dx \tag{4.11}
\]
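The coefficient formula (4.11) is easy to try out numerically. The following is a minimal sketch, not part of the original text: the function name `fourier_coefficient`, the midpoint-rule quadrature, and the particular test function are our own illustrative choices.

```python
import cmath
import math


def fourier_coefficient(f, n, a, samples=2000):
    """Midpoint-rule estimate of c_n = (1/a) * int_{-a/2}^{a/2} f(x) exp(-2*pi*i*n*x/a) dx."""
    h = a / samples
    total = 0j
    for j in range(samples):
        x = -a / 2 + (j + 0.5) * h
        total += f(x) * cmath.exp(-2j * math.pi * n * x / a)
    return total * h / a


# Hypothetical test function with a_1 = 3 and b_2 = 0.5 in the notation of (4.6).
a = 2.0
f = lambda x: 3.0 * math.sin(2 * math.pi * x / a) + 0.5 * math.cos(4 * math.pi * x / a)

c_1 = fourier_coefficient(f, 1, a)         # (4.8): (b_1 - i a_1)/2 = -1.5i
c_minus_1 = fourier_coefficient(f, -1, a)  # (4.9): complex conjugate of c_1
c_2 = fourier_coefficient(f, 2, a)         # (4.8): b_2/2 = 0.25
```

For a trigonometric polynomial such as this one the midpoint rule reproduces the coefficients to rounding error, so the relations (4.8) and (4.9) can be read off directly.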
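By going to the limit in which the interval of periodicity $a \to \infty$, we 
can get the Fourier integral theorem. The first step is to introduce 

\[
  y = \frac{2\pi n}{a} \tag{4.12}
\]
Then we can put 
\[
  \frac{a}{2\pi}\, c_n = F(y) = \frac{1}{2\pi}\int_{-a/2}^{a/2} f(x)\, e^{-ixy}\, dx \tag{4.13}
\]
At the same time, (4.7) becomes 
\[
  f(x) = \frac{2\pi}{a}\sum_{y} F(y)\, e^{ixy} \tag{4.14}
\]

Now, let $a \to \infty$. We note that $2\pi/a$ is the interval $\Delta y$ between $y$'s. 
Equation (4.14) becomes 

\[
  f(x) = \sum_{y} F(y)\, e^{ixy}\, \Delta y
\]
or 
\[
  f(x) = \int_{-\infty}^{\infty} F(y)\, e^{ixy}\, dy \tag{4.15}
\]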




while (4.13) gives 
\[
  F(y) = \frac{1}{2\pi}\int_{-\infty}^{\infty} f(x)\, e^{-ixy}\, dx \tag{4.16}
\]


The two formulas (4.15) and (4.16), taken together, constitute the state- 
ment of the Fourier integral theorem. It is, of course, valid only when the 
integrals exist. General conditions of validity are discussed in various 
mathematical texts.¹ The function $F(y)$ is known as the Fourier transform 
of $f(x)$. 
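As a quick numerical illustration of the transform pair (a sketch only; the helper `fourier_transform` and the choice of a Gaussian are ours, not the author's), the convention (4.16) with its factor $1/2\pi$ applied to $f(x) = e^{-x^2/2}$ should reproduce the known result $F(y) = e^{-y^2/2}/\sqrt{2\pi}$:

```python
import cmath
import math


def fourier_transform(f, y, half_width=12.0, samples=4000):
    """Midpoint-rule estimate of F(y) = (1/2pi) * int f(x) exp(-i x y) dx, as in (4.16).

    The infinite range is truncated to [-half_width, half_width]; this is an
    assumption that is harmless only for rapidly decaying f.
    """
    h = 2 * half_width / samples
    total = 0j
    for j in range(samples):
        x = -half_width + (j + 0.5) * h
        total += f(x) * cmath.exp(-1j * x * y)
    return total * h / (2 * math.pi)


gaussian = lambda x: math.exp(-x * x / 2)

y = 1.3
numeric = fourier_transform(gaussian, y)
exact = math.exp(-y * y / 2) / math.sqrt(2 * math.pi)
```

Other conventions place the factor $1/2\pi$ differently; the constants appearing in the theorems below (for instance the $2\pi$ in Parseval's theorem) depend on the choice made in (4.15) and (4.16).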

Let us now obtain a few valuable results involving Fourier transforms 
in one dimension (that is, Fourier transforms of functions of a single 
variable). 

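(a) If the transform of $f(x)$ is $F(y)$, the transform of $df/dx = f'(x)$, if 
it exists, is $iyF(y)$. 

To show that this is so, we write down the transform of $f'(x)$: 

\[
  \frac{1}{2\pi}\int_{-\infty}^{\infty} f'(x)\, e^{-ixy}\, dx
  = \frac{1}{2\pi}\Big[f(x)\, e^{-ixy}\Big]_{-\infty}^{\infty}
  + \frac{iy}{2\pi}\int_{-\infty}^{\infty} f(x)\, e^{-ixy}\, dx \tag{4.17}
\]

For the transform of $f(x)$ to exist, $f(x)$ must $\to 0$ at $\pm\infty$, so that the 
integrated term vanishes. Therefore the transform of $f'(x)$ is $iyF(y)$, as claimed. 

By the same token, the transform of the $n$th derivative $f^{(n)}(x)$, if it 
exists, is $(iy)^n F(y)$. 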

(b) The second theorem of interest to us is Parseval’s theorem: 


\[
  \int_{-\infty}^{\infty} |f(x)|^2\, dx = 2\pi\int_{-\infty}^{\infty} |F(y)|^2\, dy \tag{4.18}
\]


Consider the left-hand side of this equation with the limits of integration 
taken as $(-X, X)$; ultimately we will take $X \to \infty$. We substitute 
for $f(x)$, $f^*(x)$ from (4.15): 

\[
  \int_{-X}^{X} dx \left[\int_{-\infty}^{\infty} F(y)\, e^{ixy}\, dy\right]
  \left[\int_{-\infty}^{\infty} F^*(y')\, e^{-ixy'}\, dy'\right]
\]

Let us now interchange the order of integration, doing the $x$ integral first, 
to get 
\[
  \int_{-\infty}^{\infty} dy \int_{-\infty}^{\infty} dy'\, F(y)\, F^*(y')\, \frac{2\sin X(y-y')}{y-y'} \tag{4.19}
\]
Consider the properties of the function $[\sin X(y-y')]/[y-y']$ 
which occurs in this integral. When $y = y'$ it has the value $X$. It goes 
through the value zero whenever $y - y' = n\pi/X$ $(n \neq 0)$. Its oscillations diminish 
rapidly in amplitude away from $y = y'$. Its integral over $y$ from $-\infty$ to $\infty$ has the value $\pi$. 
The integral from $y' - \beta$ to $y' + \beta$ is 


\[
  \int_{-\beta}^{\beta} \frac{\sin X\xi}{\xi}\, d\xi = \int_{-\beta X}^{\beta X} \frac{\sin Z}{Z}\, dZ
\]

where we have put $Z = X\xi$. Thus, as $X \to \infty$, however small $\beta$ may be, this 
integral approaches $\pi$, and the contribution to the integral from $|y - y'| > \beta$ 
becomes negligibly small. We see, therefore, that as $X \to \infty$, 
$2[\sin X(y-y')]/[y-y']$ becomes in effect $2\pi$ times the $\delta$ function 
$\delta(y-y')$, and the integral (4.19) becomes 

\[
  2\pi\int_{-\infty}^{\infty} F(y')\, F^*(y')\, dy'
\]

which proves the theorem (4.18). 

¹ I. I. Hirschman, Jr., Infinite Series. New York: Holt, Rinehart and Winston, 1962. 
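Parseval's theorem (4.18), including its factor $2\pi$, can be checked numerically. The sketch below is our own illustration; the test function $(1+x)e^{-x^2/2}$, the grid, and the helper names are assumptions chosen so that both integrals converge quickly.

```python
import cmath
import math

# Midpoint grid on [-10, 10], used both for x and for the transform variable y.
H = 0.05
GRID = [-10.0 + (j + 0.5) * H for j in range(400)]


def f(x):
    # Hypothetical test function; its exact square integral is 1.5 * sqrt(pi).
    return (1.0 + x) * math.exp(-x * x / 2)


def transform(y):
    """F(y) = (1/2pi) * int f(x) exp(-i x y) dx, in the convention of (4.16)."""
    return H * sum(f(x) * cmath.exp(-1j * x * y) for x in GRID) / (2 * math.pi)


left = H * sum(abs(f(x)) ** 2 for x in GRID)                         # int |f|^2 dx
right = 2 * math.pi * H * sum(abs(transform(y)) ** 2 for y in GRID)  # 2pi int |F|^2 dy
```

The two numbers `left` and `right` agree to high accuracy; dropping the factor $2\pi$ would correspond to a different normalization of the transform pair.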

It is interesting to consider this theorem when in place of $x$ the 
variable is a time variable $t$, and the transform variable $y$ is a circular 
frequency $\omega$. Then $F(\omega)$ is the frequency amplitude of the time function 
$f(t)$. We call $2\pi|F(\omega)|^2$ the frequency spectrum; it gives the contribution 
to the “integrated intensity” 

\[
  \int_{-\infty}^{\infty} |f(t)|^2\, dt
\]


from various frequency intervals. If the intensity is integrated over a 
finite range, it is clear that it does not break up into distinct contributions 
from each frequency, but there are “‘interference”’ terms. The longer the 
interval T = 2X of integration, the narrower is the range of frequency 
overlap. 


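Problem 4-1: Show that the transform of the function 

\[
  f(t) = \begin{cases} \sin\omega_0 t, & |t| < 2\pi N/\omega_0 \\ 0, & |t| > 2\pi N/\omega_0 \end{cases}
\]
is 
\[
  F(\omega) = -\frac{i\omega_0}{\pi}\, \frac{1}{\omega^2 - \omega_0^2}\, \sin\frac{2\pi N\omega}{\omega_0}
\]

Sketch its frequency spectrum as a function of $\omega$. Show that the width 
between the first zeros on the two sides of its central peak is inversely 
proportional to $N$. 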


Problem 4-2: Calculate the Fourier transform of 
cos x’. 


Problem 4-3: Calculate the Fourier transform of 


\[
  -\frac{\partial}{\partial x}\, \frac{1}{e^{x}+1} = \frac{1}{(e^{x}+1)(e^{-x}+1)}
\]




2. The Convolution Theorem 


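The Fourier transform of the convolution 

\[
  C(x) = f_1(x) * f_2(x) = \int_{-\infty}^{\infty} f_1(x')\, f_2(x-x')\, dx' \tag{4.20}
\]
is 
\[
  \mathcal{C}(y) = 2\pi F_1(y)\, F_2(y) \tag{4.21}
\]
To show that this is so, we write down the transform of $C(x)$: 

\[
  \mathcal{C}(y) = \frac{1}{2\pi}\int_{-\infty}^{\infty} e^{-ixy}\left[\int_{-\infty}^{\infty} f_1(x')\, f_2(x-x')\, dx'\right] dx \tag{4.22}
\]

Let us change the order of the two integrations, and then, in the $x$ integral, 
introduce a new variable $x'' = x - x'$. Thus 

\[
  \mathcal{C}(y) = \frac{1}{2\pi}\int_{-\infty}^{\infty} dx'\, f_1(x')\, e^{-ix'y} \int_{-\infty}^{\infty} f_2(x'')\, e^{-ix''y}\, dx''
  = 2\pi F_1(y)\, F_2(y)
\]
as claimed. 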
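The convolution theorem lends itself to a direct numerical check. In the following sketch (ours, with hypothetical names `integrate`, `transform`, and `convolution`, and two arbitrarily chosen Gaussians) the convolution (4.20) is formed by quadrature, transformed, and compared with $2\pi F_1 F_2$:

```python
import cmath
import math


def integrate(g, lo=-10.0, hi=10.0, samples=400):
    """Midpoint rule on a truncated range (adequate for rapidly decaying integrands)."""
    h = (hi - lo) / samples
    return h * sum(g(lo + (j + 0.5) * h) for j in range(samples))


def transform(f, y):
    """F(y) = (1/2pi) * int f(x) exp(-i x y) dx, as in (4.16)."""
    return integrate(lambda x: f(x) * cmath.exp(-1j * x * y)) / (2 * math.pi)


f1 = lambda x: math.exp(-x * x / 2)
f2 = lambda x: math.exp(-(x - 1.0) ** 2)


def convolution(x):
    """C(x) = int f1(x') f2(x - x') dx', as in (4.20)."""
    return integrate(lambda xp: f1(xp) * f2(x - xp))


y = 0.8
lhs = transform(convolution, y)                           # transform of the convolution
rhs = 2 * math.pi * transform(f1, y) * transform(f2, y)   # right-hand side of (4.21)
```

Both sides are complex here because $f_2$ is not centered at the origin; they agree in modulus and phase.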
Example 
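Consider the differential equation 
\[
  \frac{d^2 f}{dt^2} + k\frac{df}{dt} + \omega_0^2 f = \varphi(t) \tag{4.23}
\]

Taking Fourier transforms of all terms of this equation, and letting the 
transforms of $f(t)$ and $\varphi(t)$ be $F(\omega)$ and $\Phi(\omega)$, respectively, we obtain 
the equation 

\[
  [-\omega^2 + ik\omega + \omega_0^2]\, F = \Phi \tag{4.24}
\]
or 
\[
  F(\omega) = -\frac{\Phi(\omega)}{\omega^2 - \omega_0^2 - ik\omega} \tag{4.25}
\]

Now if we designate the function whose transform is 
\[
  -\frac{1}{2\pi}\, \frac{1}{\omega^2 - \omega_0^2 - ik\omega}
\]

by $G(t)$, it follows from the convolution theorem that 

\[
  f(t) = \int_{-\infty}^{\infty} \varphi(t')\, G(t-t')\, dt' \tag{4.26}
\]
Consider, then 
\[
  G(t) = -\frac{1}{2\pi}\int_{-\infty}^{\infty} \frac{e^{i\omega t}\, d\omega}{\omega^2 - \omega_0^2 - ik\omega} \tag{4.27}
\]

This integral can be evaluated by contour integration (Fig. 4.1). 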




Figure 4.1 


The zeros of the denominator are at the points 
\[
  \omega_{\pm} = \tfrac{1}{2}\left[\pm\sqrt{4\omega_0^2 - k^2} + ik\right] \tag{4.28}
\]

There are clearly two cases to consider: $k < 2\omega_0$ and $k > 2\omega_0$. In the 
second case, the two roots are both pure imaginary numbers. In the first 
case, they are complex, and are located as indicated in Fig. 4.1. 

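Instead of evaluating the integral from $-\infty$ to $\infty$, we first evaluate it 
from $-R$ to $R$, and then, at the end, let $R \to \infty$. Let us construct a contour 
by drawing the semicircle of radius $R$ with center at the origin in 
the upper half of the $\omega$ plane. The integral around the contour is 

\[
  -\frac{1}{2\pi}\int_{-R}^{R} \frac{e^{i\omega t}\, d\omega}{(\omega-\omega_+)(\omega-\omega_-)}
  - \frac{1}{2\pi}\int_{C} \frac{e^{i\omega t}\, d\omega}{(\omega-\omega_+)(\omega-\omega_-)}
\]

where $C$ denotes the semicircular contour. The value $I$ of the integral is $2\pi i$ 
times the sum of the residues at the poles; thus 

\[
  I = -i\left[\frac{e^{i\omega_+ t}}{\omega_+ - \omega_-} + \frac{e^{i\omega_- t}}{\omega_- - \omega_+}\right]
    = \frac{1}{\sqrt{\omega_0^2 - k^2/4}}\, e^{-kt/2} \sin\sqrt{\omega_0^2 - k^2/4}\; t \tag{4.29}
\]

It is easy to see that this represents $G(t)$ for $t > 0$, since in this case the 
integral on the semicircle $C$ approaches zero as $R \to \infty$. For on the semicircle 
$\omega = Re^{i\theta} = R(\cos\theta + i\sin\theta)$, so 

\[
  e^{i\omega t}\, d\omega = e^{iR\cos\theta\, t}\, e^{-R\sin\theta\, t}\, iRe^{i\theta}\, d\theta
\]

Since the rest of the integrand is bounded on the semicircle and $Re^{-R\sin\theta\, t} \to 0$ 
as $R \to \infty$, the integral on $C$ must approach zero. 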




Thus, for $t > 0$ 
\[
  G(t) = \frac{1}{\sqrt{\omega_0^2 - k^2/4}}\, e^{-kt/2} \sin\sqrt{\omega_0^2 - k^2/4}\; t \tag{4.31}
\]


On the other hand, for $t < 0$, it is not fruitful to take a contour around 
the upper half-plane, since the integral on this contour would approach 
infinity as $R \to \infty$. We therefore take, in this case, a semicircular contour 
around the lower half-plane. The integral on the semicircle again approaches 
zero as $R \to \infty$; however, in this case there are no poles inside 
the contour, and therefore 


G(t)=0, t<0 (4.32) 
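The result (4.31), (4.32) can be checked without any contour integration. Since (4.26) with a $\delta$-function stimulus gives $f = G$, the function $G(t)$ should satisfy the homogeneous form of (4.23) for $t > 0$, vanish at $t = 0$, and start with unit slope. The sketch below is ours (the names `green` and `ode_residual`, the finite-difference step, and the parameter values are assumptions) and treats only the underdamped case $k < 2\omega_0$.

```python
import math


def green(t, k, w0):
    """G(t) of (4.31) and (4.32), assuming the underdamped case k < 2*w0."""
    if t < 0:
        return 0.0
    big_omega = math.sqrt(w0 * w0 - k * k / 4)
    return math.exp(-k * t / 2) * math.sin(big_omega * t) / big_omega


def ode_residual(t, k, w0, h=1e-4):
    """Central-difference estimate of G'' + k G' + w0^2 G at a time t > h."""
    g_minus, g_0, g_plus = (green(t + s, k, w0) for s in (-h, 0.0, h))
    second = (g_plus - 2 * g_0 + g_minus) / (h * h)
    first = (g_plus - g_minus) / (2 * h)
    return second + k * first + w0 * w0 * g_0


k, w0 = 0.4, 1.5
residual = ode_residual(1.3, k, w0)                                   # close to zero
initial_slope = (green(1e-6, k, w0) - green(0.0, k, w0)) / 1e-6       # close to one
```

The unit jump in the slope at $t = 0$ is what a unit impulse applied to the oscillator would produce.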


This result is certainly what would be expected on physical grounds. 
For Eq. (4.26) may be interpreted in this way: that $G(t-t')$ is the 
“response” $f(t)$ at time $t$ to the stimulus $\varphi(t')$ at time $t'$. This should be 
different from zero, then, only if $t > t'$. In the mathematics, however, we 
may note that this feature followed from the fact that $k > 0$. The $k(df/dt)$ 
term in (4.23) is a dissipative term, causing a diminution in the response as 
$t$ increases. It is the only term in the description of the system which is not 
time-reversible; thus, it is the only term which can distinguish a direction 
of time. Such a distinction in the direction of time is, of course, in turn, 
essential to the interpretation of causality. 

In the light of such considerations, it is interesting to consider the 
case of no dissipation (k = 0). Now if there were no dissipative processes 
in nature, the direction of time would indeed be reversible (that is, if a 
given system were observed evolving in reverse time order, it would be 
observed to obey the familiar laws of physics). But we know that, in fact, 
dissipative processes are always present in physical systems, however 
small may be their effects; thus, one should really let $k \to 0$ from positive 
values rather than baldly putting it equal to zero; when ambiguities 
threaten to appear, they should be resolved by taking k to be positive but 
vanishingly small. 

Now, as $k \to 0$ the poles in (4.27) approach the real axis. The first 
problem is, then, to interpret the integrals along the real axis, which now 
runs through these poles. Let us write (4.27) as 

\[
  G(t) = -\frac{1}{4\pi\omega_0}\left[\int_{-\infty}^{\infty} \frac{e^{i\omega t}\, d\omega}{\omega - \omega_+}
  - \int_{-\infty}^{\infty} \frac{e^{i\omega t}\, d\omega}{\omega - \omega_-}\right] \tag{4.33}
\]

We can consider only the second of the integrals in the square bracket; 
the other may then be evaluated by replacing $\omega_-$ by $\omega_+$ in the result. 

Let us define in this case a “principal part” integral; that is, we 
cut out from the range of integration an interval of length $\delta$ on both sides 
of $\omega_- = -\omega_0$. The principal part $P$ is then obtained by taking the limit as 
$\delta \to 0$. A contour may then be constructed as shown in Fig. 4.2. This 




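differs from that of Fig. 4.1 by virtue of the small semicircle of radius $\delta$ 
about $\omega = \omega_-$. We then have 

\[
  P\int_{-\infty}^{\infty} \frac{e^{i\omega t}}{\omega - \omega_-}\, d\omega
  + \int_{\text{semicircle}} \frac{e^{i\omega t}}{\omega - \omega_-}\, d\omega = 2\pi i\, (\text{residue}) \tag{4.34}
\]

The small semicircle contributes $i\pi$ times the residue, so that the left-hand 
side may be written 

\[
  P\int_{-\infty}^{\infty} \frac{e^{i\omega t}}{\omega - \omega_-}\, d\omega
  + i\pi\int_{-\infty}^{\infty} e^{i\omega t}\, \delta(\omega - \omega_-)\, d\omega
\]

Thus, when the pole comes on the real axis, we may write symbolically, 

\[
  \frac{1}{\omega - \omega'} \to P\, \frac{1}{\omega - \omega'} + i\pi\, \delta(\omega - \omega') \tag{4.35}
\]

This is interpreted to mean that an integral involving $1/(\omega - \omega')$ is to be 
understood as the principal-value integral $+\ \pi i$ times the integral with the 
$\delta$ function. 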

If, instead of introducing the Green’s function explicitly we had tried 
to calculate f directly by inverting (4.25), this could also have been done. 
The integral in 


\[
  f(t) = -\int_{-\infty}^{\infty} \frac{\Phi(\omega)\, e^{i\omega t}}{\omega^2 - \omega_0^2 - ik\omega}\, d\omega \tag{4.36}
\]
can usually also be evaluated by contour integration. To the contributions 
from the poles of the transform of the Green’s function, we must add 
contributions from the poles of $\Phi(\omega)$. If $k = 0$, the contributions from $\omega = \pm\omega_0$ 
can be evaluated using (4.35); alternatively, the result may be evaluated 
for finite $k$ and the limit $k \to 0$ taken later. 


Figure 4.2 




3. Causality and Dispersion 
Relations 


We define a “causal” function as a function $g(t)$ which is zero for 
$t < t_0$. For simplicity, we choose $t_0 = 0$. Let us consider the question of 
the limitations on the Fourier transform of such a function. 
Written in terms of its transform $G(\omega)$, the function $g(t)$ is [cf. (4.15)] 

\[
  g(t) = \int_{-\infty}^{\infty} G(\omega)\, e^{i\omega t}\, d\omega \tag{4.15}
\]
where 
\[
  G(\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty} g(t)\, e^{-i\omega t}\, dt
            = \frac{1}{2\pi}\int_{0}^{\infty} g(t)\, e^{-i\omega t}\, dt \tag{4.37}
\]

This defines $G(\omega)$ for real $\omega$; however, the definition can be extended 
into the lower half of the complex $\omega$ plane, since the integral converges if $\omega$ has a 
negative imaginary part. 

Consider now the integral 


\[
  \frac{1}{2\pi i}\oint \frac{G(\omega')}{\omega' - \omega}\, d\omega' \tag{4.38}
\]

where $\omega$ is real. We evaluate this using the contour of Fig. 4.3. Let us 
suppose that $G(\omega)$ is such that the integral around the semicircle in the 
lower half-plane is zero as its radius approaches infinity. Taking now the 
integral around the complete contour of Fig. 4.3, we find that its value, 
which is zero, is made up of two parts: a principal part integral and a 
contribution around the small semicircular contour ($\delta$-function contribution). 
Therefore it follows that 

\[
  \frac{1}{2\pi i}\, P\int_{-\infty}^{\infty} \frac{G(\omega')}{\omega' - \omega}\, d\omega' + \frac{1}{2}\, G(\omega) = 0 \tag{4.39}
\]

Figure 4.3 

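If we now break up $G(\omega)$ into its real and imaginary parts 

\[
  G(\omega) = G_1(\omega) + iG_2(\omega) \tag{4.40}
\]

and equate real and imaginary parts of (4.39), we get the relations 
(Kramers-Kronig relations): 

\[
  G_1(\omega) = -\frac{1}{\pi}\, P\int_{-\infty}^{\infty} \frac{G_2(\omega')}{\omega' - \omega}\, d\omega' \tag{4.41}
\]
and 
\[
  G_2(\omega) = \frac{1}{\pi}\, P\int_{-\infty}^{\infty} \frac{G_1(\omega')}{\omega' - \omega}\, d\omega' \tag{4.42}
\]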


Equations (4.41) and (4.42) have been shown to be valid only if the 
integral on the semicircle of infinite radius around the lower half-plane 
vanishes. 
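The relations (4.41) and (4.42) can be tested numerically on the transform of the damped-oscillator Green's function, $G(\omega) = 1/(\omega^2 - \omega_0^2 - ik\omega)$ (the constant factor $-1/2\pi$ is dropped, as it does not affect the relations). The sketch below is our own: the principal value is approximated by pairing points placed symmetrically about the singularity, and the truncation width and step are assumed values.

```python
import math


def G(w, w0=1.0, k=0.5):
    """Oscillator response 1/(w^2 - w0^2 - i k w); its poles lie in the upper half-plane."""
    return 1.0 / (w * w - w0 * w0 - 1j * k * w)


def principal_value(func, w, half_width=60.0, h=0.002):
    """P int func(w') / (w' - w) dw', using midpoints symmetric about w' = w.

    Pairing w + x with w - x cancels the singular part, which is one simple
    way (an assumption of this sketch) to realize the principal part.
    """
    total = 0.0
    for j in range(int(half_width / h)):
        x = (j + 0.5) * h
        total += (func(w + x) - func(w - x)) / x
    return total * h


w = 0.7
g2_from_g1 = principal_value(lambda u: G(u).real, w) / math.pi    # right side of (4.42)
g1_from_g2 = -principal_value(lambda u: G(u).imag, w) / math.pi   # right side of (4.41)
```

Comparing `g1_from_g2` with `G(w).real` and `g2_from_g1` with `G(w).imag` confirms the signs in (4.41) and (4.42) for a function analytic in the lower half-plane.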


Problem 4-4: Apply the foregoing to the transform 
of the Green’s function (4.27) of the problem discussed in the last section. 
Take 

\[
  G(\omega') = \frac{1}{\omega'^2 - \omega_0^2 - ik\omega'}
\]
where, for convenience, we omit the factor $(-1/2\pi)$. Then 
\[
  G_1 = \frac{\omega'^2 - \omega_0^2}{(\omega'^2 - \omega_0^2)^2 + k^2\omega'^2}
\]
and 
\[
  G_2 = \frac{k\omega'}{(\omega'^2 - \omega_0^2)^2 + k^2\omega'^2}
\]
Verify by direct integration that these functions $G_1$ and $G_2$ satisfy the 
Kramers-Kronig relations. Sketch these functions. 


Problem 4-5: The (imaginary) conductivity of an 
electron gas of $N$ electrons per unit volume in a long-wavelength electromagnetic 
field of frequency $\omega$ is 
\[
  \sigma = \frac{iNe^2}{m}\, \frac{1}{\omega - \omega_c}
\]

when there is a constant magnetic field $B$ and the wave is right circularly 
polarized relative to this direction. $-e$ and $m$ represent the electron 
charge and mass, respectively, and $\omega_c$ is the Larmor frequency $eB/mc$. 
Show that the real part of the conductivity, which determines the absorption 
of the wave, is 

\[
  \sigma_1 = \frac{\pi Ne^2}{m}\, \delta(\omega - \omega_c)
\]

(The Kramers-Kronig relations obviously do not apply directly to 
“functions” as anomalous as the $\delta$ function. This difficulty may be overcome 
by attributing to $\omega$ a small imaginary part $i/\tau$.) 


Note the reason for choosing to deform the contour around the lower 
half-plane. From (4.37) 

\[
  G(\omega_1 + i\omega_2) = \frac{1}{2\pi}\int_{0}^{\infty} g(t)\, e^{-i\omega_1 t}\, e^{\omega_2 t}\, dt \tag{4.43}
\]

Therefore, provided $g(t)$ remains bounded, the integral for $G(\omega)$ will 
converge so long as $\omega_2 < 0$. 

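The Kramers-Kronig relations (4.41) and (4.42) are also obviously 
only valid if the integrals on the right-hand side converge. This would not 
be the case, for instance, if $G_1$, $G_2$ approached a constant as $\omega \to \infty$ on the 
real axis. In that case, however, we could have obtained dispersion 
relations using in (4.38), instead of $G(\omega)$, the function $G(\omega)/(\omega - \omega_0)$, 
where $\omega_0$ is arbitrary. Taking $\omega_0 = 0$, for instance, instead of (4.41) and 
(4.42) we would get the relations 

\[
  G_1(\omega) = G_1(0) - \frac{\omega}{\pi}\, P\int_{-\infty}^{\infty} \frac{G_2(\omega')}{\omega'(\omega' - \omega)}\, d\omega' \tag{4.44}
\]
and 
\[
  G_2(\omega) = G_2(0) + \frac{\omega}{\pi}\, P\int_{-\infty}^{\infty} \frac{G_1(\omega')}{\omega'(\omega' - \omega)}\, d\omega' \tag{4.45}
\]

the constants $G_1(0)$ and $G_2(0)$ arising from the additional pole of the 
integrand at $\omega' = 0$. 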


4. Linear Response Functions 


The foregoing considerations are applicable to the following rather 
general problem. Take a physical system, subject to an external stimulus 
$\varphi(t)$, which responds linearly to this stimulus. This is expressed 
mathematically in the following way: if the function $f(t)$ measures the 
“response,” then it may be written in the form 

\[
  f(t) = \int_{-\infty}^{t} L(t-t')\, \varphi(t')\, dt' \tag{4.46}
\]
where $L(t)$ is “causal,” that is to say 

\[
  L(t) = 0, \qquad t < 0 \tag{4.47}
\]




Taking the Fourier transform of (4.46) and using the convolution theorem 
(4.21) gives 
\[
  F(\omega) = 2\pi L(\omega)\, \Phi(\omega) \tag{4.48}
\]

If the “stimulus” has a definite circular frequency, the response per unit 
stimulus at that frequency is $2\pi L(\omega)$. 

It follows from (4.47) that this function must satisfy the Kramers- 
Kronig dispersion relations. 


5. Cross Correlation and 
Autocorrelation Functions 


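Consider two time functions $f_1(t)$ and $f_2(t)$, whose square integrals are 
finite. We may define a cross correlation between these functions for 
times separated by an interval $\tau$ 

\[
  C_{12}(\tau) = \frac{\displaystyle\int_{-\infty}^{\infty} f_1(t)\, f_2(t+\tau)\, dt}
  {\left[\displaystyle\int_{-\infty}^{\infty} f_1^2(t)\, dt \int_{-\infty}^{\infty} f_2^2(t)\, dt\right]^{1/2}} \tag{4.49}
\]
If $f_1 = f_2 = f$, this becomes the autocorrelation function 
\[
  C(\tau) = \frac{\displaystyle\int_{-\infty}^{\infty} f(t)\, f(t+\tau)\, dt}{\displaystyle\int_{-\infty}^{\infty} f^2(t)\, dt} \tag{4.50}
\]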


This function has a number of interesting applications; probably the most 
interesting is its use in determining the frequency spectrum of certain 
stochastic functions, an application which we discuss in detail in Chapter 7. 
Let us derive two important properties of this function. 
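The first is that $|C_{12}(\tau)| \le 1$. This is seen by considering 

\[
  N(\lambda) = \int_{-\infty}^{\infty} [f_1(t) + \lambda f_2(t+\tau)]^2\, dt \ge 0 \tag{4.51}
\]
for arbitrary real values of $\lambda$. Putting 
\[
  n_1^2 = \int_{-\infty}^{\infty} f_1^2(t)\, dt \tag{4.52}
\]
and 
\[
  n_2^2 = \int_{-\infty}^{\infty} f_2^2(t)\, dt \tag{4.53}
\]
we can write (4.51) as 
\[
  \lambda^2 n_2^2 + 2\lambda n_1 n_2 C_{12}(\tau) + n_1^2 \ge 0 \tag{4.54}
\]

By completing the square, we find that 
\[
  [\lambda n_2 + n_1 C_{12}(\tau)]^2 + n_1^2[1 - C_{12}^2(\tau)] \ge 0
\]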




for all real $\lambda$’s. It is therefore true for the value which makes the first 
(squared) term zero. Therefore 

\[
  C_{12}^2(\tau) \le 1 \tag{4.55}
\]
as stated above. 
The second property may be expressed in the theorem that the 
Fourier transform of $C_{12}(\tau)$ is 

\[
  \frac{2\pi}{n_1 n_2}\, F_1^*(\omega)\, F_2(\omega)
\]

A special consequence is that the Fourier transform of the autocorrelation 
function is proportional to the frequency spectrum of the function $f(t)$. 

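To demonstrate the validity of this result, consider first the Fourier 
transform of the numerator of $C_{12}(\tau)$. It is 

\[
  \frac{1}{2\pi}\int_{-\infty}^{\infty} e^{-i\omega\tau}\left[\int_{-\infty}^{\infty} f_1(t)\, f_2(t+\tau)\, dt\right] d\tau
\]

If the order of the integrations is interchanged, this becomes 

\[
  \frac{1}{2\pi}\int_{-\infty}^{\infty} f_1(t)\left[\int_{-\infty}^{\infty} f_2(t+\tau)\, e^{-i\omega\tau}\, d\tau\right] dt
\]

For fixed $t$, in the inner integral it is possible to replace $\tau$ as the variable 
of integration by $t' = t + \tau$. Then we get 

\[
  \frac{1}{2\pi}\int_{-\infty}^{\infty} f_1(t)\, e^{i\omega t}\, dt \int_{-\infty}^{\infty} f_2(t')\, e^{-i\omega t'}\, dt'
  = 2\pi F_1^*(\omega)\, F_2(\omega) \tag{4.56}
\]

Therefore the Fourier transform of $C_{12}(\tau)$, $\mathcal{F}(C_{12}(\tau))$, is 

\[
  \mathcal{F}(C_{12}(\tau)) = \frac{2\pi F_1^*(\omega)\, F_2(\omega)}{n_1 n_2} \tag{4.57}
\]
The Fourier transform of the autocorrelation function is 
\[
  \mathcal{F}(C(\tau)) = \frac{2\pi}{n^2}\, |F(\omega)|^2, \qquad n^2 = \int_{-\infty}^{\infty} f^2(t)\, dt \tag{4.58}
\]

From Parseval’s theorem (4.18) the integral over frequency of this 
transform is 1. Thus, the Fourier transform of the autocorrelation function 
is the frequency distribution function of $f(t)$. 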

In practical terms, the usefulness of the result stems from the fact 
that, even if one has a function (“signal”) for which no analytic form is 
known, but the autocorrelation function can be determined either analyti- 
cally or numerically, the frequency distribution function of the signal can 
be obtained. 
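Both properties can be illustrated on sampled functions. In the sketch below (our own; the two test signals, the grid, and the helper names are assumptions) the bound $|C_{12}(\tau)| \le 1$ is checked over a range of lags, and the transform of the autocorrelation function is compared with $2\pi|F(\omega)|^2$ divided by the square integral of $f$.

```python
import cmath
import math

# Midpoint grid on [-12, 12]; adequate for the rapidly decaying signals used here.
H = 0.05
GRID = [-12.0 + (j + 0.5) * H for j in range(480)]


def integrate(g):
    return H * sum(g(t) for t in GRID)


def cross_correlation(f1, f2, tau):
    """C_12(tau) of (4.49) for real, square-integrable f1 and f2."""
    numerator = integrate(lambda t: f1(t) * f2(t + tau))
    n1 = math.sqrt(integrate(lambda t: f1(t) ** 2))
    n2 = math.sqrt(integrate(lambda t: f2(t) ** 2))
    return numerator / (n1 * n2)


def transform(g, w):
    """(1/2pi) * int g(t) exp(-i w t) dt, the convention of (4.16)."""
    return integrate(lambda t: g(t) * cmath.exp(-1j * w * t)) / (2 * math.pi)


# Hypothetical signals: a modulated Gaussian and an odd pulse.
f = lambda t: math.exp(-t * t / 2) * math.cos(3 * t)
g = lambda t: t * math.exp(-t * t)

largest = max(abs(cross_correlation(f, g, 0.25 * m)) for m in range(-20, 21))

n_squared = integrate(lambda t: f(t) ** 2)
autocorrelation = lambda tau: integrate(lambda t: f(t) * f(t + tau)) / n_squared

w = 3.0
spectrum_from_correlation = transform(autocorrelation, w)
spectrum_direct = 2 * math.pi * abs(transform(f, w)) ** 2 / n_squared
```

In a practical setting only samples of the signal would be available, and the autocorrelation would be estimated from them; the analytic test signals here merely make the comparison exact to quadrature accuracy.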

Consider the application of the above considerations to a system 
with a linear response. The correlation $C_{12}(\tau)$ between stimulus $f_1(t) = \varphi(t)$ 
and response $f_2(t) = f(t)$ has a transform given by (4.57). Here 
$F_1^*(\omega)$ is $\Phi^*(\omega)$, whereas $F_2(\omega)$ is, from (4.48), $2\pi L(\omega)\Phi(\omega)$. Thus, 

\[
  \mathcal{F}(C_{12}(\tau)) = \frac{4\pi^2 |\Phi(\omega)|^2}{n_1 n_2}\, L(\omega) \tag{4.59}
\]

Thus, from a knowledge of the correlation between stimulus and response 
as a function of time, and of the frequency spectrum of the stimulus, it is 
possible to determine $L(\omega)$, which determines the response of the system 
at each frequency. 

The result (4.59) may be written in the simpler form 

\[
  \mathcal{F}\left(\int_{-\infty}^{\infty} \varphi(t)\, f(t+\tau)\, dt\right) = 4\pi^2 |\Phi(\omega)|^2 L(\omega) \tag{4.60}
\]


6. Fourier Transform in 
Three Dimensions 


A function of several variables may be transformed with respect to 
each of its variables; in taking the transform with respect to any one of 
these variables, the others are treated as constants. The commonest in- 
stance in physics is that of a function of the three Cartesian coordinates 
of a point in space. Just as, for a function of a single variable, the Fourier 
integral may be taken as the limit of a Fourier series as the interval of 
periodicity becomes infinitely large, so may the three-dimensional trans- 
form of a function of space coordinates be considered as the limit of a 
Fourier series in each of the three variables. 

It is most convenient to represent a function of coordinates in terms 
of the position vector $\mathbf{r}$ with components $x$, $y$, and $z$. We define the Fourier 
transform $F(\mathbf{k}) = F(k_x, k_y, k_z)$ of the function $f(\mathbf{r}) = f(x, y, z)$ as 

\[
  F(\mathbf{k}) = \left(\frac{1}{2\pi}\right)^{3} \int f(\mathbf{r})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r \tag{4.61}
\]

where $d^3r$ represents the volume element $dx\,dy\,dz$ and the integral is over 
all space, that is, the range from $-\infty$ to $\infty$ in each coordinate. This follows 
from (4.16), each coordinate being transformed separately. The expression 
of the function in terms of its transform follows similarly from (4.15), 
and is 

\[
  f(\mathbf{r}) = \int F(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{4.62}
\]


where the integral is over the whole of k space. 

We now demonstrate a number of useful theorems. In all instances 
the validity of these theorems will, of course, be dependent on the existence 
of the integrals indicated. That is, the functions being transformed 
will be assumed to go to zero faster than $1/r^2$ as $r = |\mathbf{r}| \to \infty$. In all cases 
surface integrals over surfaces at infinity will be taken to be zero. Use 
will be made of Gauss’ theorem (the divergence theorem) in vector form: 
where $\mathbf{A}(\mathbf{r})$ is a vector function of position 

\[
  \int_{(V)} \nabla\cdot\mathbf{A}(\mathbf{r})\, d^3r = \int_{(S)} \mathbf{A}(\mathbf{r})\cdot\mathbf{n}\, dS \tag{4.63}
\]

where the integral on the left is taken over an arbitrary volume and that 
on the right is over the surface surrounding that volume. $\mathbf{n}$ is the unit 
normal vector drawn outward from the surface. 

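(a) The transform of the gradient $\nabla f$ is $i\mathbf{k}F(\mathbf{k})$. 

From (4.61) the transform of $\nabla f$ is 

\[
  \mathcal{F}(\nabla f) = \left(\frac{1}{2\pi}\right)^{3} \int \nabla f\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r
  = \left(\frac{1}{2\pi}\right)^{3} \left[\int \nabla\!\left(f e^{-i\mathbf{k}\cdot\mathbf{r}}\right) d^3r
  + i\mathbf{k}\int f e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r\right]
\]

The first integral may be transformed into the surface integral $\int f e^{-i\mathbf{k}\cdot\mathbf{r}}\, \mathbf{n}\, dS$ 
over the surface at infinity and so is zero. Therefore, 

\[
  \mathcal{F}(\nabla f) = i\mathbf{k}F(\mathbf{k}) \tag{4.64}
\]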


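(b) The transform of the divergence of the vector function $\mathbf{a}(\mathbf{r})$ is 
$i\mathbf{k}\cdot\mathbf{A}(\mathbf{k})$, where $\mathbf{A}(\mathbf{k})$ is the transform of $\mathbf{a}(\mathbf{r})$. 
By definition the transform of $\nabla\cdot\mathbf{a}$ is 

\[
  \mathcal{F}(\nabla\cdot\mathbf{a}) = \left(\frac{1}{2\pi}\right)^{3} \int (\nabla\cdot\mathbf{a})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r
\]
By the use of the identity 
\[
  \nabla\cdot\left(\mathbf{a}\, e^{-i\mathbf{k}\cdot\mathbf{r}}\right) = (\nabla\cdot\mathbf{a})\, e^{-i\mathbf{k}\cdot\mathbf{r}} - i\mathbf{k}\cdot\mathbf{a}\, e^{-i\mathbf{k}\cdot\mathbf{r}} \tag{4.65}
\]

the integral may be written in two parts. The first, $\int \nabla\cdot(\mathbf{a}\, e^{-i\mathbf{k}\cdot\mathbf{r}})\, d^3r$, 
transforms into the surface integral $\int e^{-i\mathbf{k}\cdot\mathbf{r}}\, \mathbf{a}\cdot\mathbf{n}\, dS$ which approaches 0; 
the second is $i\mathbf{k}\cdot\int \mathbf{a}\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r$. Therefore, 

\[
  \mathcal{F}(\nabla\cdot\mathbf{a}) = i\mathbf{k}\cdot\mathbf{A}(\mathbf{k}) \tag{4.66}
\]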
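(c) The Fourier transform of the Laplacian of a function $f$, $\nabla^2 f$, is 
\[
  \mathcal{F}(\nabla^2 f) = -k^2 F \tag{4.67}
\]

This follows from the two previous results, if $\mathbf{a}$ is taken to be $\nabla f$. 
Thus 
\[
  \mathcal{F}(\nabla^2 f) = \mathcal{F}(\nabla\cdot\nabla f)
  = i\mathbf{k}\cdot\mathcal{F}(\nabla f)
  = i\mathbf{k}\cdot i\mathbf{k}F
  = -k^2 F
\]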
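Result (4.67) can be verified numerically for a spherically symmetric function, for which the angular part of (4.61) can be done in closed form: $\int e^{-i\mathbf{k}\cdot\mathbf{r}}\, d\Omega = 4\pi \sin(kr)/(kr)$. The sketch below is ours; the reduction to a radial integral, the helper `radial_transform`, and the Gaussian test function are assumptions made for illustration.

```python
import math


def radial_transform(f, k, r_max=12.0, samples=2400):
    """F(k) of (4.61) for a spherically symmetric f(r).

    The angular integration has been carried out analytically, leaving
    F(k) = (1/2pi)^3 * 4pi * int_0^inf f(r) [sin(kr)/(kr)] r^2 dr.
    """
    h = r_max / samples
    total = 0.0
    for j in range(samples):
        r = (j + 0.5) * h
        total += f(r) * math.sin(k * r) / (k * r) * r * r
    return 4 * math.pi * total * h / (2 * math.pi) ** 3


f = lambda r: math.exp(-r * r / 2)
# Laplacian of a radial function: f'' + (2/r) f' = (r^2 - 3) exp(-r^2/2).
laplacian_f = lambda r: (r * r - 3.0) * math.exp(-r * r / 2)

k = 1.7
F = radial_transform(f, k)
F_laplacian = radial_transform(laplacian_f, k)   # should equal -k^2 F by (4.67)
```

For this Gaussian the transform is known in closed form, $F(k) = (2\pi)^{-3/2} e^{-k^2/2}$, which provides an independent check on the normalization in (4.61).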




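(d) The convolution of $f_1(\mathbf{r})$, $f_2(\mathbf{r})$ is 
\[
  C(\mathbf{r}) = \int f_1(\mathbf{r}')\, f_2(\mathbf{r}-\mathbf{r}')\, d^3r' = f_1(\mathbf{r}) * f_2(\mathbf{r})
\]

The transform of the convolution is 
\[
  \mathcal{F}(C(\mathbf{r})) = (2\pi)^3 F_1(\mathbf{k})\, F_2(\mathbf{k})
\]

where $F_1$, $F_2$ are the transforms of $f_1(\mathbf{r})$ and $f_2(\mathbf{r})$, respectively. 
By definition the transform of the convolution of $f_1(\mathbf{r})$ and $f_2(\mathbf{r})$ is 

\[
  \left(\frac{1}{2\pi}\right)^{3} \int e^{-i\mathbf{k}\cdot\mathbf{r}} \left[\int f_1(\mathbf{r}')\, f_2(\mathbf{r}-\mathbf{r}')\, d^3r'\right] d^3r
\]

Assuming that the order of the integrations may be changed, and replacing 
$\mathbf{r}$ as a variable of integration by $\mathbf{R} = \mathbf{r} - \mathbf{r}'$, this becomes 

\[
  \left(\frac{1}{2\pi}\right)^{3} \int f_1(\mathbf{r}')\, e^{-i\mathbf{k}\cdot\mathbf{r}'} \left[\int f_2(\mathbf{R})\, e^{-i\mathbf{k}\cdot\mathbf{R}}\, d^3R\right] d^3r'
\]
Therefore, 
\[
  \mathcal{F}(C(\mathbf{r})) = (2\pi)^3 F_1(\mathbf{k})\, F_2(\mathbf{k}) \tag{4.68}
\]
Conversely, it follows that 
\[
  C(\mathbf{r}) = (2\pi)^3 \int F_1(\mathbf{k})\, F_2(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{4.69}
\]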


7. Solution of Poisson’s Equation 
by Fourier Transforms 


Using the preceding theorems, it is possible to solve most of the 
standard linear differential equations of mathematical physics. Not only 
does the Fourier transform method provide a very powerful tool for such 
problems, but it also provides considerable insight into the physical 
principles which they involve. 

Consider, as a first illustration, Poisson’s equation for the electrostatic 
potential arising from a charge distribution $\rho(\mathbf{r})$: 

\[
  \nabla^2\Phi = -4\pi\rho \tag{3.23}
\]

Taking the Fourier transforms of the two sides of this equation and 
designating the Fourier transforms of $\Phi(\mathbf{r})$ and $\rho(\mathbf{r})$ by $\Phi_0(\mathbf{k})$, $P(\mathbf{k})$, 
respectively, leads to the equation 

\[
  \Phi_0(\mathbf{k}) = \frac{4\pi}{k^2}\, P(\mathbf{k}) \tag{4.70}
\]

If we designate 

\[
  G_0(\mathbf{k}) = \frac{1}{2\pi^2 k^2} \tag{4.71}
\]

it follows from (4.68) that 

\[
  \Phi_0(\mathbf{k}) = (2\pi)^3 G_0(\mathbf{k})\, P(\mathbf{k}) \tag{4.72}
\]
Therefore, 
\[
  \Phi(\mathbf{r}) = (2\pi)^3 \int G_0(\mathbf{k})\, P(\mathbf{k})\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{4.73}
\]
which is, by (4.69), 
\[
  \Phi(\mathbf{r}) = \int \rho(\mathbf{r}')\, G(\mathbf{r}-\mathbf{r}')\, d^3r' \tag{4.74}
\]

where $G(\mathbf{r})$ is the function whose transform is $G_0(\mathbf{k}) = 1/2\pi^2k^2$. Again, 
we may refer to this as the “Green’s function” of the problem; more 
specifically, in this case, the Green’s function for the potential equation 
in free space. 

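We may determine directly the function $G(\mathbf{r})$ by the Fourier inverse 
theorem (4.62): 

\[
  G(\mathbf{r}) = \frac{1}{2\pi^2} \int \frac{1}{k^2}\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{4.75}
\]

Spherical polar coordinates may be chosen for the variables in $\mathbf{k}$ space, 
using the direction of $\mathbf{r}$ as polar axis. If $\mu$ designates the cosine of the 
angle between $\mathbf{k}$ and $\mathbf{r}$, integration over polar angles gives 

\[
  G(\mathbf{r}) = \frac{1}{\pi}\int_{0}^{\infty} dk \int_{-1}^{1} d\mu\, e^{ikr\mu}
  = \frac{2}{\pi}\int_{0}^{\infty} \frac{\sin kr}{kr}\, dk = \frac{1}{r} \tag{4.76}
\]
The potential (4.74) then becomes 
\[
  \Phi(\mathbf{r}) = \int \frac{\rho(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\, d^3r' \tag{4.77}
\]

a result which we wrote down earlier [Eq. (3.138)] on the basis of a 
physical argument. 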
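The Fourier solution (4.73) can be tried on a charge distribution for which the answer is known. For a Gaussian of unit total charge, $\rho(r) = (\alpha/\pi)^{3/2} e^{-\alpha r^2}$, the transform is $P(k) = (2\pi)^{-3} e^{-k^2/4\alpha}$ and the potential is $\mathrm{erf}(\sqrt{\alpha}\, r)/r$, which tends to $1/r$ at large distances. The sketch below is our own illustration; the Gaussian, the radial reduction of the $\mathbf{k}$ integral, and the function name `potential` are assumptions.

```python
import math


def potential(r, alpha, k_max=40.0, samples=8000):
    """Phi(r) from (4.73) for rho(r) = (alpha/pi)**1.5 * exp(-alpha * r**2).

    The angular part of the k integral is done analytically, giving a factor
    4*pi*sin(kr)/(kr); the radial integral is evaluated by the midpoint rule.
    """
    h = k_max / samples
    total = 0.0
    for j in range(samples):
        k = (j + 0.5) * h
        P = math.exp(-k * k / (4 * alpha)) / (2 * math.pi) ** 3   # transform of rho
        G0 = 1.0 / (2 * math.pi ** 2 * k * k)                     # (4.71)
        total += G0 * P * math.sin(k * r) / (k * r) * k * k
    return (2 * math.pi) ** 3 * 4 * math.pi * total * h


alpha = 1.0
near = potential(1.5, alpha)   # compare with erf(sqrt(alpha) * 1.5) / 1.5
far = potential(8.0, alpha)    # essentially the point-charge value 1/8
```

The factor $1/k^2$ in $G_0$ is cancelled by the $k^2$ of the volume element, so the integrand is regular at $k = 0$ and the Gaussian factor ensures rapid convergence.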


8. Poisson’s Summation Formula 
in One and Three Dimensions 


In this section we derive a mathematical formula which is frequently 
used in mathematical physics in cases in which the solution to problems 
leads to an infinite series. We derive it first in its simplest one-dimensional 
form, and then give the generalization to three dimensions. 

Consider the following function, defined in terms of a series: 


\[
  S(x) = \sum_{n=-\infty}^{\infty} f(x + n x_0) \tag{4.78}
\]




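where $f$ is an arbitrary function. It is clear that $S(x)$ is periodic, with 
period $x_0$, and so may be expressed as a Fourier series 

\[
  S(x) = \sum_{l=-\infty}^{\infty} C_l\, e^{2\pi i l x/x_0} \tag{4.79}
\]
The coefficients $C_l$ are given by 
\[
  C_l = \frac{1}{x_0}\int_{-(1/2)x_0}^{(1/2)x_0} S(x)\, e^{-2\pi i l x/x_0}\, dx \tag{4.80}
\]
Substituting for $S(x)$ from (4.78), 

\[
  C_l = \frac{1}{x_0}\sum_{n=-\infty}^{\infty} \int_{-(1/2)x_0}^{(1/2)x_0} f(x + n x_0)\, e^{-2\pi i l x/x_0}\, dx
\]

In a term corresponding to a given $n$, introduce $y = x + n x_0$ to get 
\[
  C_l = \frac{1}{x_0}\sum_{n=-\infty}^{\infty} \int_{(n-(1/2))x_0}^{(n+(1/2))x_0} f(y)\, e^{-2\pi i l y/x_0}\, dy
      = \frac{1}{x_0}\int_{-\infty}^{\infty} f(y)\, e^{-2\pi i l y/x_0}\, dy
\]

on summing the terms. Therefore, 

\[
  C_l = \frac{2\pi}{x_0}\, F\!\left(\frac{2\pi l}{x_0}\right) \tag{4.81}
\]
where $F(k)$ is the Fourier transform of $f(x)$. Thus $S(x)$ may be written as 
\[
  S(x) = \frac{2\pi}{x_0}\sum_{l=-\infty}^{\infty} F\!\left(\frac{2\pi l}{x_0}\right) e^{2\pi i l x/x_0} \tag{4.82}
\]

This is the Poisson summation formula. 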
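A Gaussian provides a convenient numerical illustration of the formula, since with the convention (4.16) the transform of $f(x) = e^{-x^2/2}$ is $F(k) = e^{-k^2/2}/\sqrt{2\pi}$. The following sketch is ours (the function names and truncation limits are assumed); it compares the direct sum (4.78) with the right-hand side of (4.82).

```python
import cmath
import math


def direct_sum(f, x, x0, n_max=60):
    """Truncated form of S(x) = sum_n f(x + n*x0), Eq. (4.78)."""
    return sum(f(x + n * x0) for n in range(-n_max, n_max + 1))


def poisson_sum(F, x, x0, l_max=8):
    """Truncated right-hand side of (4.82), with F in the convention of (4.16)."""
    total = 0j
    for l in range(-l_max, l_max + 1):
        total += F(2 * math.pi * l / x0) * cmath.exp(2j * math.pi * l * x / x0)
    return 2 * math.pi * total / x0


f = lambda x: math.exp(-x * x / 2)
F = lambda k: math.exp(-k * k / 2) / math.sqrt(2 * math.pi)

x, x0 = 0.3, 1.0
left = direct_sum(f, x, x0)
right = poisson_sum(F, x, x0)
```

With $x_0 = 1$ the transformed series converges extremely fast (its first correction is of order $e^{-2\pi^2}$), whereas the direct sum needs several terms; for a much narrower Gaussian the situation would be reversed.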


Example 
Evaluate 
\[
  S_0 = \sum_{n=0}^{\infty} \frac{1}{a^2 + n^2} \tag{4.83}
\]

In the first place the series may be written 

\[
  S_0 = \frac{1}{2a^2} + \frac{1}{2}\sum_{n=-\infty}^{\infty} \frac{1}{a^2 + n^2}
\]

This then becomes an application of the Poisson formula in which $f(u) = 1/(a^2 + u^2)$, 
$x_0 = 1$, and $x = 0$. The transform of $f(u)$ is 

\[
  F(v) = \frac{1}{2\pi}\int_{-\infty}^{\infty} \frac{e^{-iuv}}{a^2 + u^2}\, du
\]

For $v > 0$ this may be evaluated as a contour integral in which the 
contour is completed with a semicircle of radius $R \to \infty$ around the lower 
half-plane. By the residue theorem, the integral is 

\[
  F(v) = -i\, [\text{residue at } u = -ia] = -i\left(-\frac{1}{2ia}\right) e^{-av} = \frac{1}{2a}\, e^{-av}
\]

For $v < 0$ the contour is deformed around the upper half-plane, and the 
result is, similarly, 

\[
  F(v) = \frac{1}{2a}\, e^{-a|v|}
\]

Therefore, 

\[
  S_0 = \frac{1}{2a^2} + \frac{2\pi}{4a}\sum_{l=-\infty}^{\infty} e^{-a|2\pi l|}
      = \frac{1}{2a^2} - \frac{\pi}{2a} + \frac{\pi}{a}\sum_{l=0}^{\infty} e^{-2\pi l a} \tag{4.84}
\]

The last series is a geometric series and so may be evaluated giving 

\[
  S_0 = \frac{1}{2a^2} - \frac{\pi}{2a} + \frac{\pi}{a}\, \frac{1}{1 - e^{-2\pi a}} \tag{4.85}
\]

Problem 4-6: Show that 


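Problem 4-7: Show that 

\[
  \sum_{n=0}^{\infty} \frac{1}{a^2 + (2n+1)^2} = \frac{\pi}{4a}\, \frac{1 - e^{-\pi a}}{1 + e^{-\pi a}}
\]

Problem 4-8: Show that 

\[
  \sum_{n=-\infty}^{\infty} \frac{(-1)^n}{a^2 + n^2} = \frac{2\pi}{a}\, \frac{e^{-\pi a}}{1 - e^{-2\pi a}}
\]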


Problem 4-9: Evaluate the series 


 _ 
> - 79 


Although in this case the Poisson formula has permitted an explicit 
evaluation of the series, this will, of course, not generally be the case. The 
formula still has considerable value, however, as an aid to numerical 
evaluation of the solution. Its value lies in this: That whereas the original 
series (4.83) is most rapidly convergent for small a, the transformed series 




(4.84) converges most rapidly for large values of a. This feature is not 
peculiar to the present example; quite generally, it is found that precisely 
under those conditions in which one series converges slowly, the other 
converges rapidly. 

Consider now the three-dimensional case, in which the sum to be 
evaluated is 


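\[
  S(\mathbf{r}) = \sum_{n} f(\mathbf{r} + \mathbf{R}_n) \tag{4.86}
\]
where the $\mathbf{R}_n$’s are “lattice vectors” 
\[
  \mathbf{R}_n = n_1\boldsymbol{\tau}_1 + n_2\boldsymbol{\tau}_2 + n_3\boldsymbol{\tau}_3 \tag{4.87}
\]

the $n$’s taking on all integral values from $-\infty$ to $\infty$. Although the theorem 
in this case follows much the same pattern as in the one-dimensional case, 
the difference is sufficient to justify following the derivation through in 
detail. 

The function $S(\mathbf{r})$ is periodic with respect to displacements through 
each of the vectors $\boldsymbol{\tau}_1$, $\boldsymbol{\tau}_2$, and $\boldsymbol{\tau}_3$. It can therefore be expanded in a suitable 
Fourier series. To determine this series, it is necessary to introduce the 
“inverse lattice vectors” 

\[
  \mathbf{b}_1 = \frac{\boldsymbol{\tau}_2\times\boldsymbol{\tau}_3}{\Omega}, \qquad
  \mathbf{b}_2 = \frac{\boldsymbol{\tau}_3\times\boldsymbol{\tau}_1}{\Omega}, \qquad
  \mathbf{b}_3 = \frac{\boldsymbol{\tau}_1\times\boldsymbol{\tau}_2}{\Omega} \tag{4.88}
\]
where 
\[
  \Omega = \boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2\times\boldsymbol{\tau}_3 \tag{4.89}
\]
It follows immediately that 
\[
  \mathbf{b}_i\cdot\boldsymbol{\tau}_j = \delta_{ij} \tag{4.90}
\]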
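The construction of the inverse lattice vectors is readily carried out for any set of lattice vectors. The sketch below is ours and assumes the standard construction $\mathbf{b}_1 = \boldsymbol{\tau}_2\times\boldsymbol{\tau}_3/\Omega$ (and cyclic permutations), which is the one consistent with (4.89) and (4.90); the oblique lattice used as an example is arbitrary.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])


def inverse_lattice(t1, t2, t3):
    """Return (omega, (b1, b2, b3)) with omega = t1 . (t2 x t3) and b_i . t_j = delta_ij."""
    omega = dot(t1, cross(t2, t3))
    scale = lambda v: tuple(c / omega for c in v)
    return omega, (scale(cross(t2, t3)), scale(cross(t3, t1)), scale(cross(t1, t2)))


# Hypothetical non-orthogonal lattice vectors.
taus = ((1.0, 0.0, 0.0), (0.5, 1.2, 0.0), (0.3, 0.4, 2.0))
omega, bs = inverse_lattice(*taus)

# Matrix of scalar products b_i . tau_j; by (4.90) it should be the unit matrix.
gram = [[dot(b, t) for t in taus] for b in bs]
```

Note that no factor $2\pi$ is included in the $\mathbf{b}_i$ here; it appears explicitly in the exponentials of the Fourier series instead.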
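The Fourier series can then be expressed in the form 

\[
  S(\mathbf{r}) = \sum_{l} \sigma(\mathbf{K}_l)\, e^{2\pi i \mathbf{K}_l\cdot\mathbf{r}} \tag{4.91}
\]

where 
\[
  \mathbf{K}_l = l_1\mathbf{b}_1 + l_2\mathbf{b}_2 + l_3\mathbf{b}_3 \tag{4.92}
\]

the $l$’s being integers. The sum in (4.91) is over all positive and negative 
integral values of the $l$’s. 

To determine the coefficients $\sigma(\mathbf{K}_l)$, it is necessary to prove the 
orthogonality theorem: 

\[
  \int e^{2\pi i(\mathbf{K}_l - \mathbf{K}_{l'})\cdot\mathbf{r}}\, d^3r = \Omega\, \delta_{\mathbf{K}_l, \mathbf{K}_{l'}} \tag{4.93}
\]

The integral is taken over a parallelepiped constructed of the vectors 
$\boldsymbol{\tau}_1$, $\boldsymbol{\tau}_2$, and $\boldsymbol{\tau}_3$, and $\delta_{\mathbf{K}_l, \mathbf{K}_{l'}}$ is zero when $\mathbf{K}_l \neq \mathbf{K}_{l'}$ and unity when $\mathbf{K}_l = \mathbf{K}_{l'}$. 
The theorem is easily proven by introducing 

\[
  \mathbf{r} = z_1\boldsymbol{\tau}_1 + z_2\boldsymbol{\tau}_2 + z_3\boldsymbol{\tau}_3 \tag{4.94}
\]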




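The integral then becomes, by virtue of (4.90), 

\[
  \int_{0}^{1}\!\int_{0}^{1}\!\int_{0}^{1} \exp\{2\pi i[(l_1 - l_1')z_1 + (l_2 - l_2')z_2 + (l_3 - l_3')z_3]\}\, J\, dz_1\, dz_2\, dz_3 \tag{4.95}
\]
$J$ is the Jacobian of the components $(x_1, x_2, x_3)$ of $\mathbf{r}$ with respect to $(z_1, z_2, z_3)$. 
Thus 
\[
  J = \begin{vmatrix} \tau_{1x} & \tau_{1y} & \tau_{1z} \\ \tau_{2x} & \tau_{2y} & \tau_{2z} \\ \tau_{3x} & \tau_{3y} & \tau_{3z} \end{vmatrix}
    = \boldsymbol{\tau}_1\cdot\boldsymbol{\tau}_2\times\boldsymbol{\tau}_3 = \Omega \tag{4.96}
\]
The integral is, in fact, the product of three similar integrals of the form 

\[
  \int_{0}^{1} e^{2\pi i(l_1 - l_1')z_1}\, dz_1 = \begin{cases} 0, & l_1 \neq l_1' \\ 1, & l_1 = l_1' \end{cases} \tag{4.97}
\]
Theorem (4.93) follows. 
If (4.91) is now multiplied by $e^{-2\pi i \mathbf{K}_l\cdot\mathbf{r}}$, and we integrate as in (4.93), 
we find that 

\[
  \sigma(\mathbf{K}_l) = \frac{1}{\Omega}\int S(\mathbf{r})\, e^{-2\pi i \mathbf{K}_l\cdot\mathbf{r}}\, d^3r \tag{4.98}
\]
the integral being over the cell of volume $\Omega$. 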


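We are now in a position to complete the derivation of the summation 
formula. Substituting for $S(\mathbf{r})$ from (4.86) in (4.98) it is found that 

\[
  \sigma(\mathbf{K}_l) = \frac{1}{\Omega}\sum_{n} \int_{\text{cell}} f(\mathbf{r} + \mathbf{R}_n)\, e^{-2\pi i \mathbf{K}_l\cdot\mathbf{r}}\, d^3r \tag{4.99}
\]
In a given term, the substitution 
\[
  \mathbf{r}' = \mathbf{r} + \mathbf{R}_n \tag{4.100}
\]
leads to the result 
\[
  \sigma(\mathbf{K}_l) = \frac{1}{\Omega}\sum_{n} \int_{\text{cell } n} f(\mathbf{r}')\, e^{-2\pi i \mathbf{K}_l\cdot\mathbf{r}'}\, d^3r' \tag{4.101}
\]

since $\mathbf{K}_l\cdot\mathbf{R}_n$ is an integer. The sum is over cells displaced from the original 
one by the “lattice vectors” $\mathbf{R}_n$. But the sum of integrals over all such “cells” is 
the integral over all space, so that 

\[
  \sigma(\mathbf{K}_l) = \frac{1}{\Omega}\int f(\mathbf{r}')\, e^{-2\pi i \mathbf{K}_l\cdot\mathbf{r}'}\, d^3r' \tag{4.102}
\]
\[
  \phantom{\sigma(\mathbf{K}_l)} = \frac{(2\pi)^3}{\Omega}\, F(2\pi\mathbf{K}_l) \tag{4.103}
\]

where $F(\mathbf{k})$ is the Fourier transform of $f(\mathbf{r})$. We thus arrive at the Poisson 
summation formula in three dimensions: 

\[
  S(\mathbf{r}) = \sum_{n} f(\mathbf{r} + \mathbf{R}_n) = \frac{(2\pi)^3}{\Omega}\sum_{l} F(2\pi\mathbf{K}_l)\, e^{2\pi i \mathbf{K}_l\cdot\mathbf{r}} \tag{4.104}
\]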




This result has a number of applications, as we shall see in subse- 
quent chapters. Suppose, for instance, that one wishes to solve a linear 
problem (potential, heat conduction, scattering, etc.) with a point source, 
and boundary conditions which can be satisfied by a system of image 
sources at $(-\mathbf{R}_n)$. If the Green’s function (point source solution) in an 
infinite region is $f(\mathbf{r})$ for a source at the origin, the solution will be of the 
form (4.86). Equation (4.104) then makes it possible to transform from 
the solution in terms of images to one in the form of a Fourier series. In 
general, where the one solution converges slowly, the other converges 
rapidly. 

A second application is in the theory of solids, where one may desire 
to evaluate sums of functions centered at the various lattice sites $\mathbf{R}_n$ of the 
atoms of a crystal. For example, the potential at any point in a crystal 
might be considered as the sum of contributions from the charge distribu- 
tion in its different “‘cells.”” Equation (4.104) would then give the Fourier 
series representation of the potential. In particular, in an ionic crystal, 
one can calculate the potential energy per ion as the sum of the contribu- 
tions from all the other ions. 

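Let us carry out this calculation for a cubic crystal, the successive 
ions having total charges of $e$ and $-e$. The charge distributions on the ions 
are in general different, but for simplicity we assume them to be the same, 
namely, $\pm e\rho_0(\mathbf{r} - \mathbf{R}_n)$, where $\mathbf{R}_n$ is the location of the nucleus of the $n$th 
ion. The total charge density may then be taken to be 

\[
  \rho(\mathbf{r}) = e\sum_{n} \cos(\pi\boldsymbol{\lambda}\cdot\mathbf{R}_n)\, \rho_0(\mathbf{r} - \mathbf{R}_n) \tag{4.105}
\]
Here 
\[
  \mathbf{R}_n = n_1 a\mathbf{i} + n_2 a\mathbf{j} + n_3 a\mathbf{k} \tag{4.106}
\]
and 
\[
  \boldsymbol{\lambda} = \frac{1}{a}(\mathbf{i} + \mathbf{j} + \mathbf{k}) \tag{4.107}
\]

where $\mathbf{i}$, $\mathbf{j}$, and $\mathbf{k}$ are unit vectors along the coordinate directions and $a$ is 
the lattice spacing. Since $\boldsymbol{\lambda}\cdot\mathbf{R}_n = n_1 + n_2 + n_3$, the factor $\cos(\pi\boldsymbol{\lambda}\cdot\mathbf{R}_n)$ 
ensures the alternation of sign of the charges. 

Let us use theorem (4.104) to transform the density (4.105). It may 
be considered as a sum $\sum_n f(\mathbf{R}_n)$, where 

\[
  f(\mathbf{r}') = e\cos(\pi\boldsymbol{\lambda}\cdot\mathbf{r}')\, \rho_0(\mathbf{r} - \mathbf{r}') \tag{4.108}
\]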


The Fourier transform of this function is 


F(k) = On [cos Tr°P' po (r—r’) ek der’ (4.109) 


8. Poisson’s Summation Formula in One and Three Dimensions 175 


Writing the cosine as a sum of complex exponentials and introducing the 
variable 


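$$\mathbf{R} = \mathbf{r}-\mathbf{r}' \tag{4.110}$$

this becomes

$$F(\mathbf{k}) = \frac{\epsilon}{2}\left[e^{-i(\mathbf{k}-\pi\boldsymbol{\lambda})\cdot\mathbf{r}}\,P_0(\mathbf{k}-\pi\boldsymbol{\lambda}) + e^{-i(\mathbf{k}+\pi\boldsymbol{\lambda})\cdot\mathbf{r}}\,P_0(\mathbf{k}+\pi\boldsymbol{\lambda})\right] \tag{4.111}$$

where $P_0(\mathbf{k})$ is the Fourier transform of $\rho_0(\mathbf{r})$. The required charge
density may then be written as

$$\rho(\mathbf{r}) = \frac{4\pi^3\epsilon}{a^3}\sum_l\left[e^{-2\pi i(\mathbf{K}_l-\frac12\boldsymbol{\lambda})\cdot\mathbf{r}}\,P_0(2\pi\mathbf{K}_l-\pi\boldsymbol{\lambda}) + e^{-2\pi i(\mathbf{K}_l+\frac12\boldsymbol{\lambda})\cdot\mathbf{r}}\,P_0(2\pi\mathbf{K}_l+\pi\boldsymbol{\lambda})\right] \tag{4.112}$$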


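This may now be used to calculate the potential at the position of the
ion at the origin. However, since (4.112) includes the charge of that ion,
we must subtract its own density $\epsilon\rho_0(\mathbf{r})$ from (4.112).

The potential is, then,

$$\Phi(\mathbf{r}) = \int\frac{\rho(\mathbf{r}')-\epsilon\rho_0(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' = \Phi_1(\mathbf{r})-\Phi_2(\mathbf{r}) \tag{4.113}$$

Making use of the fact that

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{r}'}}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' = \frac{4\pi}{k^2}\,e^{i\mathbf{k}\cdot\mathbf{r}} \tag{4.114}$$

it is possible to calculate $\Phi_1$, the contribution of the full density (4.112):

$$\Phi_1(0) = \frac{4\pi^2\epsilon}{a^3}\sum_l\left\{\frac{P_0(2\pi\mathbf{K}_l-\pi\boldsymbol{\lambda})}{|\mathbf{K}_l-\frac12\boldsymbol{\lambda}|^2} + \frac{P_0(2\pi\mathbf{K}_l+\pi\boldsymbol{\lambda})}{|\mathbf{K}_l+\frac12\boldsymbol{\lambda}|^2}\right\} \tag{4.115}$$

On the other hand,

$$\Phi_2(0) = \epsilon\int\frac{\rho_0(\mathbf{r}')}{r'}\,d^3r' \tag{4.116}$$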


For purposes of evaluation of the lattice sum, the precise form of po 
should not matter provided the ions do not overlap; that is, provided $\rho_0$ is
negligible outside the cubical “cell” surrounding a given atom. Assuming
that the total ionic charge is unity, it is convenient to choose

$$\rho_0(r) = \left(\frac{\alpha}{\pi}\right)^{3/2}\frac{1}{a^3}\,e^{-\alpha r^2/a^2} \tag{4.117}$$




The transform of po (r) is 


$$P_0(k) = \frac{1}{(2\pi)^3}\,e^{-k^2a^2/4\alpha} \tag{4.118}$$


leading to a specific form for (4.115). The integral in (4.116) is elementary 
and gives 


$$\Phi_2(0) = \frac{2\epsilon}{a}\left(\frac{\alpha}{\pi}\right)^{1/2} \tag{4.119}$$


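The potential energy of a point charge $\epsilon$ at the origin is, then,

$$U = \epsilon\,[\Phi_1(0)-\Phi_2(0)] \tag{4.120}$$

If we now take $\alpha \gg 1$, we should get a good approximation to the potential
energy of a given charge due to alternating point charges at the other
lattice sites. Using (4.115), (4.118), and (4.119) we obtain the formula

$$U = -\frac{\epsilon^2}{a}\left\{2\left(\frac{\alpha}{\pi}\right)^{1/2} - \frac{1}{\pi}\sum_{l_1,l_2,l_3}\frac{\exp\left(-\frac{\pi^2}{\alpha}\left[(l_1-\frac12)^2+(l_2-\frac12)^2+(l_3-\frac12)^2\right]\right)}{(l_1-\frac12)^2+(l_2-\frac12)^2+(l_3-\frac12)^2}\right\} \tag{4.121}$$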


Apart from sign, the coefficient of $\epsilon^2/a$, evaluated for sufficiently large $\alpha$, is the
“Madelung constant.” A quite accurate approximation (with an error of < 1%) to the
correct value of 1.7476 can be obtained by taking $\alpha = 2.5\pi^2$, a choice
which leads to quite simple calculations. Adequate convergence in this
case is obtained by summing all terms up to $|l_i-\frac12| \le \frac52$ ($i = 1, 2, 3$). To
obtain higher accuracy, it would be necessary to take a larger value of
$\alpha$ ($5\pi^2$ would be a reasonable choice). It would then be necessary to sum
more terms; the labor, however, is not prohibitive and in this case gives 
four-figure accuracy. 
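
The lattice sum in (4.121) is easy to check numerically. The following is a minimal sketch, not part of the original text: it assumes that the reciprocal vectors are $\mathbf{K}_l = (l_1, l_2, l_3)/a$ with integer $l_i$, and the function name and the `half_width` cutoff parameter are illustrative choices.

```python
import math

def madelung_estimate(alpha, half_width):
    """Coefficient of -eps^2/a in (4.121).

    Assumption: K_l = (l1, l2, l3)/a with integer l_i.  half_width = L keeps
    every term with |l_i - 1/2| <= L - 1/2 for i = 1, 2, 3.
    """
    total = 0.0
    indices = range(-(half_width - 1), half_width + 1)
    for l1 in indices:
        for l2 in indices:
            for l3 in indices:
                s = (l1 - 0.5) ** 2 + (l2 - 0.5) ** 2 + (l3 - 0.5) ** 2
                total += math.exp(-math.pi ** 2 * s / alpha) / s
    return 2.0 * math.sqrt(alpha / math.pi) - total / math.pi

# alpha = 2.5 pi^2 with |l_i - 1/2| <= 5/2 (216 terms)
coarse = madelung_estimate(2.5 * math.pi ** 2, 3)
# alpha = 5 pi^2 with a wider cutoff
fine = madelung_estimate(5.0 * math.pi ** 2, 8)
print(coarse, fine)
```

Under these assumptions the coarse sum should lie within 1% of 1.7476, and the wider sum should agree with it to about four figures.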


9. A Note on δ Functions and
Three-Dimensional Transforms 


It is interesting to note that many of the major results of Fourier 
transform theory may be obtained formally by the use of the Dirac 
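“δ function” in three dimensions. Defining it by the relation

$$\int f(\mathbf{r}')\,\delta(\mathbf{r}-\mathbf{r}')\,d^3r' = f(\mathbf{r}) \tag{4.122}$$

where $f(\mathbf{r})$ is a well-behaved function, its transform is, from (4.61),

$$F(\mathbf{k}) = \frac{1}{(2\pi)^3}\,e^{-i\mathbf{k}\cdot\mathbf{r}'} \tag{4.123}$$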


Therefore, from (4.62), if the Fourier integral theorem were valid for this 




“function,” it could be written

$$\delta(\mathbf{r}-\mathbf{r}') = \frac{1}{(2\pi)^3}\int e^{i\mathbf{k}\cdot(\mathbf{r}-\mathbf{r}')}\,d^3k \tag{4.124}$$


Of course, the integral on the right does not really exist. Equation (4.124) 
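takes on meaning, however, if both sides are integrated with another
function and (4.122) is used. If each side is multiplied by $f(\mathbf{r}')$ and integrated
over the coordinates $\mathbf{r}'$, the resulting equation is

$$f(\mathbf{r}) = \left(\frac{1}{2\pi}\right)^3\int f(\mathbf{r}')\left[\int e^{i\mathbf{k}\cdot(\mathbf{r}-\mathbf{r}')}\,d^3k\right]d^3r'$$

Reversing the order of the integrations on the right, a step which leads to
well-defined integrals if $f(\mathbf{r})$ has a transform, reproduces the Fourier
integral theorem:

$$f(\mathbf{r}) = \int F(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{r}}\,d^3k = \int\left[\left(\frac{1}{2\pi}\right)^3\int f(\mathbf{r}')\,e^{-i\mathbf{k}\cdot\mathbf{r}'}\,d^3r'\right]e^{i\mathbf{k}\cdot\mathbf{r}}\,d^3k \tag{4.125}$$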


Equation (4.124), therefore, can be considered as valid in this sense. 

The way in which it is most commonly used is that, when one has an
integral such as that in (4.125), the order of the integrations is reversed,
and the “invalid” k integral is replaced by the δ function, which may then
be evaluated. The use of the δ function and its transform in this way is
then completely equivalent to the use of a valid Fourier transform 
theorem. 

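Using δ functions in this way, the convolution theorem may be
recovered quickly. For, expressing the functions $f(\mathbf{r})$ and $g(\mathbf{r})$ in the
convolution by their transforms F and G, respectively, the convolution
may be written

$$C(\mathbf{r}) = \int f(\mathbf{r}')\,g(\mathbf{r}-\mathbf{r}')\,d^3r' = \int\left[\int F(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{r}'}\,d^3k\right]\left[\int G(\mathbf{k}')\,e^{i\mathbf{k}'\cdot(\mathbf{r}-\mathbf{r}')}\,d^3k'\right]d^3r'$$

Doing the $\mathbf{r}'$ integral first gives

$$C(\mathbf{r}) = \int\!\!\int F(\mathbf{k})\,G(\mathbf{k}')\,e^{i\mathbf{k}'\cdot\mathbf{r}}\,(2\pi)^3\,\delta(\mathbf{k}-\mathbf{k}')\,d^3k'\,d^3k$$

On performing the $\mathbf{k}'$ integration, the expression (4.69) is obtained for the
convolution in terms of the transforms of its component functions:

$$C(\mathbf{r}) = (2\pi)^3\int F(\mathbf{k})\,G(\mathbf{k})\,e^{i\mathbf{k}\cdot\mathbf{r}}\,d^3k$$

The inverted equation (4.68) then follows.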




10. Two- and Three-Center 
Integrals 


Three-dimensional Fourier transforms provide a useful technique 
for doing integrals over functions centered on several points, such as are 
found in electrostatics and in atomic and molecular calculations. We con- 
sider a number of such integrals. First, however, we list several useful 
transforms. 


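(a) $\displaystyle\left(\frac{1}{2\pi}\right)^3\int\frac{e^{-\alpha r}}{r}\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,d^3r$

Putting $\mu = \cos\theta$ as the cosine of the angle between k and r, using k as
polar axis, and doing the angular integrations, the integral becomes

$$2\pi\int_0^\infty\frac{e^{-\alpha r}}{r}\left[\int_{-1}^{1}e^{-ikr\mu}\,d\mu\right]r^2\,dr = \frac{4\pi}{k}\int_0^\infty e^{-\alpha r}\sin kr\,dr = \frac{4\pi}{k}\,\mathrm{Im}\int_0^\infty e^{-\alpha r}e^{ikr}\,dr = \frac{4\pi}{\alpha^2+k^2} \tag{4.126}$$

so that the transform is $(1/2\pi^2)\,1/(\alpha^2+k^2)$.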


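(b) Differentiating with respect to $\alpha$, another useful transform is
obtained:

$$\left(\frac{1}{2\pi}\right)^3\int e^{-\alpha r}\,e^{-i\mathbf{k}\cdot\mathbf{r}}\,d^3r = \frac{1}{\pi^2}\,\frac{\alpha}{(\alpha^2+k^2)^2} \tag{4.127}$$

Transforms of $e^{-\alpha r}$ multiplied by positive powers of r may be obtained by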
further differentiations. 


Problem 4-10: Evaluate the transforms of 


(i) $r\,e^{-\alpha r}$
a ae 
(ii) 7 
and consequently express these functions as Fourier integrals. 


Let us now consider some two- and three-center integrals. 
(I) THE “OVERLAP INTEGRAL” 


$$I_1 = \int e^{-\alpha r}\,e^{-\beta|\mathbf{r}-\mathbf{R}|}\,d^3r \tag{4.128}$$




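Replace $e^{-\alpha r}$ by its expression as an integral transform:

$$e^{-\alpha r} = \frac{\alpha}{\pi^2}\int\frac{e^{i\mathbf{k}\cdot\mathbf{r}}}{(\alpha^2+k^2)^2}\,d^3k \tag{4.129}$$

Thus

$$I_1 = \frac{\alpha}{\pi^2}\int\frac{d^3k}{(\alpha^2+k^2)^2}\int e^{-\beta|\mathbf{r}-\mathbf{R}|}\,e^{i\mathbf{k}\cdot\mathbf{r}}\,d^3r$$

But by (4.127), this is

$$I_1 = \frac{8\alpha\beta}{\pi}\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{(\alpha^2+k^2)^2(\beta^2+k^2)^2}\,d^3k \tag{4.130}$$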

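This may be evaluated by observing that

$$\frac{1}{(\alpha^2+k^2)(\beta^2+k^2)} = \frac{1}{\beta^2-\alpha^2}\left[\frac{1}{\alpha^2+k^2}-\frac{1}{\beta^2+k^2}\right] \tag{4.131}$$

Therefore,

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}\,d^3k}{(\alpha^2+k^2)(\beta^2+k^2)} = \frac{2\pi^2}{\beta^2-\alpha^2}\,\frac{e^{-\alpha R}-e^{-\beta R}}{R} \tag{4.132}$$

where R is the magnitude of the vector $\mathbf{R}$.
Calculating successively derivatives of the two sides with respect to
$\alpha^2$ and $\beta^2$, it is found that

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}\,d^3k}{(\alpha^2+k^2)^2(\beta^2+k^2)} = \frac{\pi^2}{\alpha(\beta^2-\alpha^2)}\,e^{-\alpha R} - \frac{2\pi^2}{(\beta^2-\alpha^2)^2}\,\frac{e^{-\alpha R}-e^{-\beta R}}{R} \tag{4.133}$$

and

$$I_1 = \frac{8\alpha\beta}{\pi}\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}\,d^3k}{(\alpha^2+k^2)^2(\beta^2+k^2)^2} = \frac{8\pi}{(\beta^2-\alpha^2)^3}\left[(\beta^2-\alpha^2)(\beta e^{-\alpha R}+\alpha e^{-\beta R}) - \frac{4\alpha\beta}{R}(e^{-\alpha R}-e^{-\beta R})\right] \tag{4.134}$$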


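This formula is not too convenient when $\beta = \alpha$. It is, of course,
possible to take the limit as $\beta \to \alpha$. Alternatively, it is possible to evaluate
(4.130) by starting from

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{k^2+\alpha^2}\,d^3k = \frac{2\pi^2}{R}\,e^{-\alpha R} \tag{4.135}$$

and differentiating successively with respect to $\alpha^2$:

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{(k^2+\alpha^2)^2}\,d^3k = \frac{\pi^2}{\alpha}\,e^{-\alpha R} \tag{4.136}$$

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{(k^2+\alpha^2)^3}\,d^3k = \frac{\pi^2}{4\alpha^3}\,(1+\alpha R)\,e^{-\alpha R} \tag{4.137}$$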




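and finally

$$\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{(k^2+\alpha^2)^4}\,d^3k = \frac{\pi^2}{24\alpha^5}\,(3+3\alpha R+\alpha^2R^2)\,e^{-\alpha R} \tag{4.138}$$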


(II) COULOMB INTERACTION BETWEEN TWO HYDROGEN
ATOMS


A hydrogen atom in its lowest energy state consists of a nuclear 
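charge $+\epsilon$ and a negative charge distribution

$$\rho(r) = -\epsilon\,\frac{\alpha^3}{8\pi}\,e^{-\alpha r} \tag{4.139}$$

Taking the origin at one atom, and calling the position vector of the other
relative to this one $\mathbf{R}$, the total potential energy of the Coulomb interaction
between them is

$$V = \frac{\epsilon^2}{R} - 2\epsilon^2\,\frac{\alpha^3}{8\pi}\int\frac{1}{r}\,e^{-\alpha|\mathbf{r}-\mathbf{R}|}\,d^3r + \epsilon^2\left(\frac{\alpha^3}{8\pi}\right)^2\int\!\!\int d^3r\,d^3r'\,e^{-\alpha r'}\,\frac{e^{-\alpha|\mathbf{r}-\mathbf{R}|}}{|\mathbf{r}-\mathbf{r}'|} \tag{4.140}$$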

The first term represents the interaction between the two nuclei, the
second the two (equal) interactions between the electron distribution of
each atom and the nucleus of the other, and the third the interaction between
the two electron distributions.

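The second term, which we call $V_1$, can be done by first introducing
the Fourier transform (4.129) of the exponential:

$$V_1 = -2\epsilon^2\,\frac{\alpha^3}{8\pi}\,\frac{\alpha}{\pi^2}\int\!\!\int\frac{1}{r}\,\frac{e^{i\mathbf{k}\cdot(\mathbf{r}-\mathbf{R})}}{(\alpha^2+k^2)^2}\,d^3k\,d^3r \tag{4.141}$$

The r integral may then be evaluated, giving

$$V_1 = -\epsilon^2\,\frac{\alpha^4}{\pi^2}\int\frac{e^{-i\mathbf{k}\cdot\mathbf{R}}}{k^2(\alpha^2+k^2)^2}\,d^3k \tag{4.142}$$

The integral is now a special case of (4.133), with $\beta = 0$:

$$V_1 = -\epsilon^2\,\frac{\alpha^4}{\pi^2}\left[\frac{2\pi^2}{\alpha^4}\,\frac{1-e^{-\alpha R}}{R} - \frac{\pi^2}{\alpha^3}\,e^{-\alpha R}\right] = -\epsilon^2\left[\frac{2}{R}\,(1-e^{-\alpha R}) - \alpha e^{-\alpha R}\right] \tag{4.143}$$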
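The last integral, which we call $V_2$, is a three-center integral ($\mathbf{r}$, $\mathbf{r}'$,
and $\mathbf{R}$), and is the most difficult. Introducing again the Fourier transform
of $e^{-\alpha|\mathbf{r}-\mathbf{R}|}$ it becomes

$$V_2 = \epsilon^2\left(\frac{\alpha^3}{8\pi}\right)^2\frac{\alpha}{\pi^2}\int d^3r'\,e^{-\alpha r'}\int d^3r\int\frac{e^{i\mathbf{k}\cdot(\mathbf{r}-\mathbf{R})}}{(\alpha^2+k^2)^2\,|\mathbf{r}-\mathbf{r}'|}\,d^3k \tag{4.144}$$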




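The r integral has the value $(4\pi/k^2)\,e^{i\mathbf{k}\cdot\mathbf{r}'}$. Substituting this value gives

$$V_2 = \epsilon^2\,\frac{\alpha^7}{16\pi^3}\int d^3r'\,e^{-\alpha r'}\int\frac{e^{i\mathbf{k}\cdot(\mathbf{r}'-\mathbf{R})}}{k^2(\alpha^2+k^2)^2}\,d^3k \tag{4.145}$$

Next, it is possible to do the $\mathbf{r}'$ integral using (4.127), to get

$$V_2 = \epsilon^2\,\frac{\alpha^8}{2\pi^2}\int\frac{e^{-i\mathbf{k}\cdot\mathbf{R}}}{k^2(\alpha^2+k^2)^4}\,d^3k \tag{4.146}$$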


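To evaluate this integral, start from one already evaluated:

$$\int\frac{e^{-i\mathbf{k}\cdot\mathbf{R}}}{k^2(\alpha^2+k^2)^2}\,d^3k = \frac{\pi^2}{\alpha^4R}\,(2-2e^{-\alpha R}-\alpha R\,e^{-\alpha R})$$

and differentiate twice with respect to $\alpha^2$. This gives six times the required
integral on the left. On the right, the differentiations yield

$$\frac{\pi^2}{4\alpha^8R}\left[48 - e^{-\alpha R}\,(48+33\alpha R+9\alpha^2R^2+\alpha^3R^3)\right]$$

Therefore,

$$V_2 = \frac{\epsilon^2}{R}\left[1 - e^{-\alpha R}\left(1+\tfrac{11}{16}\alpha R+\tfrac{3}{16}\alpha^2R^2+\tfrac{1}{48}\alpha^3R^3\right)\right] \tag{4.147}$$

The total potential energy of interaction is then

$$V = \frac{\epsilon^2}{R}+V_1+V_2 = \frac{\epsilon^2}{R}\,e^{-\alpha R}\left[1+\tfrac{5}{16}\alpha R-\tfrac{3}{16}\alpha^2R^2-\tfrac{1}{48}\alpha^3R^3\right] \tag{4.148}$$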


Another result which may be derived from the above calculation is 
that of the potential energy of interaction of two charge distributions both 
of which are centered on the same point. This result, which is needed 
for one of the standard calculations of the energy of the helium atom, is 
the limit of $V_2$ as $R \to 0$. It is, therefore,


$$V_2(0) = \tfrac{5}{16}\,\epsilon^2\alpha \tag{4.149}$$
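
The closed forms (4.143), (4.147), and (4.148) can be checked against each other numerically. The sketch below is illustrative only; the function names are ours, and the charge $\epsilon$ defaults to 1 for convenience.

```python
import math

def v_nuclear_attraction(R, alpha, eps=1.0):
    """V_1 of (4.143): the two equal nucleus-cloud attractions."""
    decay = math.exp(-alpha * R)
    return -eps ** 2 * ((2.0 / R) * (1.0 - decay) - alpha * decay)

def v_cloud_repulsion(R, alpha, eps=1.0):
    """V_2 of (4.147): repulsion between the two electron clouds."""
    x = alpha * R
    poly = 1.0 + 11.0 * x / 16.0 + 3.0 * x ** 2 / 16.0 + x ** 3 / 48.0
    return eps ** 2 / R * (1.0 - math.exp(-x) * poly)

def v_total(R, alpha, eps=1.0):
    """Closed form (4.148) for the full interaction energy."""
    x = alpha * R
    poly = 1.0 + 5.0 * x / 16.0 - 3.0 * x ** 2 / 16.0 - x ** 3 / 48.0
    return eps ** 2 / R * math.exp(-x) * poly

R, alpha = 1.3, 2.0
pieces = 1.0 / R + v_nuclear_attraction(R, alpha) + v_cloud_repulsion(R, alpha)
print(pieces, v_total(R, alpha))          # the two numbers should agree
print(v_cloud_repulsion(1e-4, alpha))     # small-R limit, compare (4.149)
```

The sum of the three pieces should reproduce (4.148), and the small-R value of $V_2$ should approach $\tfrac{5}{16}\epsilon^2\alpha$.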


Problem 4-11: Calculate the Coulomb interaction 
energy between two p-state hydrogen atoms, for which the electron 
charge distributions are, relative to their respective nuclei, 


o 
£. z e7 Fr 




Consider the two cases: 

(i) in which the z direction is along the line joining the two nuclei; 

(ii) in which the z direction is perpendicular to the line joining the 
two nuclei. 


Problem 4-12: Calculate the integrals 


{ eevieote a (Te) Pp (Te) dr, d°r, 


Yeo 
and 
| Pal¥y Pp ee (rz) » (To) d*r, d°r, 
12 
where 
Yao(r) = V a en airtel 
and 


R=r,—-TF, 


(These integrals enter into the calculation of the energy of a hydrogen 
molecule by the Heitler—London method.) 


11. Laplace Transforms 
The Laplace transform of the function f(t) is 


$$\mathcal{L}[f(t)] = \int_0^\infty f(t)\,e^{-st}\,dt \tag{4.150}$$


where f (t) is assumed to be zero when the variable t < 0. This exists only 
for values of s such that the integral exists, say for Re s > a, where “Re” 
means “real part of.” Thus, for any polynomial it exists for Re s > 0, but 
for $f(t) = e^{bt}$ it exists only for Re s > b.

Let us first derive some theorems concerning Laplace transforms.


12. Transforms of Derivatives 


The transform of $f'(t)$ is determined by integration by parts:

$$\mathcal{L}[f'(t)] = \int_0^\infty f'(t)\,e^{-st}\,dt = \left[f(t)\,e^{-st}\right]_0^\infty + sF(s)$$

F(s) being the transform of f(t). Therefore,

$$\mathcal{L}[f'(t)] = -f(0)+sF(s) \tag{4.151}$$

Thus, the Laplace transform, unlike the Fourier transform, contains the
“initial condition” on the function f(t).




The derivative theorem may be applied repeatedly; for example, 


$$\mathcal{L}[f''(t)] = -f'(0)+s\,\mathcal{L}[f'(t)] = -f'(0)-sf(0)+s^2F(s) \tag{4.152}$$

and in general

$$\mathcal{L}[f^{(n)}(t)] = -f^{(n-1)}(0)-sf^{(n-2)}(0)-\cdots-s^{n-1}f(0)+s^nF(s) \tag{4.153}$$


Problem 4-13: Prove (4.153) by induction. 


13. “Shifting” Theorem 


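This theorem concerns the transform of the function $f(t-t_0)$,
($t_0 > 0$). It is easily seen to be

$$\mathcal{L}[f(t-t_0)] = \int_0^\infty f(t-t_0)\,e^{-st}\,dt = \int_{t_0}^\infty f(t-t_0)\,e^{-st}\,dt = e^{-st_0}\int_0^\infty f(t')\,e^{-st'}\,dt' = e^{-st_0}\,\mathcal{L}[f(t)] \tag{4.154}$$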


14. Convolution Theorem 


As in the case of Fourier transforms, the transform of a convolution 
is easily calculable in terms of the transforms of the functions from which 
it is formed. By “convolution” in this case we mean

$$C(t) = \int_0^t f_1(t')\,f_2(t-t')\,dt' \tag{4.155}$$

The upper limit of t' = t arises from the fact that the functions are equal to
zero for negative values of their argument.
The transform of the convolution is, by definition,

$$\mathcal{L}[C(t)] = \int_0^\infty dt\,e^{-st}\left[\int_0^t f_1(t')\,f_2(t-t')\,dt'\right]$$

This is a double integral whose range of integration in the (t, t') plane is
as shown in Fig. 4.4; it is the cross-hatched region above the diagonal.
Reversing the order of integration gives

$$\mathcal{L}[C(t)] = \int_0^\infty dt'\,f_1(t')\left[\int_{t'}^\infty f_2(t-t')\,e^{-st}\,dt\right]$$

In the inner integral, substitute

$$t-t' = t'' \tag{4.156}$$




Figure 4.4 


This leads to the result that 
$$\mathcal{L}[C(t)] = \int_0^\infty f_1(t')\,e^{-st'}\,dt'\int_0^\infty f_2(t'')\,e^{-st''}\,dt'' = F_1(s)\,F_2(s) \tag{4.157}$$


That is, the transform of the convolution is the product of the transforms 
of the functions from which it is formed. 
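
A quick numerical check of (4.157) may be helpful. The following is a rough sketch using the trapezoid rule; the test functions $f_1 = e^{-t}$ and $f_2 = t$, the value of s, and the truncation of the infinite range at `t_max` are all illustrative assumptions.

```python
import math

def trapezoid(func, a, b, n):
    """Composite trapezoid rule on [a, b] with n panels."""
    h = (b - a) / n
    total = 0.5 * (func(a) + func(b))
    for i in range(1, n):
        total += func(a + i * h)
    return total * h

def laplace(func, s, t_max=30.0, n=3000):
    """Numerical Laplace transform, truncating the range at t_max."""
    return trapezoid(lambda t: func(t) * math.exp(-s * t), 0.0, t_max, n)

def convolution(f1, f2, t, n=100):
    """C(t) of (4.155), evaluated by quadrature."""
    if t == 0.0:
        return 0.0
    return trapezoid(lambda tp: f1(tp) * f2(t - tp), 0.0, t, n)

f1 = lambda t: math.exp(-t)   # F1(s) = 1/(s + 1)
f2 = lambda t: t              # F2(s) = 1/s^2
s = 1.5
lhs = laplace(lambda t: convolution(f1, f2, t), s)
rhs = laplace(f1, s) * laplace(f2, s)
print(lhs, rhs, 1.0 / (s * s * (s + 1.0)))
```

All three printed numbers should agree to within the quadrature error.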


15. Some Simple Transforms 


In this section we evaluate a few simple Laplace transforms which 
are frequently met in physical applications. 


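(a) $f(t) = \theta(t) = 1$ for $t > 0$, and $= 0$ for $t < 0$

$$\mathcal{L}(\theta(t)) = \frac{1}{s} \tag{4.158}$$

(b) $f(t) = \theta(t)\,t^n$

$$\mathcal{L}(\theta(t)\,t^n) = \int_0^\infty t^n\,e^{-st}\,dt = \frac{n!}{s^{n+1}} \tag{4.159}$$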




(c) $f(t) = \cos at\,\theta(t)$
(d) $f(t) = \sin at\,\theta(t)$

These can be derived from

$$\mathcal{L}(e^{iat}) = \int_0^\infty e^{-st}\,e^{iat}\,dt = \frac{1}{s-ia} = \frac{s+ia}{s^2+a^2}$$

Therefore,

$$\mathcal{L}(\cos at) = \frac{s}{s^2+a^2} \tag{4.160}$$

and

$$\mathcal{L}(\sin at) = \frac{a}{s^2+a^2} \tag{4.161}$$

(e) $f(t) = e^{-bt}\,\theta(t)$
In this case

$$\mathcal{L}(e^{-bt}) = \frac{1}{s+b} \tag{4.162}$$

(f) $f(t) = e^{-bt}\cos at\,\theta(t)$
This one can be done by replacing s by (s + b) in (4.160); we find that

$$\mathcal{L}(e^{-bt}\cos at) = \frac{s+b}{(s+b)^2+a^2} \tag{4.163}$$

(g) $f(t) = e^{-bt}\sin at\,\theta(t)$
By the same procedure as in (f), we obtain

$$\mathcal{L}(e^{-bt}\sin at) = \frac{a}{(s+b)^2+a^2} \tag{4.164}$$


The above transforms form the basis of a table which can be read 
either to give the transforms from the functions or the functions from 
their transforms. Using it in the latter way, it may be shown that they 
permit us to calculate the function whose transform is P(s)/Q(s), where 
P(s) is a polynomial with real coefficients in s and Q(s) is a similar 
polynomial of higher degree. 

To prove that this is the case, let us suppose that Q(s) is a
polynomial of degree n. It then has n zeros, so that it is possible to write it as

$$Q(s) = a\,(s-s_1)(s-s_2)\cdots(s-s_n) \tag{4.165}$$




Because the coefficients of Q(s) are real, any complex roots will occur in
conjugate pairs; that is to say, if

$$s_i = a_i+ib_i \tag{4.166}$$

is a root, so is $a_i-ib_i$. The two linear factors can then be combined into a
quadratic one:

$$(s-a_i)^2+b_i^2$$


Let us first suppose that there are no repeated roots. Then we may
expand

$$F(s) = \frac{P(s)}{Q(s)} \tag{4.167}$$

in partial fractions:

$$F(s) = \sum_i\frac{A_i}{s-s_i} + \sum_k\frac{B_k+C_ks}{(s-a_k)^2+b_k^2} \tag{4.168}$$

where $s_i$ are the real roots of Q(s) and $a_k \pm ib_k$ are the complex ones. The
coefficients $A_i$, $B_k$, and $C_k$ may be obtained in terms of P(s) and Q(s). If we first
multiply across by $s-s_i$, and then put $s = s_i$, we get the constant $A_i$:

$$A_i = \lim_{s\to s_i}\frac{(s-s_i)\,P(s)}{Q(s)} \tag{4.169}$$

Both numerator and denominator of $(s-s_i)/Q(s)$ approach zero as $s \to s_i$,
so we may use L’Hospital’s rule to get

$$A_i = \frac{P(s_i)}{Q'(s_i)} \tag{4.170}$$
Equation (4.170) may in turn be used to get the terms in the partial
fraction expression from two roots $s_k$ and $s_k^*$ which are complex
conjugates of each other; together, they are

$$\frac{P(s_k)}{Q'(s_k)}\,\frac{1}{s-s_k} + \frac{P(s_k^*)}{Q'(s_k^*)}\,\frac{1}{s-s_k^*} = \frac{[P(s_k)Q'(s_k^*)+P(s_k^*)Q'(s_k)]\,s - P(s_k^*)Q'(s_k)\,s_k - P(s_k)Q'(s_k^*)\,s_k^*}{Q'(s_k)\,Q'(s_k^*)\,[(s-a_k)^2+b_k^2]} \tag{4.171}$$

where

$$s_k = a_k+ib_k.$$


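Comparing this with (4.168), it is found that

$$B_k = -\frac{P(s_k)}{Q'(s_k)}\,s_k^* - \frac{P(s_k^*)}{Q'(s_k^*)}\,s_k \tag{4.172}$$

and

$$C_k = \frac{P(s_k)}{Q'(s_k)} + \frac{P(s_k^*)}{Q'(s_k^*)} \tag{4.173}$$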



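This may be written in the alternative form

$$B_k = -2a_k\,\mathrm{Re}\left(\frac{P(s_k)}{Q'(s_k)}\right) - 2b_k\,\mathrm{Im}\left(\frac{P(s_k)}{Q'(s_k)}\right) \tag{4.174}$$

and

$$C_k = 2\,\mathrm{Re}\left(\frac{P(s_k)}{Q'(s_k)}\right) \tag{4.175}$$

To obtain the inverse of (4.168), that is, the function of which F(s)
is the transform, we need only write the numerators in the second sum in
the form

$$B_k+C_ks = B_k+C_ka_k+C_k(s-a_k) \tag{4.176}$$

It is then evident, from (4.163) and (4.164) with $b = -a_k$ and $a = b_k$, that

$$f(t) = \sum_i A_i\,e^{s_it} + \sum_k C_k\,e^{a_kt}\cos b_kt + \sum_k\frac{1}{b_k}\,(B_k+C_ka_k)\,e^{a_kt}\sin b_kt \tag{4.177}$$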
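
For distinct roots, the rule $A_i = P(s_i)/Q'(s_i)$ of (4.170) is enough to invert $P(s)/Q(s)$ numerically if complex roots are kept as they are and the real part is taken at the end. The sketch below is illustrative, not a general-purpose routine: the function names are ours, and the roots of Q(s) are assumed to be known and supplied by the caller.

```python
import cmath
import math

def poly_eval(coeffs, s):
    """Horner evaluation; coeffs run from the highest power down."""
    result = 0.0
    for c in coeffs:
        result = result * s + c
    return result

def inverse_simple_poles(p_coeffs, roots, lead=1.0):
    """Return f(t) = sum_i P(s_i)/Q'(s_i) exp(s_i t) for distinct roots of Q.

    Q(s) = lead * prod_j (s - s_j), so Q'(s_i) = lead * prod_{j != i} (s_i - s_j).
    """
    residues = []
    for i, s_i in enumerate(roots):
        q_prime = lead
        for j, s_j in enumerate(roots):
            if j != i:
                q_prime *= (s_i - s_j)
        residues.append(poly_eval(p_coeffs, s_i) / q_prime)

    def f(t):
        total = sum(A * cmath.exp(s_i * t) for A, s_i in zip(residues, roots))
        return total.real  # conjugate pairs combine into a real function
    return f

# Example: F(s) = (s + 3) / [(s + 1)((s + 1)^2 + 4)]
f = inverse_simple_poles([1.0, 3.0], [-1.0, -1.0 + 2.0j, -1.0 - 2.0j])
t = 0.7
by_hand = math.exp(-t) * (0.5 - 0.5 * math.cos(2 * t) + 0.5 * math.sin(2 * t))
print(f(t), by_hand)
```

The hand result follows from the partial fractions $\frac{1/2}{s+1} + \frac{1-\frac12(s+1)}{(s+1)^2+4}$ together with (4.162) to (4.164).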


Although this approach can also be extended to deal with the case 
of repeated roots, the problem may be treated more naturally by the 
inverse theorem, which we now discuss. 

It is possible to see how to invert a Laplace transform by first starting 
from the Fourier integral theorem: 


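$$f(t) = \int_{-\infty}^{\infty}\phi(\omega)\,e^{i\omega t}\,d\omega \tag{4.178}$$

where

$$\phi(\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty}f(t)\,e^{-i\omega t}\,dt \tag{4.179}$$

Consider a function g(t) which does not have a transform by virtue of the
transform integral diverging at $+\infty$. Suppose, however, that $g(t)e^{-\sigma t}$ has a
transform for large enough $\sigma$. Then consider the transform of $2\pi g(t)e^{-\sigma t}\theta(t)$
[or, alternatively, of $2\pi g(t)e^{-\sigma t}$, where g(t) = 0 for t < 0]:

$$\phi(\sigma+i\omega) = \int_0^\infty g(t)\,e^{-(\sigma+i\omega)t}\,dt \tag{4.180}$$

We write $s = \sigma+i\omega$, and call it the Laplace transform variable. The
theorem expressing the function in terms of its transform is, then

$$2\pi g(t)\,e^{-\sigma t}\,\theta(t) = \int_{-\infty}^{\infty}\phi(\sigma+i\omega)\,e^{i\omega t}\,d\omega \tag{4.181}$$

The integral on the right-hand side may be considered as an integral over
s along a contour parallel to the imaginary axis:

$$g(t)\,\theta(t) = \frac{1}{2\pi i}\int_{\sigma-i\infty}^{\sigma+i\infty}\phi(s)\,e^{st}\,ds \tag{4.182}$$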


The integral on the right must, of course, vanish for t < 0. Now, in this 




Figure 4.5 


case it may be evaluated as a complex integral by using as a contour a
semicircle with center at A and a radius which approaches $\infty$, and extending
to the right of the dotted line LM representing the range of integration (see
Fig. 4.5). The contribution to the integral from the semicircular contour is
zero by virtue of the factor $e^{R(\cos\theta+i\sin\theta)t}$ with $t < 0$, where $R = |s-\sigma| \to \infty$. For the
integral to be zero, it is then necessary that there be no poles in $\phi(s)$ within
this contour; that is, to the right of LM. The contour of integration may
then be shifted to the right without affecting the value of the integral. It
may be shifted to the left, however, only so long as it remains to the right
of all singularities of $\phi(s)$. This consideration must then determine the
choice of contour when a transform is known and the function from which
it is derived is to be found.

With this understanding, (4.182) is the expression of the Laplace 
Inverse Theorem. 

The inverse theorem of course provides an alternative means of 
evaluating the functions whose transforms are quotients P(s)/Q(s) of 
polynomials. However, it can be used for a wider class of transforms 
$\phi(s)$. If $\phi(s)$ has no branch points, the integral (4.182) is $2\pi i$ times the
sum of the residues at the poles. If there is an nth-order singularity at $s_i$,
the residue is obtainable first by writing

$$\phi(s) = \frac{1}{(s-s_i)^n}\left[(s-s_i)^n\,\phi(s)\right] \tag{4.183}$$




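and then picking out the coefficient of $(s-s_i)^{n-1}$ in the Taylor’s series
expansion of $(s-s_i)^n\phi(s)e^{st}$. The residue is then

$$a_i(t) = \frac{1}{(n-1)!}\left\{\frac{d^{n-1}}{ds^{n-1}}\left[(s-s_i)^n\,\phi(s)\,e^{st}\right]\right\}_{s=s_i} \tag{4.184}$$

The Laplace inverse is, then,

$$f(t) = \sum_i a_i(t) \tag{4.185}$$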


Of course, for each pole s; the order n of the singularity has its distinct 
value. 


The derivative in (4.184) may be evaluated by Leibnitz’s rule, giving

$$a_i(t) = \sum_{m=0}^{n-1}\frac{1}{m!\,(n-1-m)!}\left\{\frac{d^m}{ds^m}\left[(s-s_i)^n\,\phi(s)\right]\right\}_{s=s_i}t^{\,n-1-m}\,e^{s_it} \tag{4.186}$$

the factors (n−1)! having cancelled in numerator and denominator.
Equation (4.186) applies, of course, whether the poles $s_i$ are real or
complex. The complex ones may be combined in complex conjugate
pairs to give the real function f(t).


16. Laplace Transform of a
Periodic Function 


Consider a function $\phi(t)$ defined on the interval $0 \le t < T$. Construct
the function f(t) which is equal to $\phi(t)$ on that interval and defined by
periodicity for other positive values of t. Its Laplace transform is

$$F(s) = \sum_{n=0}^{\infty}\int_{nT}^{(n+1)T}\phi(t-nT)\,e^{-st}\,dt \tag{4.187}$$

In the nth term, we may substitute $t' = t-nT$ to get

$$F(s) = \sum_{n=0}^{\infty}e^{-nsT}\int_0^T\phi(t')\,e^{-st'}\,dt' \tag{4.188}$$

Designating the integral as $\Phi(s)$, this is

$$F(s) = \Phi(s)\sum_{n=0}^{\infty}e^{-nsT} = \frac{\Phi(s)}{1-e^{-sT}} \tag{4.189}$$

In inverting (4.189), there are contributions from all of the poles of the
second factor, $(1-e^{-sT})^{-1}$. These poles are at

$$s = s_n = \frac{2\pi in}{T} \tag{4.190}$$




where n is any positive or negative integer, or zero. To find the
contribution to the inverse from these poles, we must calculate the residues at the
values $s_n$. Putting

$$s = s_n+\epsilon \tag{4.191}$$

we see that

$$1-e^{-sT} = 1-e^{-(s_n+\epsilon)T} \approx \epsilon T = (s-s_n)\,T \tag{4.192}$$

for $\epsilon$ small. The residue is, therefore,

$$\frac{\Phi(s_n)}{T}$$

The total contribution from these poles is then

$$f(t) = \sum_n\frac{\Phi(s_n)}{T}\,e^{s_nt} \tag{4.193}$$

$$= \frac{1}{T}\sum_n\Phi\left(\frac{2\pi in}{T}\right)e^{2\pi int/T} \tag{4.194}$$

The coefficient of $e^{2\pi int/T}$ is

$$\frac{1}{T}\,\Phi\left(\frac{2\pi in}{T}\right) = \frac{1}{T}\int_0^T\phi(t)\,e^{-2\pi int/T}\,dt \tag{4.195}$$

We see, therefore, by comparison with (4.7) and (4.11), that (4.194) is the
Fourier series expansion of the function f(t). It follows that, for positive
t, there are no other contributing poles to the inverse transform

$$f(t) = \frac{1}{2\pi i}\int_{\sigma-i\infty}^{\sigma+i\infty}\frac{\Phi(s)}{1-e^{-sT}}\,e^{st}\,ds \tag{4.196}$$

where, in this case, $\sigma$ may be taken to be any value greater than zero.
That there are no such poles is evident in any case, since if there were,
they would contribute a negative exponential behavior to $\phi(t)$ when we
invert $\Phi(s)$; we know, however, that $\phi(t)$ is precisely zero for t > T.
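
The closed form (4.189) can be compared with a direct period-by-period evaluation of (4.187). The sketch below is illustrative: it assumes a sawtooth, $\phi(t) = t$ on $0 \le t < T$, for which $\Phi(s) = [1-e^{-sT}(1+sT)]/s^2$, and the function names and the truncation after a finite number of periods are our own choices.

```python
import math

def phi(t):
    """One period of the assumed sawtooth, phi(t) = t on [0, T)."""
    return t

def transform_direct(s, T, periods=60, n=400):
    """Sum (4.187) period by period with the trapezoid rule."""
    h = T / n
    total = 0.0
    for k in range(periods):
        # end points of the k-th period; phi(T) is the limit from the left
        segment = 0.5 * (phi(0.0) * math.exp(-s * k * T)
                         + phi(T) * math.exp(-s * (k + 1) * T))
        for i in range(1, n):
            tau = i * h
            segment += phi(tau) * math.exp(-s * (k * T + tau))
        total += segment * h
    return total

def transform_closed(s, T):
    """Phi(s) / (1 - exp(-sT)), as in (4.189), for the sawtooth."""
    big_phi = (1.0 - math.exp(-s * T) * (1.0 + s * T)) / s ** 2
    return big_phi / (1.0 - math.exp(-s * T))

direct = transform_direct(0.8, 1.0)
closed = transform_closed(0.8, 1.0)
print(direct, closed)
```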


17. Resonance 


Consider the second-order linear equation describing the reaction of 
a damped oscillator to an external stimulus f(t): 




$$\frac{d^2x}{dt^2}+\lambda\,\frac{dx}{dt}+\omega^2x = f(t) \tag{4.197}$$


This equation may be solved by taking its Laplace transform. Initial 
conditions will be assumed 


$$x(0) = x_0 \tag{4.198}$$

$$\dot{x}(0) = v_0 \tag{4.199}$$

The transform of x(t) will be designated X(s); that of f(t), F(s). The
transform of (4.197) is, then,

$$-v_0-sx_0+s^2X-\lambda x_0+\lambda sX+\omega^2X = F(s) \tag{4.200}$$

so that

$$X(s) = \frac{v_0+(s+\lambda)\,x_0}{s^2+\lambda s+\omega^2} + \frac{F(s)}{s^2+\lambda s+\omega^2} \tag{4.201}$$

The first term represents a response to initial conditions, and thus has a
transient character. Its only poles are

$$s = -\frac{\lambda}{2}\pm i\sqrt{\omega^2-\frac{\lambda^2}{4}} \tag{4.202}$$

where we have assumed $\omega^2 > \lambda^2/4$. These roots are called $s_1$ (plus sign)
and $s_2$ (minus sign). The contribution from these poles to the inverse of the
first term of (4.201) is

$$x_1(t) = \frac{v_0+(s_1+\lambda)\,x_0}{s_1-s_2}\,e^{s_1t} + \frac{v_0+(s_2+\lambda)\,x_0}{s_2-s_1}\,e^{s_2t} = (v_0+\lambda x_0)\,\frac{e^{s_1t}-e^{s_2t}}{s_1-s_2} + x_0\,\frac{s_1e^{s_1t}-s_2e^{s_2t}}{s_1-s_2} \tag{4.203}$$

Putting in the values of $s_1$ and $s_2$, the transient terms in the solution are
seen to be

$$x_1(t) = e^{-\lambda t/2}\left\{\frac{v_0+\frac12\lambda x_0}{\omega_0}\,\sin\omega_0t + x_0\cos\omega_0t\right\} \tag{4.204}$$

where

$$\omega_0 = \sqrt{\omega^2-\frac{\lambda^2}{4}} \tag{4.205}$$

So far as the contributions to x(t) from the second (forcing) term of
(4.201) are concerned, they come from the poles of F(s) as well as from
$s_1$ and $s_2$. Let the poles of F(s) be $s_i$, $s_i^*$ (i = 3, 4, ...), and the residues be
$b_i$, $b_i^*$, respectively. The contribution from such poles is

$$x_2(t) = \sum_i\left[\frac{b_i\,e^{s_it}}{s_i^2+\lambda s_i+\omega^2} + \frac{b_i^*\,e^{s_i^*t}}{s_i^{*2}+\lambda s_i^*+\omega^2}\right] \tag{4.206}$$




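The contributions from $s_1$ and $s_2 = s_1^*$ are, on the other hand,

$$x_3(t) = \frac{F(s_1)\,e^{s_1t}}{s_1-s_2} + \frac{F(s_2)\,e^{s_2t}}{s_2-s_1} \tag{4.207}$$

In view of the fact that

$$s^2+\lambda s+\omega^2 = (s-s_1)(s-s_2) = (s-s_1)(s-s_1^*) \tag{4.208}$$

the denominators in (4.206) are

$$D_i = (s_i-s_1)(s_i-s_1^*)$$

and

$$(s_i^*-s_1)(s_i^*-s_1^*) = D_i^* \tag{4.209}$$

respectively. While the terms (4.207) are again transient, if $s_i$ is purely
imaginary [that is, if f(t) has a periodic component], (4.206) gives a
nontransient contribution. Putting

$$b_i = B_i\,e^{i\delta_i} \tag{4.210}$$

this contribution is

$$x_2(t) = \sum_i\left[\frac{B_i\,e^{i(\omega_it+\delta_i)}}{(i\omega_i-s_1)(i\omega_i-s_1^*)} + \frac{B_i\,e^{-i(\omega_it+\delta_i)}}{(-i\omega_i-s_1)(-i\omega_i-s_1^*)}\right] \tag{4.211}$$

where

$$s_i = i\omega_i \tag{4.212}$$

With some simplification, this is

$$x_2(t) = \sum_iB_i\left[\frac{e^{i(\omega_it+\delta_i)}}{\omega_0^2+\frac14\lambda^2-\omega_i^2+i\lambda\omega_i} + \frac{e^{-i(\omega_it+\delta_i)}}{\omega_0^2+\frac14\lambda^2-\omega_i^2-i\lambda\omega_i}\right] = \sum_i\frac{2B_i}{(\omega^2-\omega_i^2)^2+\lambda^2\omega_i^2}\left[(\omega^2-\omega_i^2)\cos(\omega_it+\delta_i)+\lambda\omega_i\sin(\omega_it+\delta_i)\right] \tag{4.213}$$

As an example, consider the case in which

$$f(t) = A\cos\omega_1t \tag{4.214}$$

Then

$$F(s) = \frac{As}{s^2+\omega_1^2} \tag{4.215}$$

The poles are

$$s_3 = i\omega_1 \quad\text{and}\quad s_3^* = -i\omega_1$$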



and the residues are

$$b_3 = b_3^* = \frac{A}{2}$$

The solution for $x_2(t)$ is, therefore,

$$x_2(t) = \frac{A}{(\omega^2-\omega_1^2)^2+\lambda^2\omega_1^2}\left[(\omega^2-\omega_1^2)\cos\omega_1t+\lambda\omega_1\sin\omega_1t\right] \tag{4.216}$$

Putting

$$\tan\gamma_1 = \frac{\lambda\omega_1}{\omega^2-\omega_1^2}$$

this may be put in the form

$$x_2(t) = \frac{A}{\sqrt{(\omega^2-\omega_1^2)^2+\lambda^2\omega_1^2}}\,\cos(\omega_1t-\gamma_1) \tag{4.217}$$

The resonant behavior at $\omega_1 = \omega$ is readily seen. At this resonance $\gamma_1 = \pi/2$,
so that

$$x_2(t) = \frac{A}{\lambda\omega}\,\sin\omega_1t \tag{4.218}$$

Both the “height” and the “width” of the resonance are determined by the
damping constant $\lambda$. The smaller the value of $\lambda$, the higher and narrower is
the resonance.

The key to the analysis of the resonant behavior is in Eq. (4.211),
where it is seen that the poles at $\pm i\omega_i$ pass near to the “natural” poles at
$-(\lambda/2)\pm i\omega_0$, when $\omega_i$ is in the neighborhood of $\omega_0$. The denominators
then become small, leading to a peak in the magnitude of $x_2(t)$.
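
The steady-state response (4.216) can be checked by substituting it back into (4.197) with $f(t) = A\cos\omega_1t$. The sketch below does this with central differences; the function names, the step size h, and the parameter values are illustrative assumptions.

```python
import math

def steady_state(t, A, omega, lam, omega1):
    """x_2(t) of (4.216) for the drive A cos(omega1 t)."""
    detune = omega ** 2 - omega1 ** 2
    norm = detune ** 2 + (lam * omega1) ** 2
    return A / norm * (detune * math.cos(omega1 * t)
                       + lam * omega1 * math.sin(omega1 * t))

def amplitude(A, omega, lam, omega1):
    """Amplitude appearing in (4.217)."""
    return A / math.sqrt((omega ** 2 - omega1 ** 2) ** 2 + (lam * omega1) ** 2)

def residual(t, A, omega, lam, omega1, h=1e-4):
    """x'' + lam x' + omega^2 x - A cos(omega1 t), by central differences."""
    x = lambda u: steady_state(u, A, omega, lam, omega1)
    x_dot = (x(t + h) - x(t - h)) / (2.0 * h)
    x_ddot = (x(t + h) - 2.0 * x(t) + x(t - h)) / h ** 2
    return x_ddot + lam * x_dot + omega ** 2 * x(t) - A * math.cos(omega1 * t)

print(residual(0.37, 1.0, 2.0, 0.3, 1.7))   # should be close to zero
print(amplitude(1.0, 2.0, 0.3, 2.0))        # at resonance: A / (lam * omega)
```

Scanning `amplitude` over $\omega_1$ for several values of $\lambda$ shows the peak growing and narrowing as the damping is reduced.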


18. Use of Laplace and Fourier 
Transforms to Solve the Vibrating 
String Problem 


Consider the vibrating string equation 


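$$\frac{\partial^2y}{\partial x^2} = \frac{1}{c^2}\,\frac{\partial^2y}{\partial t^2} \tag{4.219}$$

subject to the initial conditions

$$y = y_0(x), \qquad t = 0 \tag{4.220}$$

$$\frac{\partial y}{\partial t} = v_0(x), \qquad t = 0 \tag{4.221}$$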

If we take Y(x, s) to be the Laplace transform of y(x, t) with respect to




the time variable, we may take the transform of (4.219) to obtain the 
equation 
$$\frac{\partial^2Y}{\partial x^2} = \frac{1}{c^2}\left[s^2Y-s\,y_0(x)-v_0(x)\right] \tag{4.222}$$

If the string is infinitely long, we may now take the Fourier transform 
of this equation. Writing 


$$\xi(k,s) = \mathcal{F}(Y(x,s)) \tag{4.223}$$

the equation for the transform becomes

$$-k^2\xi = \frac{1}{c^2}\left[s^2\xi-s\,\bar{y}_0-\bar{v}_0\right] \tag{4.224}$$

where $\bar{y}_0(k)$, $\bar{v}_0(k)$ are, respectively, the Fourier transforms of $y_0(x)$,
$v_0(x)$.
Thus

$$\xi(k,s) = \frac{s\,\bar{y}_0+\bar{v}_0}{s^2+c^2k^2} \tag{4.225}$$

Taking the inverse Laplace transform of this equation, and calling the
inverse of $\xi(k,s)$, $\eta(k,t)$,

$$\eta(k,t) = \bar{y}_0\cos kct + \bar{v}_0\,\frac{\sin kct}{kc} \tag{4.226}$$


The Fourier inverse of the first term is

$$\frac12\int\bar{y}_0(k)\,(e^{ikct}+e^{-ikct})\,e^{ikx}\,dk = \frac12\left[y_0(x+ct)+y_0(x-ct)\right] \tag{4.227}$$

To obtain the inverse transform of the second term, we note that $v_0(x)$
replaces $y_0(x)$, and that $\sin kct/kc$ is the integral from 0 to t of $\cos kct$.
Thus the inverse of the second term is

$$\frac{1}{2c}\left[\int_x^{x+ct}v_0(x')\,dx' - \int_x^{x-ct}v_0(x')\,dx'\right] = \frac{1}{2c}\int_{x-ct}^{x+ct}v_0(x')\,dx' \tag{4.228}$$

This leads, then, to the solution obtained in the first chapter, Eq. (1.17):

$$y(x,t) = \frac12\left[y_0(x+ct)+y_0(x-ct)+\frac{1}{c}\int_{x-ct}^{x+ct}v_0(x')\,dx'\right] \tag{4.229}$$
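
Formula (4.229) is simple to evaluate numerically. The following sketch is illustrative only: the initial data $y_0 = e^{-x^2}$ and $v_0 = x\,e^{-x^2}$ are our own choice (made because the $v_0$ integral is then elementary), and the function name and quadrature are assumptions.

```python
import math

def dalembert(y0, v0, c, x, t, n=2000):
    """Evaluate (4.229); the v0 integral uses the trapezoid rule."""
    a, b = x - c * t, x + c * t
    h = (b - a) / n
    integral = 0.5 * (v0(a) + v0(b))
    for i in range(1, n):
        integral += v0(a + i * h)
    integral *= h
    return 0.5 * (y0(b) + y0(a) + integral / c)

y0 = lambda x: math.exp(-x * x)        # assumed initial displacement
v0 = lambda x: x * math.exp(-x * x)    # assumed initial velocity

c, x, t = 2.0, 0.3, 0.4
a, b = x - c * t, x + c * t
# closed form for this choice: the integral of v0 is (e^{-a^2} - e^{-b^2}) / 2
exact = (0.5 * (math.exp(-b * b) + math.exp(-a * a))
         + (math.exp(-a * a) - math.exp(-b * b)) / (4.0 * c))
print(dalembert(y0, v0, c, x, t), exact)
```

At t = 0 the function returns $y_0(x)$, as the initial condition (4.220) requires.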


Problem 4-14: Verify that the same result may be 


obtained by taking first the inverse Laplace transform of (4.225), followed 
by the inverse Fourier transform. 


If the string is finite and of length L, it is possible to proceed from Eq. 




(4.222), taking the Fourier series decomposition of the equation rather
than the Fourier transform. If the string is of length L, and the ends are
fixed, we write

$$Y(x,s) = \sum_{n=1}^{\infty}\bar{c}_n(s)\,\sin\frac{2\pi nx}{L} \tag{4.230}$$

where

$$\bar{c}_n(s) = \frac{2}{L}\int_0^LY(x',s)\,\sin\frac{2\pi nx'}{L}\,dx' \tag{4.231}$$

We also let the nth Fourier coefficients of $y_0$, $v_0$ be $a_n$, $b_n$, respectively.
Then, substituting in (4.222) and equating coefficients, it is found that

$$-\left(\frac{2\pi n}{L}\right)^2\bar{c}_n = \frac{1}{c^2}\left[s^2\bar{c}_n-s\,a_n-b_n\right] \tag{4.232}$$

This may be solved to give

$$\bar{c}_n(s) = \frac{s\,a_n+b_n}{s^2+(2\pi nc/L)^2} \tag{4.233}$$

Taking the inverse Laplace transform, which we designate $c_n(t)$, we
obtain

$$c_n(t) = a_n\cos\frac{2\pi nct}{L} + \frac{L}{2\pi nc}\,b_n\sin\frac{2\pi nct}{L} \tag{4.234}$$

Then

$$y(x,t) = \sum_nc_n(t)\,\sin\frac{2\pi nx}{L} = \sum_n\left(a_n\cos\frac{2\pi nct}{L}+\frac{L}{2\pi nc}\,b_n\sin\frac{2\pi nct}{L}\right)\sin\frac{2\pi nx}{L} \tag{4.235}$$
19. The Gamma Function 
The gamma function is defined for positive argument as

$$\Gamma(\alpha) = \int_0^\infty t^{\alpha-1}\,e^{-t}\,dt \tag{4.236}$$

This is met in calculating the Laplace transform of a general positive
power of t:

$$\int_0^\infty t^{\alpha-1}\,e^{-st}\,dt = \frac{\Gamma(\alpha)}{s^\alpha} \tag{4.237}$$

If $\alpha$ is an integer, for example, $\alpha = n$,

$$\Gamma(n) = (n-1)! \tag{4.238}$$




On integration by parts it is easily verified that

$$\Gamma(\alpha) = -\int_0^\infty t^{\alpha-1}\,d(e^{-t}) = -\left[t^{\alpha-1}e^{-t}\right]_0^\infty + (\alpha-1)\int_0^\infty t^{\alpha-2}\,e^{-t}\,dt = (\alpha-1)\,\Gamma(\alpha-1) \tag{4.239}$$

so long as we are dealing only with functions of positive argument.

If we wish to extend the definition of the Γ function to negative
arguments, we may do so by using (4.239). Thus, for instance, from this
formula

$$\Gamma(-4\tfrac23) = \frac{1}{(-4\tfrac23)(-3\tfrac23)(-2\tfrac23)(-1\tfrac23)(-\tfrac23)}\,\Gamma(\tfrac13)$$

We see, then, that the Γ function is defined for all arguments by (4.236)
with $0 < \alpha \le 1$, along with (4.239). It is easily seen that $\Gamma(z)$ has simple
poles at z = 0 and at all negative integral values.
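
The extension to negative arguments can be sketched in a few lines. The function below is an illustration (its name is ours); it applies $\Gamma(\alpha) = \Gamma(\alpha+1)/\alpha$ repeatedly until the argument is positive, and compares the result with the library routine `math.gamma`, which also accepts negative non-integer arguments.

```python
import math

def gamma_by_recursion(alpha):
    """Extend Gamma to negative non-integer alpha via Gamma(a) = Gamma(a + 1) / a."""
    factor = 1.0
    while alpha <= 0.0:
        if alpha == int(alpha):
            raise ValueError("pole at zero or a negative integer")
        factor *= alpha
        alpha += 1.0
    return math.gamma(alpha) / factor

value = gamma_by_recursion(-14.0 / 3.0)   # the example Gamma(-4 2/3)
print(value, math.gamma(-14.0 / 3.0))
```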


Problem 4-15: Prove the following formulas: 


$$\int_0^\infty t^{x-1}\cos t\,dt = \Gamma(x)\,\cos\frac{\pi x}{2}$$

$$\int_0^\infty t^{x-1}\sin t\,dt = \Gamma(x)\,\sin\frac{\pi x}{2}$$


20. Stirling’s Formula 


This is a very useful formula in statistical physics, which gives an 
asymptotic form (that is, a form for large argument) for the Γ function.
It is of particular interest for the case of integral argument (factorial). 

In order to find it, we use a technique known as ‘“‘method of steepest 
descents.’’ We first write 


$$\Gamma(z+1) = z^{z+1}\int_0^\infty t^z\,e^{-zt}\,dt = z^{z+1}\int_0^\infty e^{z(\ln t-t)}\,dt \tag{4.240}$$

We consider, then, the function $\Gamma(z+1)/z^{z+1}$ defined by the integral.

Our aim will now be to deform the contour so as to simplify the
evaluation of the integral for large |z|. We note that the integrand is
particularly large where $\mathrm{Re}\,z(\ln t-t)$ is positive. But if |z| is large, the
imaginary part will then in general produce very rapid oscillations of the




integrand which will make evaluation very difficult. This difficulty may be 
overcome if we can find a contour on which the imaginary part is a 
constant. 

Now it is well known that for a function of a complex variable (in
this case t) the contours on which the real and imaginary parts are
constant are orthogonal at every point. But the contour on which the imaginary
part is constant is then in the direction of the gradient of the real part. If
the real part is a maximum at $t = t_0$, its value will then decrease at a
maximum rate along this contour.

We see, then, that if we take such a contour through $t_0$ the largest
part of the integral will come from the immediate neighborhood of $t_0$.

Let us now carry this out for the case in point. The value $t_0$ is
determined by

$$\frac{d}{dt}(\ln t-t) = 0$$

or

$$t = t_0 = 1 \tag{4.241}$$

In fact, then, in this case it is not necessary to deform the contour at all;
we simply expand

$$\ln t-t = -1+\frac12\left[\frac{d^2}{dt^2}(\ln t-t)\right]_{t=1}(t-1)^2+\cdots = -1-\frac{(t-1)^2}{2}+\text{higher powers} \tag{4.242}$$

Assuming for the moment that when the real part of z is large the integral
comes predominantly from the region of the saddle point t = 1, it follows
that it is approximately

$$\Gamma(z+1) \approx z^{z+1}\,e^{-z}\int_0^\infty e^{-z(t-1)^2/2}\,dt \tag{4.243}$$

Putting $(t-1)/\sqrt{2} = \tau$ in the integrand this becomes

$$\Gamma(z+1) \approx z^{z+1}\,e^{-z}\sqrt{2}\int_{-1/\sqrt{2}}^{\infty}e^{-z\tau^2}\,d\tau \approx z^{z+1}\,e^{-z}\sqrt{2}\int_{-\infty}^{\infty}e^{-z\tau^2}\,d\tau = \sqrt{2\pi}\,z^{z+1/2}\,e^{-z} \tag{4.244}$$

This is Stirling’s formula. For z = the integer n it is often written in
logarithmic form:

$$\ln n! = (n+\tfrac12)\ln n-n+\tfrac12\ln 2\pi \tag{4.245}$$
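
A short comparison shows how quickly (4.245) becomes accurate. This is a minimal sketch; the function name is ours, and `math.lgamma(n + 1)` is used as the reference value of $\ln n!$.

```python
import math

def ln_factorial_stirling(n):
    """Right-hand side of (4.245)."""
    return (n + 0.5) * math.log(n) - n + 0.5 * math.log(2.0 * math.pi)

for n in (5, 20, 100):
    exact = math.lgamma(n + 1)            # ln n!
    approx = ln_factorial_stirling(n)
    print(n, exact, approx, exact - approx)
```

The printed difference falls off roughly as $1/(12n)$, the first term omitted from the asymptotic series.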




That this is increasingly accurate the larger the value of z (or n) is
easily seen from the following facts.

(a) The real part of the argument $z(\ln t-t)$ has a single maximum at
t = 1; it is monotonically decreasing (that is, takes on increasingly large
negative values) as t moves away from unity in either direction.

(b) The argument of the exponential in the integral is

$$z\left[-1-\frac{(t-1)^2}{2}+a_3(t-1)^3+a_4(t-1)^4+\cdots\right]$$

In evaluating the integral, one may put

$$\sqrt{\frac{z}{2}}\,(t-1) = u \tag{4.246}$$

In terms of this variable this argument is

$$-z-u^2+z\sum_{s=3}^{\infty}a_s\left(\frac{2}{z}\right)^{s/2}u^s \tag{4.247}$$


(c) By virtue of the factor $e^{-u^2}$, the integrand is negligible for $u > u_0$
(for instance, for $u_0$ between 5 and 6, $e^{-u_0^2}$ lies between about $10^{-11}$ and $10^{-16}$). The higher terms in
(4.247) are, then, negligible provided

$$z \gg 2u_0^2 \tag{4.248}$$


21. The β Function


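The β function is a function of two variables:

$$B(m,n) = \int_0^1s^{m-1}(1-s)^{n-1}\,ds = 2\int_0^{\pi/2}\sin^{2m-1}\theta\,\cos^{2n-1}\theta\,d\theta \tag{4.249}$$

on putting $s = \sin^2\theta$. This may now be expressed in terms of Γ functions
as follows: Consider the product

$$\Gamma(m)\,\Gamma(n) = \int_0^\infty t^{m-1}\,e^{-t}\,dt\int_0^\infty v^{n-1}\,e^{-v}\,dv \tag{4.250}$$

The variables in this double integral may be written in terms of polar
coordinates:

$$t = r^2\sin^2\theta \tag{4.251}$$

$$v = r^2\cos^2\theta \tag{4.252}$$

which gives

$$\Gamma(m)\,\Gamma(n) = 4\int_0^\infty r\,dr\int_0^{\pi/2}d\theta\;r^{2m+2n-2}\,e^{-r^2}\sin^{2m-1}\theta\,\cos^{2n-1}\theta \tag{4.253}$$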




Putting $r^2 = x$, $\sin^2\theta = s$, and writing the right-hand side as the product of
integrals, this takes the form

$$\Gamma(m)\,\Gamma(n) = \int_0^\infty x^{m+n-1}\,e^{-x}\,dx\int_0^1s^{m-1}(1-s)^{n-1}\,ds$$

Since the second integral is the β function, it can obviously be expressed
in the form:

$$B(m,n) = \frac{\Gamma(m)\,\Gamma(n)}{\Gamma(m+n)} \tag{4.254}$$
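
Relation (4.254) can be verified numerically for any convenient pair (m, n). The sketch below uses the midpoint rule on the defining integral (4.249); the function names and the choice m = 2.5, n = 3 (for which the integrand is smooth at both end points) are illustrative assumptions.

```python
import math

def beta_numeric(m, n, panels=20000):
    """Midpoint-rule estimate of the integral of s^(m-1) (1-s)^(n-1) over [0, 1]."""
    h = 1.0 / panels
    total = 0.0
    for i in range(panels):
        s = (i + 0.5) * h
        total += s ** (m - 1.0) * (1.0 - s) ** (n - 1.0)
    return total * h

def beta_from_gamma(m, n):
    """Right-hand side of (4.254)."""
    return math.gamma(m) * math.gamma(n) / math.gamma(m + n)

print(beta_numeric(2.5, 3.0), beta_from_gamma(2.5, 3.0))
```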


It might be noted in passing that if m and n are put equal to ½ in
(4.253),

$$\left[\Gamma(\tfrac12)\right]^2 = 4\int_0^\infty r\,e^{-r^2}\,dr\int_0^{\pi/2}d\theta = \pi$$

so that

$$\Gamma(\tfrac12) = \sqrt{\pi} \tag{4.255}$$

These formulas involving Γ and β functions are used in the discussion of
the Bessel function below.


22. Use of Transforms for 
Equations with Linear Coefficients 


A linear differential equation with constant coefficients can always be
solved by the use of transforms, since the transforms of derivatives of a
function are proportional to the transform of the function itself. We shall
show in this section that equations with coefficients linear in the independent
variable can also be solved by means of transforms.

Consider a differential equation of the form 


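$$[x f(D) + g(D)]\,y = 0 \tag{4.256}$$


where $f(D)$ and $g(D)$ are linear differential operators, $D$ being synonymous
with the derivative operator $d/dx$. If $y$ is expressed in the form


$$y = \int_{s_1}^{s_2} e^{sx} F(s)\,ds \tag{4.257}$$

then

$$f(D)y = \int_{s_1}^{s_2} e^{sx} f(s)F(s)\,ds \tag{4.258}$$

and

$$x f(D)y = \int_{s_1}^{s_2} f(s)F(s)\,\frac{d}{ds}e^{sx}\,ds = \left[f(s)F(s)e^{sx}\right]_{s_1}^{s_2} - \int_{s_1}^{s_2} e^{sx}\,\frac{d}{ds}\{f(s)F(s)\}\,ds \tag{4.259}$$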



Equation (4.256) is then satisfied by choosing the function $F(s)$ and the
limits $s_1$ and $s_2$ so that


$$\left[f(s)F(s)e^{sx}\right]_{s_1}^{s_2} - \int_{s_1}^{s_2} e^{sx}\left[\frac{d}{ds}\{f(s)F(s)\} - g(s)F(s)\right]ds = 0 \tag{4.260}$$
The procedure is now to choose F(s) so as to make the quantity in the 


square bracket zero; once F(s) is known, the limits may be chosen to 
make the integrated term vanish. Putting 


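$$f(s)F(s) = \phi(s) \tag{4.261}$$
we must first solve
$$\frac{d\phi}{ds} = \frac{g(s)}{f(s)}\,\phi(s) \tag{4.262}$$
The solution is
$$\phi(s) = \exp\left[\int^s \frac{g(s')}{f(s')}\,ds'\right] \tag{4.263}$$
Thus
$$F(s) = \frac{1}{f(s)}\exp\left[\int^s \frac{g(s')}{f(s')}\,ds'\right] \tag{4.264}$$
The integrated term is
$$\left[\exp\left(sx + \int^s \frac{g(s')}{f(s')}\,ds'\right)\right]_{s_1}^{s_2} \tag{4.265}$$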


The choice of $s_1$ and $s_2$ to make this zero will, of course, depend on the
functions $f(s)$ and $g(s)$, and is in general not unique.

Before turning to particular applications, we make two observations: 
first, that y in (4.257) has the form of a Laplace transform (that is, it is a 
continuous linear combination of exponentials). Taking s purely imaginary 
(s = it) it would, however, have the form of a Fourier transform. Since 
the choice of contour and of limits of integration is determined by the 
calculation, it cannot be said in advance which form will emerge. Secondly,
if we tried a similar technique for an equation with quadratic coefficients,
or ones with still higher powers, the method would in general not
be a feasible one to employ. For if there were a term of the form $x^2 h(D)$,
we would have


$$x^2 h(D)y = x^2\int_{s_1}^{s_2} e^{sx} h(s)F(s)\,ds = \int_{s_1}^{s_2} h(s)F(s)\,\frac{d^2}{ds^2}e^{sx}\,ds \tag{4.266}$$
It would then be necessary to integrate by parts twice. This would lead to 


two integrated terms— which would create no fundamental problems— 
and to the integral term 


$$\int_{s_1}^{s_2} e^{sx}\,\frac{d^2}{ds^2}\{h(s)F(s)\}\,ds \tag{4.267}$$



Adding such a term to (4.260), we see that F(s) is determined now by the 
solution of a rather complicated second-order differential equation. In 
general, there is no reason to believe that this will be easier to solve than 
the original equation. 

Though the method seems, therefore, to be applicable only to equa- 
tions with linear coefficients, it should be borne in mind that in fact it 
applies to all equations which can be deduced from these by a transforma- 
tion of either dependent or independent variable, or both. 


23. The Confluent 
Hypergeometric Function 


The confluent hypergeometric equation is 


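$$x y'' + (\gamma - x)y' - \beta y = 0 \tag{4.268}$$
This is a particular case of (4.256), where
$$f(s) = s^2 - s \tag{4.269}$$
and
$$g(s) = \gamma s - \beta \tag{4.270}$$


The exponent in (4.264) is then


$$\int \frac{g(s)}{f(s)}\,ds = \int \left[\frac{\gamma - \beta}{s - 1} + \frac{\beta}{s}\right]ds = (\gamma - \beta)\ln(1 - s) + \beta \ln s + \text{constant} \tag{4.271}$$


It follows that
$$F(s) = (1 - s)^{\gamma - \beta - 1}\, s^{\beta - 1} \tag{4.272}$$
The integrated term in (4.260) is
$$\left[(1 - s)^{\gamma - \beta}\, s^{\beta}\, e^{sx}\right]_{s_1}^{s_2} \tag{4.273}$$


If $s_1$ and $s_2$ are chosen to make this zero, the solution of the differential
equation is
$$y = C\int_{s_1}^{s_2} (1 - s)^{\gamma - \beta - 1}\, s^{\beta - 1}\, e^{sx}\,ds \tag{4.274}$$
where $C$ is a constant.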
Consider the case in which $\gamma > \beta > 0$. Then it is possible to choose
$s_1 = 0$, $s_2 = 1$. The solution then has an essential (exponential) singularity
when $x \to +\infty$. A series solution may be obtained by expanding the
exponential:


$$y = C\sum_{n=0}^{\infty} \frac{x^n}{n!}\int_0^1 (1-s)^{\gamma-\beta-1} s^{\beta+n-1}\,ds = C\sum_{n=0}^{\infty} \frac{\Gamma(\gamma-\beta)\,\Gamma(\beta+n)}{\Gamma(\gamma+n)}\,\frac{x^n}{n!} \tag{4.275}$$



If we write
$$\Gamma(\beta+n) = \beta(\beta+1)\cdots(\beta+n-1)\,\Gamma(\beta) = \beta_n\,\Gamma(\beta) \tag{4.276}$$

where $\beta_n$ is the product of $n$ factors, the first of which is $\beta$ and each
subsequent one is one greater than the preceding, the solution may be written

$$y_1(x) = C\,\frac{\Gamma(\gamma-\beta)\,\Gamma(\beta)}{\Gamma(\gamma)}\sum_{n=0}^{\infty} \frac{\beta_n}{\gamma_n}\,\frac{x^n}{n!}$$

The constant $C$ may be chosen so that the factor in front of the sum is
unity. The solution resulting from this choice is the “confluent hypergeometric
function”


$$y_1(x) = F(\beta, \gamma, x) = \sum_{n=0}^{\infty} \frac{\beta_n}{\gamma_n}\,\frac{x^n}{n!} \tag{4.277}$$


With the value of $C$ so chosen, the integral representation of the confluent
hypergeometric function is


$$F(\beta, \gamma, x) = \frac{\Gamma(\gamma)}{\Gamma(\beta)\,\Gamma(\gamma-\beta)}\int_0^1 (1-s)^{\gamma-\beta-1}\, s^{\beta-1}\, e^{sx}\,ds \tag{4.278}$$
This form is valid except when either $\beta$ or $\gamma - \beta$ is negative or zero.
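The agreement between the series (4.277) and the integral representation (4.278) can be confirmed numerically. The sketch below is illustrative only: the function names, the truncation of the series, and the use of Simpson's rule are our assumptions, and the quadrature is restricted to $\beta \ge 1$, $\gamma - \beta \ge 1$ so that the integrand is bounded at both end points.

```python
import math

def kummer_series(beta, gamma, x, terms=200):
    """Series (4.277): sum over n of (beta_n / gamma_n) x^n / n!.

    Assumes gamma is not zero or a negative integer.
    """
    term = 1.0
    total = 1.0
    for n in range(terms):
        # ratio of term n+1 to term n
        term *= (beta + n) / (gamma + n) * x / (n + 1)
        total += term
    return total

def kummer_integral(beta, gamma, x, steps=2000):
    """Integral representation (4.278) by composite Simpson's rule (steps even)."""
    h = 1.0 / steps

    def integrand(s):
        return (1.0 - s) ** (gamma - beta - 1) * s ** (beta - 1) * math.exp(s * x)

    total = integrand(0.0) + integrand(1.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(k * h)
    prefactor = math.gamma(gamma) / (math.gamma(beta) * math.gamma(gamma - beta))
    return prefactor * total * h / 3.0

if __name__ == "__main__":
    print(kummer_series(2.0, 5.0, 1.5), kummer_integral(2.0, 5.0, 1.5))
```

For $\beta = \gamma$ the series reduces to the exponential, which provides a second simple check.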


Problem 4-16: Show that when $\beta$ is a negative
integer, $-r$, a solution may be found in the form of a contour integral
around the origin


$$F(-r, \gamma, x) = a\oint \frac{(1-s)^{\gamma+r-1}}{s^{r+1}}\, e^{sx}\,ds$$


By evaluating the contour integral show that

$$F(-r, \gamma, x) = 2\pi i\, a\,(-1)^r \sum_{p=0}^{r} \binom{\gamma+r-1}{r-p}\frac{(-x)^p}{p!}$$


where $\binom{n}{p}$ is the coefficient of $x^p$ in the expansion of $(1+x)^n$.


Using the condition that $F(-r, \gamma, x) = 1$ when $x = 0$ determine $a$,
and hence show that


$$F(-r, \gamma, x) = \sum_{m=0}^{r} \binom{r}{m}\frac{(-x)^m}{\gamma_m} = \frac{1}{2\pi i}\,(-1)^r\,\frac{r!}{\gamma_r}\oint \frac{(1-s)^{\gamma+r-1}}{s^{r+1}}\, e^{sx}\,ds$$


If $\gamma - \beta$ is a negative integer but $\beta$ is not, show that the substitution
$s' = 1 - s$ may be used to derive a contour integral representation.



While this solution has been obtained in the first instance on the
assumption that $\gamma > \beta > 0$, the fact that (4.277) satisfies the differential
equation cannot depend on the values of $\beta$ and $\gamma$ in general, so that it
provides a solution in all instances except when $\gamma$ is a negative integer.

When $\gamma = -m$, a negative integer, it is evident from the series form
(4.275) that, due to singularities of the denominator, all the terms of the
series are equal to zero up to the term for which $n = m + 1$. Therefore,
at the origin the solution behaves in this case like $x^{m+1}$.


Problem 4-17: Prove that

$$\frac{d}{dx}F(\beta, \gamma, x) = \frac{\beta}{\gamma}\,F(\beta+1, \gamma+1, x)$$


Let us substitute
$$y = x^{m+1} u \tag{4.279}$$


A simple calculation shows that $u$ in turn satisfies a confluent hypergeometric
equation


$$x u'' + (m + 2 - x)u' - (m + 1 + \beta)u = 0 \tag{4.280}$$
The function $u(x)$ therefore has the form
$$u = B\,F(m+1+\beta,\, m+2,\, x) \tag{4.281}$$


Problem 4-18: Prove that

$$F(\beta, \gamma, x) = e^{x}\,F(\gamma - \beta, \gamma, -x)$$


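The value of the constant $B$ which relates this solution to (4.275) may be
found as follows: Using the known value of $C$,


$$C = \frac{\Gamma(\gamma)}{\Gamma(\beta)\,\Gamma(\gamma-\beta)} \tag{4.282}$$

in Eq. (4.275), we find that

$$y_1 = \frac{\Gamma(\gamma)}{\Gamma(\beta)}\sum_{n=0}^{\infty} \frac{\Gamma(\beta+n)}{\Gamma(\gamma+n)\,n!}\,x^n \tag{4.283}$$


The coefficient of the leading term as $\gamma \to -m$, that is, the coefficient of
$x^{m+1}$, is


$$\lim_{x\to 0}\,\lim_{\gamma\to -m} \frac{y_1}{\Gamma(\gamma)\,x^{m+1}} = \frac{\Gamma(\beta+m+1)}{\Gamma(\beta)\,(m+1)!} = \frac{\beta_{m+1}}{(m+1)!}$$

But the limit of $y/x^{m+1}$, with $y$ given by (4.279) and (4.281), is simply equal to $B$.
Therefore,


$$\lim_{\gamma\to -m} \frac{F(\beta, \gamma, x)}{\Gamma(\gamma)} = \frac{\beta_{m+1}}{(m+1)!}\,x^{m+1}\,F(m+1+\beta,\, m+2,\, x) \tag{4.284}$$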



Problem 4-19: From the integral representation
(4.278), show that, for large positive $x$


$$F(\beta, \gamma, x) \sim \frac{\Gamma(\gamma)}{\Gamma(\beta)}\,e^{x}\,x^{\beta-\gamma}$$


provided that $\gamma$ and $\beta$ are not negative integers.


We are, of course, able to conclude, from the fact that their leading terms
are equal, that the functions on the right- and left-hand sides are equal for
all $x$, since they satisfy the same differential equation.

We have now shown how one solution of the confluent hypergeometric
equation is found, for arbitrary values of $\beta$ and $\gamma$; (4.275) provides a solution
for $\gamma \ne -m$ and (4.284) for $\gamma = -m$. Let us see next how to find a second
independent solution.

If we attempt to find a solution of the confluent hypergeometric equation of the
form


$$y = x^{k} w(x) \tag{4.285}$$


where $w(x)$ is finite and analytic at the origin, we find, on substituting in
the differential equation


$$x\left[x^{k}w'' + 2k x^{k-1} w' + k(k-1)x^{k-2} w\right] + (\gamma - x)\left[x^{k} w' + k x^{k-1} w\right] - \beta x^{k} w = 0 \tag{4.286}$$


Equating those terms which are predominant near the origin, that is, those
which behave like $x^{k-1}$, we find that


$$k(k-1) + k\gamma = 0$$
so that
$$k = 0 \quad\text{or}\quad k = 1 - \gamma \tag{4.287}$$


Since the confluent hypergeometric function corresponds to the case
$k = 0$, consider the other case. Putting $k = 1 - \gamma$ in (4.286), and dividing
through by $x^{k}$, we find for $w(x)$ the equation


$$x w'' + (2 - \gamma - x)w' - (1 + \beta - \gamma)w = 0 \tag{4.288}$$


This is again a confluent hypergeometric equation. Therefore, the second 
solution is 


$$y_2(x) = x^{1-\gamma}\,F(1+\beta-\gamma,\, 2-\gamma,\, x) \tag{4.289}$$
The general solution, then, except in special cases to be noted, is
$$y = C_1 y_1(x) + C_2 y_2(x) = C_1 F(\beta, \gamma, x) + C_2\, x^{1-\gamma}\,F(1+\beta-\gamma,\, 2-\gamma,\, x) \tag{4.290}$$



It is interesting to note that, when $\gamma = -m$, $y_2(x)$ becomes
$x^{m+1}F(m+1+\beta,\, m+2,\, x)$, which is, to within a constant, identical with (4.284).

The problem remains, then, how to find a second solution when $\gamma$ is a
negative integer. Such a solution may be written down formally using
(B.7) of Appendix 3B. The Wronskian is


$$W = \exp\left[\int\left(1 - \frac{\gamma}{x}\right)dx\right] = x^{-\gamma}e^{x} = x^{m}e^{x} \tag{4.291}$$
Knowing $y_2$, it follows that a second solution is
$$y = y_2(x)\int^{x} \frac{W(x')}{y_2^{2}(x')}\,dx' = y_2(x)\int^{x} \frac{e^{x'}\,dx'}{x'^{\,m+2}\,F^{2}(m+1+\beta,\, m+2,\, x')} \tag{4.292}$$


Since $e^{x'}/F^{2}(m+1+\beta,\, m+2,\, x')$ can be expanded in a series about the
origin, we see that there is a logarithmic term in the integral. The other
solution is, therefore, not analytic in $x$, but has the form


$$y(x) = C_0\, y_2(x)\ln x + \text{power series}$$


$C_0$ being a constant.

A remaining case in which only one solution has been found is that in
which $\gamma = 1$, since, as may easily be seen, for this value of $\gamma$, $y_2 = y_1$. In
this case, too, the second solution may be found in terms of the first from
(B.7) as above; it has the form


$$y = F(\beta, 1, x)\int^{x} \frac{e^{x'}\,dx'}{x'\,F^{2}(\beta, 1, x')} \tag{4.293}$$
and again has a logarithmic singularity at $x = 0$.


We now turn to some special cases of functions related to the confluent
hypergeometric function.


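24. Laguerre Functions
The Laguerre equation is
$$x y'' + (\alpha + 1 - x)y' + n y = 0 \tag{4.294}$$
where we assume $\alpha$ to be $> -1$. This is directly a confluent hypergeometric
equation; one solution is then


$$y = L_n^{(\alpha)}(x) = \frac{\Gamma(\alpha+n+1)}{\Gamma(\alpha+1)\,n!}\,F(-n,\, \alpha+1,\, x) \tag{4.295}$$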



which is designated the Laguerre function. The choice of the multiplying
constant is conventional. If $n$ is an integer, we see immediately that the
function is a polynomial of degree $n$.
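That the series terminates for integral $n$ can be seen by summing it directly. The following sketch assumes the conventional normalization $\Gamma(\alpha+n+1)/[\Gamma(\alpha+1)\,n!]$ in (4.295); the function name is ours. It builds $F(-n, \alpha+1, x)$ from (4.277), whose terms vanish beyond the $n$th.

```python
import math

def laguerre(n, alpha, x):
    """L_n^(alpha)(x) as a normalizing factor times the terminating series F(-n, alpha + 1, x)."""
    term = 1.0
    series = 1.0
    for k in range(n):
        # ratio of successive terms of (4.277) with beta = -n, gamma = alpha + 1;
        # the factor (-n + k) would vanish at k = n, so the loop stops there
        term *= (-n + k) / (alpha + 1 + k) * x / (k + 1)
        series += term
    norm = math.gamma(alpha + n + 1) / (math.gamma(alpha + 1) * math.factorial(n))
    return norm * series

if __name__ == "__main__":
    x = 0.7
    print(laguerre(1, 0.0, x), 1.0 - x)
    print(laguerre(2, 0.0, x), 1.0 - 2.0 * x + 0.5 * x * x)
```

For $\alpha = 0$ this reproduces the familiar forms $L_1 = 1 - x$ and $L_2 = 1 - 2x + \tfrac{1}{2}x^2$.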

The second independent solution of the equation, which has a singularity
at the origin, is


$$y_2(x) = x^{-\alpha}\,F(-n-\alpha,\, 1-\alpha,\, x) \tag{4.296}$$


unless $\alpha$ is zero or a positive integer. If $\alpha$ is zero, the solution is obviously
identical with the first one. If it is a positive integer, formula (4.284) with
$\alpha = m + 1$ again shows that the “second” solution is identical with the
first. In either case, an independent second, non-analytic solution exists,
as demonstrated in the last section.

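An integral representation of the Laguerre polynomials is, following
(4.274),


$$L_n^{(\alpha)}(x) = (\text{constant})\oint \frac{(1-s)^{\alpha+n}}{s^{n+1}}\, e^{sx}\,ds \tag{4.297}$$


where we have chosen as contour of integration a path enclosing the
singularity at the origin.


Problem 4-20: Taking the constant in (4.297) to be
$(-1)^n/2\pi i$, show, by making the substitution $sx = x - \sigma$ and evaluating the
contour integration in $\sigma$, that


$$L_n^{(\alpha)}(x) = \frac{1}{n!}\,e^{x}x^{-\alpha}\,\frac{d^n}{dx^n}\left(x^{\alpha+n}e^{-x}\right)$$


The integral (4.297) may be transformed by putting


$$s = \frac{t}{1+t} \tag{4.298}$$


The integral then becomes


$$L_n^{(\alpha)}(x) = (\text{constant})\oint \frac{1}{t^{n+1}(1+t)^{\alpha+1}}\exp\left(\frac{tx}{1+t}\right)dt \tag{4.299}$$
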

around the transformed contour. Now since the origin in $s$ transforms into
the origin in $t$, and $s = \infty$ transforms into $t = -1$, a contour surrounding
$s = 0$ transforms into one surrounding $t = 0$ but not enclosing $t = -1$. The
residue at $t = 0$ is the coefficient of $t^n$ in


$$g(t) = \frac{1}{(1+t)^{\alpha+1}}\exp\left(\frac{tx}{1+t}\right) \tag{4.300}$$
$L_n^{(\alpha)}(x)$ is therefore $2\pi i\,(\text{constant})$ times this coefficient; thus
$$g(t) = \frac{1}{(1+t)^{\alpha+1}}\exp\left(\frac{tx}{1+t}\right) = \sum_{n=0}^{\infty} \frac{1}{2\pi i\,(\text{constant})}\,L_n^{(\alpha)}(x)\,t^n$$



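The value of the constant may be determined by putting $x = 0$. From
(4.277) and (4.295)


$$L_n^{(\alpha)}(0) = \frac{\Gamma(\alpha+n+1)}{\Gamma(\alpha+1)\,n!} \tag{4.301}$$
But
$$g(t)\big|_{x=0} = (1+t)^{-\alpha-1} = \sum_{n=0}^{\infty} (-1)^n\,\frac{\Gamma(\alpha+n+1)}{\Gamma(\alpha+1)\,n!}\,t^n$$
Therefore the constant must be $(-1)^n/(2\pi i)$, and we have
$$\frac{1}{(1+t)^{\alpha+1}}\exp\left(\frac{tx}{1+t}\right) = \sum_{n=0}^{\infty} (-1)^n L_n^{(\alpha)}(x)\,t^n \tag{4.302}$$


The function on the left is therefore known as the “generating function”
for the Laguerre polynomials.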
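A direct numerical comparison of the two sides of the generating relation (4.302) is a useful check on signs. The sketch below is hypothetical in its details: it uses the explicit finite sum for $L_n^{(\alpha)}(x)$ in the conventional normalization, truncates the series at a fixed order, and requires $|t| < 1$ for convergence.

```python
import math

def laguerre(n, alpha, x):
    """Explicit finite sum for L_n^(alpha)(x) in the conventional normalization."""
    total = 0.0
    for s in range(n + 1):
        coeff = math.gamma(alpha + n + 1) / (
            math.gamma(alpha + s + 1) * math.factorial(n - s) * math.factorial(s)
        )
        total += coeff * (-x) ** s
    return total

def generating_function(alpha, x, t):
    """Left-hand side of (4.302)."""
    return (1.0 + t) ** (-alpha - 1.0) * math.exp(t * x / (1.0 + t))

def partial_sum(alpha, x, t, n_max=40):
    """Right-hand side of (4.302), truncated after n_max + 1 terms (needs |t| < 1)."""
    return sum((-1) ** n * laguerre(n, alpha, x) * t ** n for n in range(n_max + 1))

if __name__ == "__main__":
    print(generating_function(0.5, 1.2, 0.3), partial_sum(0.5, 1.2, 0.3))
```

The alternating factor $(-1)^n$ is essential: without it the partial sums converge to the function with $t$ replaced by $-t$.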


Problem 4-21: By expanding the exponential in
(4.302) show that


$$L_n^{(\alpha)}(x) = \sum_{s=0}^{n} \frac{\Gamma(\alpha+n+1)}{\Gamma(\alpha+s+1)}\,\frac{1}{(n-s)!\,s!}\,(-x)^s$$

Calculate the Laguerre polynomials for $\alpha = 0$ and $n = 0, 1, 2, 3, 4$ and
compare with the forms obtained in Chapter 2.

Problem 4-22: By differentiation of the generating
function with respect to $x$, show that

$$\frac{d}{dx}L_n^{(\alpha)}(x) = -L_{n-1}^{(\alpha+1)}(x)$$

Problem 4-23: By differentiating the generating
function with respect to $t$, derive the recursion formula:


$$n L_n^{(\alpha)}(x) = (2n + \alpha - 1 - x)\,L_{n-1}^{(\alpha)}(x) - (\alpha + n - 1)\,L_{n-2}^{(\alpha)}(x)$$


25. Hermite Functions 


The Hermite equation, which arises in the treatment of the harmonic
oscillator in quantum mechanics (Chapter 9), is


$$y'' - x y' + n y = 0 \tag{4.303}$$

In the notation of (4.256),
$$f(s) = -s \tag{4.304}$$
$$g(s) = s^2 + n \tag{4.305}$$



Thus,


$$\phi(s) = \exp\left[-\int^{s}\left(s' + \frac{n}{s'}\right)ds'\right] = A\exp\left(-\tfrac{1}{2}s^2 - n\ln s\right) = \frac{A}{s^{n}}\,e^{-(1/2)s^2} \tag{4.306}$$
$A$ being a constant. A solution of the equation may then be written as
$$y = -A\int_{s_1}^{s_2} s^{-n-1}\,e^{-(1/2)s^2}\,e^{sx}\,ds$$


where $s_1$ and $s_2$ are values of the variable $s$ for which
$$\exp\left[sx - \tfrac{1}{2}s^2\right]/s^{n} = 0 \tag{4.307}$$


If $n$ is an integer, the obvious contour to take is a closed loop surrounding
the origin. If $n$ is not an integer, there is a branch point at the origin.
Making a cut along the positive real axis, one may choose a contour coming
in from infinity on one side of the cut, looping around the origin, and
going out the other side. The integral is then that of the discontinuity
along the cut.

In the case in which $n$ is a positive integer, the contour integral can
be evaluated; it is $-2\pi i A$ times the coefficient of $s^{n}$ in $e^{sx}e^{-(1/2)s^2}$. Choosing


$$-2\pi i A = n! \tag{4.308}$$
we obtain the generating formula
$$e^{sx}\,e^{-(1/2)s^2} = \sum_{n=0}^{\infty} H_n(x)\,\frac{s^n}{n!} \tag{4.309}$$
where $H_n(x)$ is the standard solution of (4.303) known as the “Hermite
polynomial,” and is given by the integral


$$H_n(x) = \frac{n!}{2\pi i}\oint \frac{e^{sx-(1/2)s^2}}{s^{n+1}}\,ds \tag{4.310}$$


Problem 4-24: Differentiating (4.309) with respect
to $s$, prove the recursion relation


$$x H_n = H_{n+1} + n H_{n-1}$$


for $n \ge 1$. What is the relation for $xH_0$?


Problem 4-25: Differentiating (4.309) with respect
to $x$, prove the recursion relation


$$\frac{d}{dx}H_n = n H_{n-1}$$



Problem 4-26: From the recursion relation, write 
down the first five Hermite polynomials. 


Problem 4-27: Multiplying $e^{sx}e^{-(1/2)s^2} = \sum H_n(x)(s^n/n!)$
and $e^{tx}e^{-(1/2)t^2} = \sum H_m(x)(t^m/m!)$ show that


$$\int_{-\infty}^{\infty} H_n(x)\,H_m(x)\,e^{-(1/2)x^2}\,dx = 0$$
for $m \ne n$ and $\sqrt{2\pi}\,n!$ for $m = n$.


The Hermite polynomial may be written in another way by introducing
$t = s - x$ as a variable. Equation (4.310) then becomes


$$H_n(x) = \frac{n!}{2\pi i}\,e^{(1/2)x^2}\oint \frac{e^{-(1/2)t^2}}{(t+x)^{n+1}}\,dt \tag{4.311}$$
the contour now being taken around $t = -x$. The integral is $2\pi i$ times the
residue of $e^{-(1/2)t^2}/(t+x)^{n+1}$ at $t = -x$, which is in turn $1/n!$ times the $n$th derivative
of $e^{-(1/2)t^2}$ at $t = -x$. Therefore,
$$H_n(x) = (-1)^n\,e^{(1/2)x^2}\,\frac{d^n}{dx^n}\,e^{-(1/2)x^2} \tag{4.312}$$
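The generating formula (4.309) lends itself to a quick numerical check. In the sketch below the polynomials are produced from the recursion $H_{k+1} = xH_k - kH_{k-1}$ of Problem 4-24, started from $H_0 = 1$, $H_1 = x$; the function names and the truncation order are our own illustrative choices.

```python
import math

def hermite(n, x):
    """H_n(x) from the recursion H_{k+1} = x H_k - k H_{k-1}, with H_0 = 1, H_1 = x."""
    if n == 0:
        return 1.0
    h_prev, h_curr = 1.0, x
    for k in range(1, n):
        h_prev, h_curr = h_curr, x * h_curr - k * h_prev
    return h_curr

def generating_function(x, s):
    """Left-hand side of (4.309)."""
    return math.exp(s * x - 0.5 * s * s)

def partial_sum(x, s, n_max=60):
    """Right-hand side of (4.309), truncated after n_max + 1 terms."""
    return sum(hermite(n, x) * s ** n / math.factorial(n) for n in range(n_max + 1))

if __name__ == "__main__":
    print(generating_function(0.8, 0.5), partial_sum(0.8, 0.5))
    print(hermite(4, 2.0))  # x^4 - 6x^2 + 3 at x = 2
```

The first few polynomials produced this way are $1$, $x$, $x^2 - 1$, $x^3 - 3x$, $x^4 - 6x^2 + 3$, in agreement with (4.312).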
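Substituting

$$\tfrac{1}{2}x^2 = z \tag{4.313}$$

in (4.303) leads to the differential equation (primes now denoting derivatives with respect to $z$)
$$z y'' + \left(\tfrac{1}{2} - z\right)y' + \frac{n}{2}\,y = 0 \tag{4.314}$$


which is a confluent hypergeometric equation. Therefore one solution is


$$y_1 = F\left(-\frac{n}{2},\, \frac{1}{2},\, \frac{x^2}{2}\right) \tag{4.315}$$
and a second is, from (4.289),
$$y_2 = \left(\frac{x^2}{2}\right)^{1/2} F\left(-\frac{n-1}{2},\, \frac{3}{2},\, \frac{x^2}{2}\right) \tag{4.316}$$


In the case in which $n$ is even, $y_1$ is a polynomial, since $\beta$ is a negative
integer. Therefore, (4.315) multiplied by some factor $B_n$ is the Hermite
polynomial. The coefficient can be obtained by putting $x = 0$:


$$H_n(0) = B_n\,F\left(-\frac{n}{2},\, \frac{1}{2},\, 0\right) \tag{4.317}$$


From (4.309)


$$H_n(0) = \left(-\tfrac{1}{2}\right)^{n/2}\frac{n!}{(n/2)!}$$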



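and since $F(\beta, \gamma, 0) = 1$, we have


$$H_n(x) = \left(-\tfrac{1}{2}\right)^{n/2}\frac{n!}{(n/2)!}\,F\left(-\frac{n}{2},\, \frac{1}{2},\, \frac{x^2}{2}\right) \tag{4.318}$$


for even $n$.
When $n$ is an odd integer, on the other hand, it is the solution $y_2$
which is polynomial; we therefore identify


$$B_n'\,x\,F\left(-\frac{n-1}{2},\, \frac{3}{2},\, \frac{x^2}{2}\right)$$


with $H_n(x)$. $B_n'$ may be determined by equating coefficients of $x$; it is the
coefficient of $x$ in $H_n(x)$. From (4.309) this is $(-\tfrac{1}{2})^{(n-1)/2}\,[n!/((n-1)/2)!]$.
Therefore,


$$H_n(x) = \left(-\tfrac{1}{2}\right)^{(n-1)/2}\frac{n!}{((n-1)/2)!}\;x\,F\left(-\frac{n-1}{2},\, \frac{3}{2},\, \frac{x^2}{2}\right) \tag{4.319}$$
for odd $n$.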


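Let us prove, finally, an interesting addition formula for Hermite
polynomials, which can be derived from the generating function (4.309).
Consider the formulas


$$e^{a_i x_i s}\,e^{-(1/2)a_i^2 s^2} = \sum_{n_i=0}^{\infty} H_{n_i}(x_i)\,\frac{(a_i s)^{n_i}}{n_i!} \tag{4.320}$$


where $i = 1, 2, \ldots, N$. Multiply together the right-hand sides and the left-hand
sides, writing


$$\mathbf{a}\cdot\mathbf{x} = \sum_{i=1}^{N} a_i x_i \tag{4.321}$$
$$a^2 = \sum_{i=1}^{N} a_i^2 = 1 \tag{4.322}$$
Thus


$$e^{\mathbf{a}\cdot\mathbf{x}\,s}\,e^{-(1/2)s^2} = \sum_{n=0}^{\infty} H_n(\mathbf{a}\cdot\mathbf{x})\,\frac{s^n}{n!} = \prod_{i=1}^{N}\,\sum_{n_i=0}^{\infty} H_{n_i}(x_i)\,\frac{(a_i s)^{n_i}}{n_i!}$$


Equating coefficients of $s^n$ on both sides


$$H_n(\mathbf{a}\cdot\mathbf{x}) = n!\sum_{n_1+n_2+\cdots+n_N=n}\;\prod_{i=1}^{N} H_{n_i}(x_i)\,\frac{a_i^{n_i}}{n_i!} \tag{4.323}$$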
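The addition formula (4.323) can be tried out for $N = 2$, where the constrained sum reduces to a single sum over $n_1$. The sketch below is illustrative: the polynomials come from the recursion of Problem 4-24, and the particular values of $a_1$, $a_2$ (chosen so that $a_1^2 + a_2^2 = 1$) are arbitrary.

```python
import math

def hermite(n, x):
    """H_n(x) from the recursion H_{k+1} = x H_k - k H_{k-1}, with H_0 = 1, H_1 = x."""
    if n == 0:
        return 1.0
    h_prev, h_curr = 1.0, x
    for k in range(1, n):
        h_prev, h_curr = h_curr, x * h_curr - k * h_prev
    return h_curr

def addition_rhs(n, a1, x1, a2, x2):
    """Right-hand side of (4.323) for N = 2; assumes a1**2 + a2**2 == 1."""
    total = 0.0
    for n1 in range(n + 1):
        n2 = n - n1
        total += (hermite(n1, x1) * a1 ** n1 / math.factorial(n1)
                  * hermite(n2, x2) * a2 ** n2 / math.factorial(n2))
    return math.factorial(n) * total

if __name__ == "__main__":
    a1, a2, x1, x2 = 0.6, 0.8, 0.3, -1.1
    for n in range(6):
        print(n, hermite(n, a1 * x1 + a2 * x2), addition_rhs(n, a1, x1, a2, x2))
```

If the normalization $a_1^2 + a_2^2 = 1$ is dropped, the two columns no longer agree, which shows where condition (4.322) enters.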



26. Equations Reducible to the 
Confluent Hypergeometric: The 
Bessel Equation 


In the previous section, we have seen that the Hermite equation was 
reducible to a confluent hypergeometric by a change of independent 
variable. There are also equations which may be so reduced by a trans- 
formation of the dependent variable. 

For example, let us find the equation for $y$ such that


$$y = e^{-ax}\,x^{s}\,f(x) \tag{4.324}$$


where $f(x)$ satisfies the confluent hypergeometric equation. Substitution of $e^{ax}x^{-s}y$
for $f$ in the equation leads to the following differential equation for $y$:


$$x^2 y'' + \left[(2a-1)x^2 + (\gamma - 2s)x\right]y' + \left[a(a-1)x^2 + (\gamma a - \beta - 2sa + s)x - s(\gamma - s - 1)\right]y = 0 \tag{4.325}$$


A slight further generalization is possible if we seek to have $y$ in the
form


$$y = e^{-ax}\,x^{s}\,f(kx) \tag{4.326}$$


This requires replacing $x$ by $kx$ and $a$ by $a/k$ in the previous equation, so
that $y$ satisfies


$$x^2 y'' + \left[(2a-k)x^2 + (\gamma - 2s)x\right]y' + \left[a(a-k)x^2 + (\gamma a - k\beta - 2sa + ks)x - s(\gamma - s - 1)\right]y = 0 \tag{4.327}$$
The Bessel equation


$$y'' + \frac{1}{x}\,y' + \left(1 - \frac{n^2}{x^2}\right)y = 0 \tag{4.328}$$

is a particular case of the above with the values

$$2a - k = 0 \tag{4.329a}$$

$$\gamma - 2s = 1 \tag{4.329b}$$

$$a(a - k) = 1 \tag{4.329c}$$

$$\gamma a - k\beta - 2sa + ks = 0 \tag{4.329d}$$

$$s(\gamma - s - 1) = n^2 \tag{4.329e}$$


The quantity $n$ is assumed to be a positive number. From (4.329a) and
(4.329c), $a = \pm i$. From (4.329b) and (4.329e), $s = \pm n$ and $\gamma = 1 \pm 2n$,
where either plus or minus sign is taken in each case. Finally, from
(4.329d), $\beta = \tfrac{1}{2}\gamma = \tfrac{1}{2} \pm n$. Consequently, two solutions of the Bessel
equation are

$$y = A_n\,x^{n}\,e^{\mp ix}\,F\left(n + \tfrac{1}{2},\, 2n + 1,\, \pm 2ix\right) \tag{4.330}$$



and
$$y = A_n'\,x^{-n}\,e^{\mp ix}\,F\left(-n + \tfrac{1}{2},\, -2n + 1,\, \pm 2ix\right) \tag{4.331}$$


It is not guaranteed that these solutions are really different. In fact, when
$n$ is an integer, it is shown in the section on the confluent hypergeometric
function [Eq. (4.284)] that $F(-n+\tfrac{1}{2},\, -2n+1,\, \pm 2ix)$ is proportional to
$x^{2n}F(n+\tfrac{1}{2},\, 2n+1,\, \pm 2ix)$. It follows that the “second” solution (4.331) is
the same as the first to within a multiplying constant. This may also be
seen by writing down the standard form of the second solution to the
confluent hypergeometric equation as given in (4.290). As shown in that context,
there is an independent second solution which has a logarithmic singularity
at $x = 0$.


27. General Properties of Bessel 
Functions 


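To study in detail the properties of Bessel functions, it is desirable to
write the solutions in a different way. First, let us substitute in (4.330) the
integral representation (4.278) of the confluent hypergeometric function:


$$y_1 = A_n\,x^{n}\,e^{-ix}\,\frac{\Gamma(2n+1)}{[\Gamma(n+\tfrac{1}{2})]^2}\int_0^1 s^{n-(1/2)}(1-s)^{n-(1/2)}\,e^{2isx}\,ds \tag{4.332}$$


Making the substitution $2s - 1 = t$, this takes the form


$$y_1 = A_n\,x^{n}\,\frac{\Gamma(2n+1)}{2^{2n}[\Gamma(n+\tfrac{1}{2})]^2}\int_{-1}^{1} (1-t^2)^{n-(1/2)}\cos tx\,dt \tag{4.333}$$


Since the rest of the integrand is an even function, $e^{itx}$ has been replaced
by its even part, $\cos tx$.

With the appropriate choice of the multiplying constant $A_n$, this is the
standard Bessel function $J_n(x)$. The constant is chosen in such a way that,
in the series expansion, the coefficients have a simple form. The series
expansion may be obtained by expanding $\cos tx$:


$$y_1 = A_n\,x^{n}\,\frac{\Gamma(2n+1)}{2^{2n}[\Gamma(n+\tfrac{1}{2})]^2}\sum_{l=0}^{\infty} \frac{(-1)^l\,x^{2l}}{(2l)!}\int_{-1}^{1} (1-t^2)^{n-(1/2)}\,t^{2l}\,dt \tag{4.334}$$


Putting $t^2 = u$, the integral becomes
$$\int_0^1 (1-u)^{n-(1/2)}\,u^{l-(1/2)}\,du = B\left(n+\tfrac{1}{2},\, l+\tfrac{1}{2}\right) = \frac{\Gamma(n+\tfrac{1}{2})\,\Gamma(l+\tfrac{1}{2})}{\Gamma(n+l+1)}$$
by virtue of (4.249) and (4.254). Thus,


$$y_1 = A_n\,\frac{\Gamma(2n+1)}{2^{2n}\,\Gamma(n+\tfrac{1}{2})}\,x^{n}\sum_{l=0}^{\infty} \frac{(-1)^l\,\Gamma(l+\tfrac{1}{2})}{(2l)!\,\Gamma(n+l+1)}\,x^{2l} \tag{4.335}$$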



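Now $\Gamma(l+\tfrac{1}{2})$ may be written in terms of factorials:


$$\Gamma\left(l+\tfrac{1}{2}\right) = \frac{(2l-1)}{2}\,\frac{(2l-3)}{2}\cdots\frac{1}{2}\,\Gamma\left(\tfrac{1}{2}\right) = \frac{(2l)!}{2^{2l}\,l!}\,\sqrt{\pi} \tag{4.336}$$
so that
$$y_1 = \sqrt{\pi}\,A_n\,\frac{\Gamma(2n+1)}{2^{2n}\,\Gamma(n+\tfrac{1}{2})}\,x^{n}\sum_{l=0}^{\infty} \frac{(-1)^l}{l!\,\Gamma(n+l+1)}\left(\frac{x}{2}\right)^{2l} \tag{4.337}$$


The normalization of the standard Bessel function is now chosen so that
all that remains multiplying the sum is $(x/2)^n$; that is


$$A_n = \frac{2^{n}\,\Gamma(n+\tfrac{1}{2})}{\sqrt{\pi}\,\Gamma(2n+1)} \tag{4.338}$$
It follows that the series expansion of the Bessel function is
$$J_n(x) = \sum_{l=0}^{\infty} \frac{(-1)^l}{l!\,\Gamma(n+l+1)}\left(\frac{x}{2}\right)^{n+2l} \tag{4.339}$$


and that its integral representation is


$$J_n(x) = \frac{2}{\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})}\left(\frac{x}{2}\right)^{n}\int_0^1 (1-t^2)^{n-(1/2)}\cos tx\,dt \tag{4.340}$$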
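The equivalence of the series (4.339) and the integral (4.340) can be verified numerically. In the sketch below the substitution $t = \sin\theta$ is our own device, made so that the integrand $\cos^{2n}\theta\,\cos(x\sin\theta)$ remains smooth at the upper limit; Simpson's rule and the function names are likewise illustrative assumptions.

```python
import math

def bessel_series(n, x, terms=60):
    """Series (4.339); assumes n is not a negative integer, so every Gamma is finite."""
    total = 0.0
    for l in range(terms):
        total += ((-1) ** l / (math.factorial(l) * math.gamma(n + l + 1))
                  * (x / 2.0) ** (n + 2 * l))
    return total

def bessel_integral(n, x, steps=2000):
    """Integral (4.340) after putting t = sin(theta); Simpson's rule, steps even, n >= 0."""
    h = (math.pi / 2.0) / steps

    def integrand(theta):
        return math.cos(theta) ** (2 * n) * math.cos(x * math.sin(theta))

    total = integrand(0.0) + integrand(math.pi / 2.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(k * h)
    integral = total * h / 3.0
    return 2.0 / (math.sqrt(math.pi) * math.gamma(n + 0.5)) * (x / 2.0) ** n * integral

if __name__ == "__main__":
    for n in (0, 1, 2):
        print(n, bessel_series(n, 3.0), bessel_integral(n, 3.0))
```

The series is convenient for small $x$; for large $x$ its terms grow before they decay and cancellation costs accuracy, whereas the integral remains well behaved.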

The usual treatment of the Bessel function begins by solving the 
differential equation by the Frobenius method to obtain the series expan- 
sion (4.339). Unfortunately, the series does not give much insight into 
the properties or behavior of the Bessel function. The point is perhaps 
best illustrated by observing that the series expansion of a sine or cosine 
does not reveal in any obvious way the oscillations or the periodicity of 
those functions. The integral (4.340), on the other hand, provides considerable
insight into the character of the function, as we now show.

This insight derives from the fact that the integral represents simply
a superposition of harmonic functions or waves; the lower limit represents
waves of infinitely long wavelength and the upper one corresponds to a
wavelength of $2\pi$. The amplitudes of the waves in the superposition are
given by

$$(1-t^2)^{n-(1/2)} = \left(1 - \frac{4\pi^2}{\lambda^2}\right)^{n-(1/2)} \tag{4.341}$$

where $\lambda$ is the wavelength (see Fig. 4.6). The following observations then
follow:

(a) At x = 0 the waves are all in phase; that is, they add construc- 
tively. Thereafter the phases become more and more randomly distribu- 
ted as x increases. Therefore, the integral has its maximum value at x = 0. 



Figure 4.6 


Furthermore, as $x \to \infty$, the randomness of the phases leads to the integral
going to zero. In the case of $J_0(x)$, then the function has its maximum
value, unity, at $x = 0$, and approaches zero for large $x$.

(b) Since all of the waves start with zero phase, none of them become
negative before a quarter wavelength; for the shortest wave this is $\pi/2$.
The integral, therefore, must be positive up to this value. After $\pi/2$, more
and more of the periodic functions take on negative values, until a point
is reached at which the function as a whole becomes negative. For the
case of $n = 0$, this is the first zero of the function; for other $n$'s, $J_n(0) = 0$
because of the factor $x^n$, but the next zero comes for $x > \pi/2$.

(c) For $n < \tfrac{1}{2}$, the amplitude $(1-t^2)^{n-(1/2)}$ is greatest near $t = 1$, that
is, for the shortest wavelengths. As $n$ increases, the shorter wavelength
components decrease relative to those of longer wavelength. After $n = \tfrac{1}{2}$,
the amplitude starts at zero for $t = 1$ ($\lambda = 2\pi$). Consequently, as $n$ increases,
the first zero after $x = 0$ will come at larger and larger values
of $x$.

The first zero of $J_0(x)$ is at $x = 2.405$, which is considerably bigger
than $\pi/2 = 1.571$.

By the time that $x = 3\pi/2 = 4.712$, the waves with greatest amplitude
have again become positive. Therefore, somewhere beyond this value,
$J_0(x)$ should again become positive. The second zero is in fact found to
be at $x = 5.520$.
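The quoted positions of the first two zeros of $J_0$ can be reproduced with a few lines of code. The sketch below sums the series (4.339) for $n = 0$ and locates sign changes by bisection; the bracketing intervals, taken from the quarter-wavelength argument above, and the function names are our own choices.

```python
import math

def j0(x, terms=60):
    """J_0(x) from the series (4.339) with n = 0."""
    return sum((-1) ** l / math.factorial(l) ** 2 * (x / 2.0) ** (2 * l)
               for l in range(terms))

def bisect_zero(f, a, b, tol=1e-12):
    """Bisection for a sign change of f on [a, b]."""
    fa = f(a)
    while b - a > tol:
        mid = 0.5 * (a + b)
        fm = f(mid)
        if fa * fm <= 0.0:
            b = mid
        else:
            a, fa = mid, fm
    return 0.5 * (a + b)

# The first zero must lie beyond pi/2; the second lies beyond 3*pi/2.
first_zero = bisect_zero(j0, math.pi / 2.0, 3.0 * math.pi / 2.0)
second_zero = bisect_zero(j0, 3.0 * math.pi / 2.0, 7.0)

if __name__ == "__main__":
    print(first_zero, second_zero)
```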

(d) Though it does not follow rigorously, it seems reasonable that the 
Bessel functions would have an oscillatory character. This is a conse- 
quence of the fact that it is constructed from periodic functions, and that 
there are no components of very short wavelength. 



28. Second Solution of the 
Bessel Equation 


Since the Bessel equation depends only on $n^2$, it should be possible
to get a second solution from the first by replacing $n$ by $-n$. This is certainly
true for the series solution (4.339). The integral in (4.340), however,
does not converge for negative $n$. The second solution is expressible in
terms of a confluent hypergeometric function by (4.289).

When $n$ is an integer, it is found that the second solution is not independent
of the first. Since the Γ function of a negative integer is infinite,
the coefficient of the first $n$ terms in $J_{-n}(x)$ becomes zero:


$$J_{-n}(x) = \sum_{l=0}^{\infty} \frac{(-1)^l}{l!\,\Gamma(l-n+1)}\left(\frac{x}{2}\right)^{-n+2l} = \sum_{m=0}^{\infty} \frac{(-1)^{n+m}}{(m+n)!\,\Gamma(m+1)}\left(\frac{x}{2}\right)^{n+2m}$$


where we have put $m = l - n$. Since $\Gamma(m+1) = m!$ and for integral $n$,
$(m+n)! = \Gamma(m+n+1)$,


$$J_{-n}(x) = (-1)^n\sum_{m=0}^{\infty} \frac{(-1)^{m}}{m!\,\Gamma(m+n+1)}\left(\frac{x}{2}\right)^{n+2m} = (-1)^n J_n(x) \tag{4.342}$$
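Relation (4.342) may be checked by summing the series (4.339) with $n$ replaced by $-n$, provided the reciprocal of the Γ function is taken to vanish at its poles. The sketch below does this explicitly, since the library routine `math.gamma` raises an error there; the helper `reciprocal_gamma` is ours, and $x > 0$ is assumed so that negative powers of $x/2$ are defined.

```python
import math

def reciprocal_gamma(z):
    """1/Gamma(z), taken to be zero at the poles z = 0, -1, -2, ... of the Gamma function."""
    if z <= 0 and float(z).is_integer():
        return 0.0
    return 1.0 / math.gamma(z)

def bessel_series(n, x, terms=60):
    """Series (4.339) for any real order n; assumes x > 0."""
    total = 0.0
    for l in range(terms):
        total += ((-1) ** l / math.factorial(l) * reciprocal_gamma(l + n + 1)
                  * (x / 2.0) ** (n + 2 * l))
    return total

if __name__ == "__main__":
    x = 1.3
    print(bessel_series(-2, x), bessel_series(2, x))   # equal
    print(bessel_series(-3, x), bessel_series(3, x))   # opposite in sign
```

For non-integral $n$ no coefficients vanish, and the two series are then genuinely independent.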

The other independent solution for $n$ a negative integer may be
derived by noting that (4.331) expresses it in terms of a confluent hypergeometric
function with $\gamma$ a negative integer. The situation therefore
parallels that case, for which the essential results are given by (4.291)
and (4.292). Since the Wronskian for the Bessel equation is $1/x$, the
second solution may be written


$$y_2 = J_n(x)\int^{x} \frac{dx'}{x'\,J_n^{2}(x')} \tag{4.343}$$


Subsequently we introduce the Hankel functions as two alternative 
independent but complex solutions of the Bessel equation. It is then pos- 
sible to derive two real, independent solutions by taking real and im- 
aginary parts. 


29. Zeros of Bessel Functions 


It is possible to derive theorems for the zeros of Bessel functions of 
any sort, since these theorems follow only from a modified form of the 
differential equation itself. 



We must first show that it is possible by a transformation to eliminate
the first-derivative term in the differential equation. The transformation
is


$$y = x^{-1/2}\,u \tag{4.344}$$


The function $u$ satisfies the equation
$$u'' + \left(1 - \frac{n^2 - \tfrac{1}{4}}{x^2}\right)u = 0 \tag{4.345}$$


This function will have the same zeros as the solution of the Bessel equation
except possibly for $x = 0$.


THEOREM 1 
Between any two zeros of a Bessel function, there must always lie a
zero of a sinusoidal function ($\sin x$ or $\cos x$ or some combination thereof).
This is proven in the following way. Starting from the equations


$$u'' + \left(1 - \frac{n^2 - \tfrac{1}{4}}{x^2}\right)u = 0 \tag{4.345}$$


$$v'' + v = 0 \tag{4.346}$$
we multiply the first by $v$, the second by $u$, subtract and integrate between
two successive zeros, $x_1$ and $x_2$, of $u$:


$$\left[v u' - u v'\right]_{x_1}^{x_2} = \left(n^2 - \tfrac{1}{4}\right)\int_{x_1}^{x_2} \frac{uv}{x^2}\,dx \tag{4.347}$$
Now, either $u'(x_1) > 0$, $u'(x_2) < 0$, and $u$ is positive on the interval, or
$u'(x_1) < 0$, $u'(x_2) > 0$, and $u$ is negative on the interval. Consider the
former case: the argument follows in exactly the same way in the latter.
We have, then,


$$v(x_2)u'(x_2) - v(x_1)u'(x_1) = \left(n^2 - \tfrac{1}{4}\right)\int_{x_1}^{x_2} \frac{uv}{x^2}\,dx \tag{4.348}$$
If $v$ is positive on the whole interval (or if it is negative on the whole
interval), the two sides of the equation have opposite signs, provided that
$n^2 > \tfrac{1}{4}$. Therefore, $v$ must change sign in the interval; that is, it must
have a zero between successive zeros of the Bessel function.
It may further be shown that as $x \to \infty$, the separation between successive
zeros of the Bessel function approaches $\pi$, for clearly, as $x \gg n$, Eq. (4.345)
becomes more and more nearly identical to (4.346).


THEOREM 2 

There is a zero of $J_n(x)$ between any two successive zeros of $J_{n+1}$.
(In fact, the zeros of $J_{n+1}$ are separated by more than those of $J_n$, though in
both cases, the separations for large $x$ approach $\pi$.)



The theorem is proven in much the same way as the one preceding.
The equations for the functions are


$$u_n'' + \left(1 - \frac{n^2 - \tfrac{1}{4}}{x^2}\right)u_n = 0$$
$$u_{n+1}'' + \left(1 - \frac{(n+1)^2 - \tfrac{1}{4}}{x^2}\right)u_{n+1} = 0$$
Multiplying the first by $u_{n+1}$, the second by $u_n$, subtracting, and integrating between
$x_1$ and $x_2$, which are now two successive zeros of $u_{n+1}$, we obtain the
equation
$$u_n(x_2)\,u_{n+1}'(x_2) - u_n(x_1)\,u_{n+1}'(x_1) = (2n+1)\int_{x_1}^{x_2} \frac{u_n u_{n+1}}{x^2}\,dx \tag{4.349}$$
Let us take $u_{n+1}'(x_1) > 0$, $u_{n+1}'(x_2) < 0$, $u_{n+1}(x) > 0$ in the interval
$x_1 \le x \le x_2$. Suppose then that $u_n$ is positive on the whole interval; the
left-hand side is negative and the right-hand side positive. If $u_n$ were
negative throughout, the sign of each side would be reversed. In either
case we arrive at a contradiction. Therefore, $u_n$ must change sign (that
is, have a zero) between successive zeros of $u_{n+1}$.
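Theorem 2 can be illustrated numerically for $n = 0$. The sketch below is an illustration under our own assumptions (series summation, a fixed sampling grid on $0.5 \le x \le 14$, and bisection refinement): it lists the zeros of $J_0$ and $J_1$ and checks that each pair of successive zeros of $J_1$ brackets a zero of $J_0$.

```python
import math

def bessel_series(n, x, terms=80):
    """J_n(x) from the series (4.339), built term by term; assumes x > 0 and n >= 0."""
    term = (x / 2.0) ** n / math.gamma(n + 1)
    total = term
    for l in range(terms):
        # ratio of successive terms of (4.339)
        term *= -((x / 2.0) ** 2) / ((l + 1) * (n + l + 1))
        total += term
    return total

def zeros_between(f, a, b, samples=1000):
    """Zeros of f on [a, b]: sample for sign changes, then refine each by bisection."""
    zeros = []
    step = (b - a) / samples
    for k in range(samples):
        lo, hi = a + k * step, a + (k + 1) * step
        f_lo = f(lo)
        if f_lo * f(hi) < 0.0:
            for _ in range(50):
                mid = 0.5 * (lo + hi)
                f_mid = f(mid)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            zeros.append(0.5 * (lo + hi))
    return zeros

def interlaced(inner, outer):
    """True if every pair of successive zeros in `outer` brackets a zero from `inner`."""
    return all(any(a < z < b for z in inner) for a, b in zip(outer, outer[1:]))

zeros_j0 = zeros_between(lambda x: bessel_series(0, x), 0.5, 14.0)
zeros_j1 = zeros_between(lambda x: bessel_series(1, x), 0.5, 14.0)

if __name__ == "__main__":
    print(zeros_j0)
    print(zeros_j1)
    print(interlaced(zeros_j0, zeros_j1))
```

The printed separations also show the zeros of both functions settling toward a spacing of $\pi$ as $x$ grows.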


Problem 4-28: Prove that between any two zeros 
of one solution of the Bessel equation, there is always a zero of the other. 


30. Hankel Functions 


Still a different pair of independent solutions of the Bessel equation 
may be obtained by returning to the equation (4.328) and making the 
substitution 


$$y = x^{n}\,u \tag{4.350}$$
The function $u(x)$ satisfies the equation
$$x u'' + (2n+1)u' + x u = 0 \tag{4.351}$$
This is another particular case of (4.256), in which
$$f(s) = s^2 + 1 \tag{4.352}$$
and
$$g(s) = (2n+1)s \tag{4.353}$$


Solutions may then be found in the form


$$u = \int_{s_1}^{s_2} e^{sx}F(s)\,ds \tag{4.354}$$
where
$$F(s) = \frac{1}{s^2+1}\exp\left[\int^{s} \frac{(2n+1)s'}{s'^2+1}\,ds'\right] \tag{4.355}$$



and $s_1$ and $s_2$ are any two values for which


$$f(s)F(s)\,e^{sx} = 0 \tag{4.356}$$
Doing the integral in (4.355) we find that
$$F(s) = (s^2+1)^{n-(1/2)} \tag{4.357}$$
so that (4.356) becomes
$$(s^2+1)^{n+(1/2)}\,e^{sx} = 0 \tag{4.358}$$
There are various possible choices for $s_1$ and $s_2$. Taking $s_1 = -i$ and
$s_2 = i$ the solution is, from (4.257),
$$y = C_n\,x^{n}\int_{-i}^{i} e^{sx}(s^2+1)^{n-(1/2)}\,ds \tag{4.359}$$


$C_n$ being a constant. Putting $s = it$ and taking


$$C_n = -\frac{i}{2^{n}\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})} \tag{4.360}$$


this becomes the Bessel function $J_n(x)$.
Consider the integral around the contour shown in Fig. 4.7, including
the factor $C_n x^n$. Since the integrand has no singularities inside the contour,
the complete integral is zero. The contribution from the portion
running from $i - R$ to $-i - R$ approaches zero as $R \to \infty$ when $x > 0$ by
virtue of the factor $e^{-Rx}$. (When $x < 0$, the contour may be drawn to the
right of the imaginary axis.)


Figure 4.7 



It follows, then, that the integral from $-i$ to $i$, which is the Bessel
function $J_n(x)$, is equal to the sum of two other integrals:

(a) One running from $i - \infty$ to $i$, which we designate as $\tfrac{1}{2}H_n^{(1)}(x)$,
$H_n^{(1)}$ being called the Hankel function of the first kind of order $n$, and

(b) One running from $-i$ to $-i - \infty$, to be designated $\tfrac{1}{2}H_n^{(2)}(x)$, $H_n^{(2)}$
being the Hankel function of the second kind of order $n$. Thus


$$J_n(x) = \tfrac{1}{2}\left[H_n^{(1)}(x) + H_n^{(2)}(x)\right] \tag{4.361}$$
where
$$H_n^{(1)}(x) = -\frac{i\,x^{n}}{2^{n-1}\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})}\int_{i-\infty}^{i} e^{sx}(s^2+1)^{n-(1/2)}\,ds \tag{4.362}$$
and
$$H_n^{(2)}(x) = -\frac{i\,x^{n}}{2^{n-1}\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})}\int_{-i}^{-i-\infty} e^{sx}(s^2+1)^{n-(1/2)}\,ds \tag{4.363}$$


We note that $H_n^{(2)}(x)$ is the complex conjugate of $H_n^{(1)}(x)$, and consequently
that $J_n(x)$ is the real part of both. It is also evident that $H_n^{(1)}$ and
$H_n^{(2)}$ are independently solutions of the Bessel equation, since the condition
(4.358) is satisfied at the limits in each case. Therefore, the imaginary
parts of $H_n^{(1)}$ and $H_n^{(2)}$ are also solutions. The imaginary part of
$H_n^{(1)}$ is known as the Neumann function $N_n(x)$, and is a second real solution
independent of $J_n(x)$. The relations between the various solutions
are


$$H_n^{(1)} = J_n + iN_n \tag{4.364}$$
$$H_n^{(2)} = J_n - iN_n \tag{4.365}$$
$$J_n = \tfrac{1}{2}\left[H_n^{(1)} + H_n^{(2)}\right] \tag{4.366}$$
$$N_n = \frac{1}{2i}\left[H_n^{(1)} - H_n^{(2)}\right] \tag{4.367}$$


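An alternative form of $H_n^{(1)}$ may be obtained by making use of the
substitution


$$s = i - \sigma \tag{4.368}$$
This leads to the form


$$H_n^{(1)}(x) = -\frac{i\,x^{n}e^{ix}}{2^{n-1}\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})}\int_0^{\infty} e^{-\sigma x}\,\sigma^{n-(1/2)}(\sigma - 2i)^{n-(1/2)}\,d\sigma \tag{4.369}$$
Expanding in a binomial series the third term in the integrand, we obtain,
for $(n - \tfrac{1}{2})$ not an integer,
$$H_n^{(1)}(x) = -\frac{i\,x^{n}e^{ix}(-2i)^{n-(1/2)}}{2^{n-1}\sqrt{\pi}\,\Gamma(n+\tfrac{1}{2})}\sum_{l=0}^{\infty} \binom{n-\tfrac{1}{2}}{l}\left(\frac{i}{2}\right)^{l}\int_0^{\infty} e^{-\sigma x}\,\sigma^{n+l-(1/2)}\,d\sigma$$


$$= \frac{(-i)^{n+(1/2)}\,e^{ix}}{\Gamma(n+\tfrac{1}{2})}\sqrt{\frac{2}{\pi x}}\,\sum_{l=0}^{\infty} \binom{n-\tfrac{1}{2}}{l}\,\Gamma\left(n+l+\tfrac{1}{2}\right)\left(\frac{i}{2x}\right)^{l} \tag{4.370}$$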
0 



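This is an asymptotic expansion; for very large $x$ one has the asymptotic
forms


$$H_n^{(1)}(x) \approx \sqrt{\frac{2}{\pi x}}\,\exp i\left(x - \frac{n\pi}{2} - \frac{\pi}{4}\right) \tag{4.371}$$
$$J_n(x) \approx \sqrt{\frac{2}{\pi x}}\,\cos\left(x - \frac{n\pi}{2} - \frac{\pi}{4}\right) \tag{4.372}$$


$$N_n(x) \approx \sqrt{\frac{2}{\pi x}}\,\sin\left(x - \frac{n\pi}{2} - \frac{\pi}{4}\right) \tag{4.373}$$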
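The quality of the asymptotic form (4.372) can be judged by comparing it with values of $J_n(x)$ computed from the integral representation (4.340). The sketch below again puts $t = \sin\theta$ in (4.340), an assumption made here purely for numerical smoothness, and uses Simpson's rule; it is intended for small integer orders, since for large $n$ the prefactor $(x/2)^n$ amplifies rounding error.

```python
import math

def bessel_j(n, x, steps=4000):
    """J_n(x) from (4.340) with t = sin(theta); Simpson's rule, steps even, n >= 0."""
    h = (math.pi / 2.0) / steps

    def integrand(theta):
        return math.cos(theta) ** (2 * n) * math.cos(x * math.sin(theta))

    total = integrand(0.0) + integrand(math.pi / 2.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(k * h)
    prefactor = 2.0 / (math.sqrt(math.pi) * math.gamma(n + 0.5)) * (x / 2.0) ** n
    return prefactor * total * h / 3.0

def bessel_j_asymptotic(n, x):
    """Leading asymptotic form (4.372)."""
    return math.sqrt(2.0 / (math.pi * x)) * math.cos(x - n * math.pi / 2.0 - math.pi / 4.0)

if __name__ == "__main__":
    for x in (5.0, 30.0, 60.0):
        print(x, bessel_j(0, x), bessel_j_asymptotic(0, x))
```

The discrepancy falls off roughly like $x^{-3/2}$, as expected from the first neglected term of (4.370).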


When zn is an integer plus 3, the sum in (4.370) terminates and we ob- 
tain closed forms for the functions as periodic functions times polynomials 
in inverse powers of x. In this case the functions of greatest interest are 
the so-called “‘spherical Bessel functions,”’ which occur in the theory of 
spherical waves in the next chapter; they are designated 


ji(x) = Vz Jisci2y(x) (4.374) 
n(x) = VEN 14+-¢1/2) (X) (4.375) 
hf? (x) = Vz H G2 (x) (4.376) 


Again, it is convenient to deal with the Hankel function of the first kind.
It is derivable from (4.370), and may be simplified by noting that the
binomial coefficient in the sum becomes $\binom{n}{l}$, so that the sum
terminates at $l = n$, and the $\Gamma$ functions become simply factorials.
Thus we find

$$h_n^{(1)}(x) = (-i)^{n+1}\,\frac{e^{ix}}{x}\sum_{l=0}^{n}\frac{(n+l)!}{l!\,(n-l)!}\left(\frac{i}{2x}\right)^{l} \tag{4.377}$$

$h_n^{(2)}(x)$ is found by taking the complex conjugate equation, and $j_n(x)$
and $n_n(x)$ by picking out real and imaginary parts. For convenience the
first three of these spherical Bessel functions are tabulated herewith:

$$\begin{aligned}
j_0(x) &= \frac{\sin x}{x}, & n_0(x) &= -\frac{\cos x}{x} \\
j_1(x) &= \frac{\sin x}{x^2}-\frac{\cos x}{x}, & n_1(x) &= -\frac{\cos x}{x^2}-\frac{\sin x}{x} \\
j_2(x) &= \left(\frac{3}{x^2}-1\right)\frac{\sin x}{x}-\frac{3\cos x}{x^2}, & n_2(x) &= \left(1-\frac{3}{x^2}\right)\frac{\cos x}{x}-\frac{3\sin x}{x^2}
\end{aligned} \tag{4.378}$$
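The tabulated forms can be checked against the finite sum (4.377). The following is a minimal sketch, not part of the original text; the function names are illustrative, and only the Python standard library is assumed.

```python
import cmath
import math


def spherical_hankel1(n, x):
    """h_n^(1)(x) from the finite sum (4.377); n is a non-negative integer."""
    total = sum(
        math.factorial(n + ell) / (math.factorial(ell) * math.factorial(n - ell))
        * (1j / (2 * x)) ** ell
        for ell in range(n + 1)
    )
    return (-1j) ** (n + 1) * cmath.exp(1j * x) / x * total


def tabulated(n, x):
    """(j_n, n_n) for n = 0, 1, 2, written as in the table (4.378)."""
    s, c = math.sin(x), math.cos(x)
    j = [s / x, s / x**2 - c / x, (3 / x**2 - 1) * s / x - 3 * c / x**2]
    y = [-c / x, -c / x**2 - s / x, (1 - 3 / x**2) * c / x - 3 * s / x**2]
    return j[n], y[n]


# The real and imaginary parts of h_n^(1) should reproduce j_n and n_n.
for n in range(3):
    h = spherical_hankel1(n, 1.7)
    print(n, (h.real, h.imag), tabulated(n, 1.7))
```

The real part of the sum gives $j_n$ and the imaginary part gives $n_n$, which is the statement made in the text just before the table.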




31. Further Formulas Involving 
Bessel Functions 


Let us, for the moment, designate by $Z_n$ an arbitrary solution of the
Bessel equation. It is simple to verify that the equation can be written in
the “factored” form

$$\left(\frac{d}{dx}+\frac{n+1}{x}\right)\left(\frac{d}{dx}-\frac{n}{x}\right)Z_n = -Z_n \tag{4.379}$$

Now let us define

$$\tilde{Z}_n = \left(\frac{d}{dx}-\frac{n}{x}\right)Z_n \tag{4.380}$$

If we now operate on Eq. (4.379) with the operator $(d/dx - n/x)$, it
becomes

$$\left(\frac{d}{dx}-\frac{n}{x}\right)\left(\frac{d}{dx}+\frac{n+1}{x}\right)\tilde{Z}_n = -\tilde{Z}_n \tag{4.381}$$

But, on expansion, this equation takes the form

$$\frac{d^2\tilde{Z}_n}{dx^2}+\frac{1}{x}\frac{d\tilde{Z}_n}{dx}+\left[1-\frac{(n+1)^2}{x^2}\right]\tilde{Z}_n = 0 \tag{4.382}$$

which is the Bessel equation of order $(n+1)$. We may therefore put

$$\left(\frac{d}{dx}-\frac{n}{x}\right)Z_n = -Z_{n+1} \tag{4.383}$$

The minus sign is conventional; the reason for its choice will be evident as
we proceed. In a similar way it is easily shown that

$$\left(\frac{d}{dx}+\frac{n}{x}\right)Z_n = Z_{n-1} \tag{4.384}$$


Problem 4-29: Verify directly that if $Z_n$ is $J_n(x)$, as
given by the integral representation (4.340), the relations (4.383) and
(4.384) are satisfied.


Problem 4-30: Show that the relations (4.383) and
(4.384) may be written

$$\frac{d}{dx}\left(x^{-n}Z_n\right) = -x^{-n}Z_{n+1}$$

$$\frac{d}{dx}\left(x^{n}Z_n\right) = x^{n}Z_{n-1}$$


The relations (4.383) and (4.384) may be combined to yield the
following useful formulas:

$$2\,\frac{dZ_n}{dx} = Z_{n-1} - Z_{n+1} \tag{4.385}$$

and

$$Z_{n-1} + Z_{n+1} = \frac{2n}{x}\,Z_n \tag{4.386}$$
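The two recursion formulas can be tested numerically for the particular solution $Z_n = J_n$. The sketch below is an illustration only: it assumes the standard power series for $J_n$ of integral order (not derived in this section) together with $J_{-n} = (-1)^n J_n$, and it estimates the derivative by a central difference. All function names are hypothetical.

```python
import math


def bessel_j(n, x, terms=40):
    """J_n(x) for integer n from its power series (adequate for moderate x)."""
    if n < 0:
        return (-1) ** (-n) * bessel_j(-n, x, terms)
    return sum(
        (-1) ** k / (math.factorial(k) * math.factorial(n + k)) * (x / 2) ** (2 * k + n)
        for k in range(terms)
    )


def bessel_j_prime(n, x, h=1e-5):
    """Central-difference estimate of dJ_n/dx."""
    return (bessel_j(n, x + h) - bessel_j(n, x - h)) / (2 * h)


def residual_4385(n, x):
    """2 dZ_n/dx - (Z_{n-1} - Z_{n+1}); should vanish up to differencing error."""
    return 2 * bessel_j_prime(n, x) - (bessel_j(n - 1, x) - bessel_j(n + 1, x))


def residual_4386(n, x):
    """Z_{n-1} + Z_{n+1} - (2n/x) Z_n; should vanish up to rounding error."""
    return bessel_j(n - 1, x) + bessel_j(n + 1, x) - 2 * n / x * bessel_j(n, x)


print(residual_4385(2, 1.5), residual_4386(2, 1.5))
```

Both residuals are small compared with the individual terms, which is all a finite-difference check of this kind can show.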


Problem 4-31: Using the appropriate recursion
relations, show that

$$\int\left[(a^2-b^2)\,x-\frac{m^2-n^2}{x}\right]Z_m(ax)\,Z_n(bx)\,dx
= x\left[b\,Z_m(ax)\,Z_{n-1}(bx)-a\,Z_{m-1}(ax)\,Z_n(bx)\right]+(m-n)\,Z_m(ax)\,Z_n(bx)$$


A commonly used representation of the Bessel functions of integral
order can be obtained starting from the relation (4.385). We multiply both
sides of the equation, which is of course valid for both positive and
negative integral values of $n$, by $\xi^n$ and sum over all $n$'s. If we
write

$$F(\xi,x) = \sum_{n=-\infty}^{\infty} J_n(x)\,\xi^n \tag{4.387}$$

we see that $F$ satisfies the equation

$$2\,\frac{\partial F}{\partial x} = \left(\xi-\frac{1}{\xi}\right)F \tag{4.388}$$

This may be integrated to give

$$F(\xi,x) = F_0(\xi)\,\exp\left[\frac{1}{2}\left(\xi-\frac{1}{\xi}\right)x\right]$$

The value of $F_0(\xi)$ may be obtained by noting that $F(\xi,0) = 1$; thus
$F_0(\xi) = 1$ and

$$F(\xi,x) = \sum_{n=-\infty}^{\infty} J_n(x)\,\xi^n = \exp\left[\frac{1}{2}\left(\xi-\frac{1}{\xi}\right)x\right] \tag{4.389}$$

The function on the right is the “generating function” for the Bessel
functions $J_n(x)$.


By the elementary theory of contour integration it is obvious that we
may write

$$J_n(x) = \frac{1}{2\pi i}\oint \xi^{-n-1}\exp\left[\frac{x}{2}\left(\xi-\frac{1}{\xi}\right)\right]d\xi \tag{4.390}$$

To obtain still another useful form for the Bessel function, let us put,
in (4.389),

$$\xi = i e^{i\theta} \tag{4.391}$$

Equation (4.389) then takes the form

$$e^{ix\cos\theta} = \sum_{n=-\infty}^{\infty} i^n J_n(x)\,e^{in\theta} \tag{4.392}$$

The right-hand side is the Fourier series expansion of the periodic
function $e^{ix\cos\theta}$. From the equation for Fourier coefficients (4.11),

$$J_n(x) = \frac{1}{2\pi i^n}\int_{-\pi}^{\pi} e^{ix\cos\theta}\,e^{-in\theta}\,d\theta \tag{4.393}$$

Because $\cos\theta$ is an even function, this may be written

$$J_n(x) = \frac{1}{\pi i^n}\int_0^{\pi} e^{ix\cos\theta}\cos n\theta\,d\theta \tag{4.394}$$
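Both the integral form (4.394) and the generating function (4.389) lend themselves to a quick numerical check. The sketch below is illustrative only: the reference values come from the standard power series for $J_n$ (an assumption, since the series is not given in this section), the integral is evaluated by the midpoint rule, and the function names are hypothetical.

```python
import cmath
import math


def bessel_series(n, x, terms=40):
    """J_n(x), n >= 0, from the standard power series (reference values only)."""
    return sum(
        (-1) ** k / (math.factorial(k) * math.factorial(n + k)) * (x / 2) ** (2 * k + n)
        for k in range(terms)
    )


def bessel_integral(n, x, steps=400):
    """J_n(x) from (4.394), using the midpoint rule on 0 <= theta <= pi."""
    h = math.pi / steps
    total = 0j
    for k in range(steps):
        theta = (k + 0.5) * h
        total += cmath.exp(1j * x * math.cos(theta)) * math.cos(n * theta)
    return total * h / (math.pi * 1j**n)


def generating_sum(xi, x, n_max=30):
    """Partial sum of (4.389) over -n_max <= n <= n_max, with J_{-n} = (-1)^n J_n."""
    total = 0j
    for n in range(-n_max, n_max + 1):
        j_n = bessel_series(abs(n), x) * ((-1) ** n if n < 0 else 1)
        total += j_n * xi**n
    return total


xi = 0.8 * cmath.exp(0.3j)  # arbitrary non-zero test value of xi
print(bessel_integral(1, 2.0), bessel_series(1, 2.0))
print(generating_sum(xi, 2.0), cmath.exp(0.5 * (xi - 1 / xi) * 2.0))
```

The midpoint rule converges very rapidly here because the integrand extends to a smooth, even, periodic function of $\theta$.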


PRELUDE TO CHAPTER 5 


In this chapter we develop first the theory of sound 
waves. The wave equation is derived, and shown to be 
satisfied by all the variables of a sound wave — velocity 
potential, velocity components, condensation, pres- 
sure. It is shown that all may be expressed in terms of 
a single scalar, that is, the velocity potential. 

Kinetic and potential energy densities, as well as 
the Hamiltonian, are derived in general and for plane 
waves in particular. The Lagrangian is written down, 
and the equations of motion shown to be derivable 
from a Hamilton’s principle. 

In the next section the theory of guided waves in 
enclosures is developed for both rectangular and circu- 
lar cross section. The “filtering out” of high-frequency 
waves by guides of small cross section is demonstrated. 

From the wave equation with a source term, the 
Green’s function and its transforms are derived, by 
carrying out a Fourier transform in spatial coordinates 
and a Laplace transform in time. 

The spherical wave equation is solved by separa- 
tion of variables. The angular functions are found to 
be spherical harmonics, and the radial ones spherical 
Bessel functions. It is shown that the two Hankel 
functions correspond to diverging and converging 
waves, respectively. An expression is obtained for the 
energy flux in the individual ‘‘partial waves.” 

At this point the basic result needed for the devel- 
opment of the theory of scattering of plane waves is 
derived, the expansion of a plane wave in spherical 
waves. 

The radiation from an arbitrary source distribu- 
tion is calculated, using the Green’s function deter- 
mined earlier. The expansion of the velocity poten- 
tial in spherical harmonics (multipoles) is carried out. 
The energy radiated is shown to be a sum of that from 
the various multipoles. It is demonstrated that the 
intensities from successive multipoles decrease by
factors approximately equal to the square of the ratio
of the dimensions of the radiating system to the 
wavelength. 

Expressing a general time-varying source in 
terms of its frequency components by Fourier trans- 
form, a general radiation formula is derived. In par- 
ticular, the radiation from a moving source is evaluated, 
and the “bow wave” phenomenon demonstrated in 
the case in which the source has a velocity greater than 
the velocity of sound. 

All elements of the preceding theory have analogs 
in the theory of electromagnetic waves, which is 
developed later in the chapter. The difference is simply 
that electromagnetic waves are vector waves, that is, 
what is propagated is a vector quantity.

We turn next to a more general discussion of 
Green’s functions for waves. The previously develop- 
ed Green’s function was for an instantaneous source 
in an infinite medium. If one turns to the problem of 
sound waves determined by given initial conditions 
and subject to given boundary conditions, it is found 
that both the initial disturbance and the boundaries 
act as sources, and that the same Green’s function 
describes the different kinds of sources. 

The problem of waves in enclosures, discussed 
earlier in the chapter, is now looked at again from a 
different viewpoint, namely, that of the propagation 
of waves due to a particular disturbance. It is shown 
that resonance occurs as the stimulating frequency 
approaches a natural frequency in the enclosure. 
Rectangular, spherical, and cylindrical enclosures are
considered.

The next major topic is the scattering of sound 
waves. There is, of course, a similar theory for electro- 
magnetic waves. The theory developed here is, how- 
ever, very close to that of particle waves in quantum 
mechanics, and results will be carried over to that 
problem when it is discussed in Chapter 9. 

When the incident plane wave is expressed as a 
superposition of spherical waves, each is separately 
scattered, and each has a “partial cross section” for 
scattering. General formulas are derived for these 
cross sections. The case in which there is absorption
of energy from the incident wave is also treated, and
absorption and total (scattering+ absorption) cross 
sections are calculated. The so-called ‘optical 
theorem,” developed in the context of quantum scat- 
tering theory by Bohr, Peierls, and Placzek is proven, 
as are certain basic inequalities derived by them for 
the cross sections. 

As a particular example, the problem of elastic
scattering by a small sphere is solved, and the “Rayleigh
sixth-power law” is proven.

We then turn to the propagation of electromagnet- 
ic waves. The theory of such waves is developed from 
Maxwell’s equations. The problem is formulated both 
in terms of potentials and of the Hertz vector. 

We next consider the problem of the intrinsic 
angular momentum of vector waves. A “‘spin operator”’ 
is derived, and its eigenstates are shown to corre- 
spond to right and left circularly polarized and longi- 
tudinally polarized waves. 

By considering the interaction of electromagnetic 
waves with charged matter, we develop the expres- 
sions for their energy and momentum density and flux. 

As a basis for the quantum theory of the electro- 
magnetic field, we develop the Lagrangian and 
Hamiltonian formulation for the electromagnetic 
field. It is shown how to analyze the fields in terms of 
normal modes and normal coordinates. The fact that 
these normal modes satisfy oscillator dynamics pro- 
vides the starting point for quantum electrodynamics. 
The Lagrangian of the interaction with charged par- 
ticles is also derived. 

The Coulomb or transverse gauge is used to dis- 
cuss the propagation of plane waves, and their basic 
properties derived. Plane waves are shown to be suit- 
able ‘‘normal modes”’ in an infinite medium. 

Next, we return to the theory of multipole radia- 
tion, which is somewhat more complex for the vector 
electromagnetic field than for the scalar sound one. 
Dynamic multipole moments are derived for sources 
consisting of both charges and magnetic moments. 
Formulas for both “electric” and “‘magnetic”’ multi- 
pole radiation are derived, as is a simple relation 
between energy and magnetic moment of the radiation. 




The theory of multipole radiation is of great im- 
portance in nuclear physics, where it is used to analyze 
the radiation emitted in nuclear transitions. 

Another problem which originated in the liquid 
drop model of nuclei, that of the radiation from a 
uniformly charged vibrating liquid drop, is fully 
solved. 

The next topic treated is that of Cerenkov radia- 
tion from particles moving through matter with a ve- 
locity greater than the velocity of light in the matter. 
This problem is very similar to that of supersonic 
sound waves dealt with earlier. 

The final topic of the chapter is that of plasmas in 
a magnetic field, with special emphasis on metallic 
solid state plasmas. The problem is that of finding the 
normal modes of the coupled system of charges and 
electromagnetic field. This is solved by a “self-con- 
sistent’” approximation, based on the following 
approach: The response of the charged particles 
(electrons) to an arbitrary electromagnetic field is 
calculated. This response comprises induced charge
and current densities. But these charges and currents
act as sources of electromagnetic fields. If these
are then taken to be the same as the original fields,
self-consistency is assured.

The dispersion relations of both longitudinal and 
circularly polarized transverse waves are obtained, 
and the normal modes calculated. Particular cases are 
plasmons (modified by the magnetic field) and heli- 
cons. The transmission of helicon waves through 
slabs is calculated. 

There is an appendix on the method of steepest 
descents, with applications to asymptotic expansions 
of the gamma function and the Bessel and Hankel 
functions, and a short one on the transformation of 
units in electromagnetic theory. 


REFERENCES 


Abraham, M. and R. Becker, Electricity and Magnetism. London: Blackie, 1937. 
Coulson, C. A., Waves. Edinburgh: Oliver and Boyd, 1948. 


Jackson, J. D., Classical Electrodynamics. New York: Wiley, 1962.




Landau, L. D. and E. M. Lifshitz, Electrodynamics of Continuous Media. 
Reading, Mass.: Addison-Wesley, 1960. 


Landau, L. D. and E. M. Lifshitz, Fluid Mechanics. New York: Wiley, 1959, 
Chapter 8. 


Lord Rayleigh, Theory of Sound. New York: Dover, 1945, 2 vols. 
Morse, P. M., Vibration and Sound. New York: McGraw-Hill, 1948. 


Panofsky, W. K. H. and M. Phillips, Classical Electricity and Magnetism. 
Reading, Mass.: Addison-Wesley, 1955. 


Sommerfeld, A., Electrodynamics. New York: Academic Press, 1952. 


PROPAGATION AND 
SCATTERING OF 
WAVES 


“Life is a wave, which in no two consecutive 
moments of its existence is composed of the same 
particles” 

John Tyndall 


1. Introduction 


We consider in this chapter some problems of wave propagation in 
classical physics, in particular, sound waves in a gas and electromagnetic 
waves both in vacuo and in material media. Sound waves are considered 
first since they are, as will be shown, describable in terms of a single 
scalar quantity; we therefore refer to them as “scalar waves.” (The scalar 
property is a consequence of the assumed isotropy of a gas. The situation 
is more complicated in the case of sound waves in a solid, where crystal- 
line anisotropy may make possible the differentiation of waves of different 
polarizations.)

Electromagnetic waves, on the other hand, may be described by a 
vector quantity. The additional degrees of freedom correspond to 
different polarizations of the wave. 

Other kinds of wavelike phenomena are possible, in the sense that 
the disturbance which is propagated may have another character. Elec- 
trons in wave mechanics are characterized by a spinor field; gravitational 
waves by a second-order tensor. We refer to the former in a later chapter, 
in which we discuss problems of quantum mechanics. 

Many of the techniques to be developed here will be seen to be
applicable in quantum mechanics, and mathematically similar problems
will arise. There are also, however, important differences. In nonrela- 
tivistic quantum mechanics, the time dependence of the field is different 
from that of sound or electromagnetic waves and suggests a similarity to 
the problems of diffusion to be considered in the next chapter. Thus, 
there is a relation between the problems only in so far as the spatial 
dependence of the field is concerned. Also, the unique and varied physical 
situations met in quantum phenomena give rise to more complex and 
subtle problems, most of which have no classical analog. 

Nevertheless, experience in the problems of classical wave propaga- 
tion provides a valuable background of technique with which to approach 
these more difficult problems. 


2. Sound Waves. Derivation 
of the Equations 


Sound waves are small disturbances in the pressure and density of 
a gas. The fact that they are small permits the linearization of the equations. 
Three equations are relevant. The first is the “equation of motion,”
which describes how a volume of gas reacts to the pressures exerted on
it by the rest of the gas; the second is the “equation of continuity,” which
connects the density and velocity fields which enter into the equation of
motion. The third and last is the “equation of state” relating pressure to
density.
We use a field approach to describe the state of the gas in motion. The
relevant fields are
(a) the density field $\rho(\mathbf{r}, t)$
(b) the velocity field $\mathbf{v}(\mathbf{r}, t)$
(c) the pressure field $p(\mathbf{r}, t)$
Consider the gas which is instantaneously within an infinitesimal volume
$\Delta V$. We must first calculate mass times acceleration for this gas. The
mass is, to the first order in $\Delta V$, $\rho\,\Delta V$. The acceleration is
the rate of change of velocity, viz.,

$$\lim_{\Delta t\to 0}\frac{1}{\Delta t}\left\{\mathbf{v}(\mathbf{r}+\mathbf{v}\,\Delta t,\ t+\Delta t)-\mathbf{v}(\mathbf{r},t)\right\}$$

the first term in the bracket being the velocity field at time $t+\Delta t$ at
the point to which a gas molecule, originally at $\mathbf{r}$, is displaced in
time $\Delta t$. This acceleration is, therefore,

$$\mathbf{f} = \frac{\partial\mathbf{v}}{\partial t}+(\mathbf{v}\cdot\nabla)\mathbf{v} \tag{5.1}$$


So much for mass and acceleration; what of the force appearing on
the other side of the Newtonian equation of motion? Assuming that there
are no significant external forces, but only the force of pressure exerted
by the rest of the gas, we must determine the net pressure force on our
volume. If we assume that the gas may be considered inviscid, the
pressure force is normal and is equal to

$$\mathbf{F}\,\Delta V = -\oint_{\Delta V} p\,\mathbf{n}\,dS$$

($\mathbf{F}$ is then the force field, or force per unit volume).
The integral is over the surface surrounding $\Delta V$ and $\mathbf{n}$ is the
unit outward normal vector to the surface. But on applying Gauss' theorem to
the vectors $p\mathbf{i}$, $p\mathbf{j}$, $p\mathbf{k}$, respectively,
$\mathbf{i}$, $\mathbf{j}$, and $\mathbf{k}$ being unit vectors along the
coordinate axes, we obtain

$$\oint p\,n_x\,dS = \int\frac{\partial p}{\partial x}\,d^3r$$

etc., the integrals on the right being volume integrals over the volume
$\Delta V$. Thus

$$\oint p\,\mathbf{n}\,dS = \int_{\Delta V}\nabla p\,d^3r$$

Since $\Delta V$ is small, again to the first order in $\Delta V$ this is equal to

$$\nabla p\,\Delta V$$

We are now in a position to write the equation of motion:

$$\rho\left(\frac{\partial\mathbf{v}}{\partial t}+(\mathbf{v}\cdot\nabla)\mathbf{v}\right) = -\nabla p \tag{5.2}$$


A continuity equation obviously connects $\rho$ and $\mathbf{v}$, since the
rate of change of density at a point is dependent on its rate of flow. For an
arbitrary fixed volume, the rate of change of the mass of gas inside the
volume is

$$\frac{d}{dt}\int\rho\,dV = \int\frac{\partial\rho}{\partial t}\,d^3r$$

The efflux over the boundary is

$$\oint\rho\,\mathbf{v}\cdot\mathbf{n}\,dS$$

which may be converted by Gauss' theorem into the volume integral

$$\int\nabla\cdot(\rho\mathbf{v})\,d^3r$$

Since the rate of efflux is the negative of the rate of change of the mass
inside, for arbitrary volume,

$$\frac{\partial\rho}{\partial t}+\nabla\cdot(\rho\mathbf{v}) = 0 \tag{5.3}$$


To define our problem completely, we must relate the pressure to the
density through an equation of state

$$p = f(\rho) \tag{5.4}$$

We shall see that, because the range of variation of $\rho$ in a sound wave is
very small, the detailed form of the relation is not important. If conditions
are adiabatic, the equation of state has the form

$$p = C\rho^{\gamma} \tag{5.5}$$

Equations (5.2)–(5.4) are quite complicated and in their general form
would be very difficult to solve. Fortunately, sound waves are only rather
minute disturbances of normal atmospheric conditions, and this fact
makes possible the simplification of the equations. Let us look first at a
static, silent atmosphere. Let the density be $\rho_0$ and the pressure $p_0$.
The velocity field will be zero. Then we may write

$$\rho = \rho_0(1+s) \tag{5.6}$$

where the quantity $s$, called the “condensation,” is the fractional variation
from equilibrium of the density. Then, from (5.4) and using the fact that
$s$ is small,

$$\delta p = p - p_0 = \rho_0\,s\,f'(\rho_0) \tag{5.7}$$

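We note finally that (5.2) requires that $\mathbf{v}\to 0$ as $s\to 0$, so that
$\mathbf{v}$ is also small. Keeping only terms of the first order in
$\mathbf{v}$ and $s$, we may considerably simplify (5.2). The first term may be
approximated by $\rho_0(\partial\mathbf{v}/\partial t)$, while the second is
negligible. Therefore, Eq. (5.2) becomes

$$\rho_0\,\frac{\partial\mathbf{v}}{\partial t} = -\rho_0\,f'(\rho_0)\,\nabla s$$

or

$$\frac{\partial\mathbf{v}}{\partial t} = -f'(\rho_0)\,\nabla s \tag{5.8}$$

In the same approximation (5.3) becomes

$$\frac{\partial s}{\partial t}+\nabla\cdot\mathbf{v} = 0 \tag{5.9}$$

the $\rho_0$ cancelling out.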
If we now take the divergence of (5.8) and the time derivative of (5.9)
we may eliminate $\mathbf{v}$ to obtain the equation for $s$:

$$\frac{\partial^2 s}{\partial t^2} = f'(\rho_0)\,\nabla^2 s$$

We designate $f'(\rho_0)$ by $c^2$; we see that $c$ is then the velocity of sound
propagation. The resulting equation for $s$, viz.,

$$\nabla^2 s = \frac{1}{c^2}\frac{\partial^2 s}{\partial t^2} \tag{5.10}$$

is the equation of wave propagation.
Since, from (5.7), the excess pressure is

$$\delta p = \rho_0 c^2 s \tag{5.11}$$

the pressure satisfies the same equation.
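For the adiabatic law (5.5) the definition $c^2 = f'(\rho_0)$ gives $c^2 = \gamma p_0/\rho_0$. The short sketch below checks this by differentiating the equation of state numerically. The numerical values (sea-level air with $\gamma = 1.4$, $p_0 = 101325$ Pa, $\rho_0 = 1.225$ kg/m³) are assumptions chosen for illustration and do not come from the text.

```python
import math

GAMMA = 1.4      # assumed ratio of specific heats for air
P0 = 101325.0    # assumed equilibrium pressure, Pa
RHO0 = 1.225     # assumed equilibrium density, kg/m^3
C_CONST = P0 / RHO0**GAMMA  # the constant C in (5.5), fixed by (p0, rho0)


def pressure(rho):
    """Adiabatic equation of state p = C rho^gamma, eq. (5.5)."""
    return C_CONST * rho**GAMMA


def sound_speed(rho0=RHO0, h=1e-6):
    """c = sqrt(f'(rho0)), with f' estimated by a central difference."""
    slope = (pressure(rho0 + h) - pressure(rho0 - h)) / (2 * h)
    return math.sqrt(slope)


c_numeric = sound_speed()
c_analytic = math.sqrt(GAMMA * P0 / RHO0)  # f'(rho0) = gamma p0 / rho0
print(c_numeric, c_analytic)
```

With these assumed values both estimates come out at roughly 340 m/s, the familiar order of magnitude for the speed of sound in air.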
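It is possible to show, finally, that if the flow is irrotational, that is
$\nabla\times\mathbf{v} = 0$, $\mathbf{v}$ also satisfies the wave equation. For
if we differentiate (5.8) with respect to time, and substitute for
$\partial s/\partial t$ from (5.9), we get

$$\frac{\partial^2\mathbf{v}}{\partial t^2} = c^2\,\nabla(\nabla\cdot\mathbf{v})$$

Now, from a well-known vector identity,
$\nabla\times(\nabla\times\mathbf{v}) = \nabla(\nabla\cdot\mathbf{v})-\nabla^2\mathbf{v}$.
Thus, if $\nabla\times\mathbf{v} = 0$, the right-hand side of the equation for
$\mathbf{v}$ becomes

$$c^2\,\nabla^2\mathbf{v}$$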


Of course, it is also known that if $\mathbf{v}$ is irrotational, we can write
it as the negative gradient of a potential $\phi$, which is known as the
“velocity potential”

$$\mathbf{v} = -\nabla\phi \tag{5.12}$$

Equation (5.8) tells us that

$$\nabla\left(\frac{\partial\phi}{\partial t}-c^2 s\right) = 0$$

or $(\partial\phi/\partial t) - c^2 s =$ a constant in space. We may choose
$\phi = 0$ in a quiescent atmosphere; since in this case $s = 0$ as well, we
may write

$$s = \frac{1}{c^2}\frac{\partial\phi}{\partial t} \tag{5.13}$$

$\phi$, too, satisfies the wave equation, for if we differentiate (5.13) with
respect to $t$ and substitute from (5.9), at the same time expressing
$\mathbf{v}$ in terms of $\phi$ by means of (5.12), we find that

$$\frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = \nabla^2\phi \tag{5.14}$$


Since we are dealing with a scalar field, we can choose any one of the
above variables to describe the sound wave, and write the other quantities
in terms of it. Thus, we may solve in terms of $\phi$. Then, $s$ is given by
(5.13), $\delta p$ by (5.11), and the velocity components by (5.12).
As in the case of the vibrating string problem, the equations of sound
propagation are only approximately linear. The approximations made are
(a) that the fractional density variations in a sound wave are small.
(b) that the second term in the equation of motion (5.2) is negligible
compared with the first. To appreciate more fully what this latter
approximation signifies, consider the identity

$$\mathbf{v}\times(\nabla\times\mathbf{v}) = \nabla\left(\tfrac{1}{2}v^2\right)-(\mathbf{v}\cdot\nabla)\mathbf{v}$$

If the flow is irrotational, $\nabla\times\mathbf{v} = 0$, so that the second term
in the bracket on the left-hand side of (5.2) is $\nabla(\tfrac{1}{2}v^2)$. But the
first term is $-\nabla(\partial\phi/\partial t)$. Therefore, the approximation is
that

$$\tfrac{1}{2}v^2 \ll \frac{\partial\phi}{\partial t} = c^2 s$$

or

$$\left(\frac{v}{c}\right)^2 \ll s \tag{5.15}$$


Consideration of some numerical magnitudes will now provide a test of
the validity of our approximations. At the threshold of hearing for
frequencies of 1000-5000 cycles/sec, $s$ is of the order of $10^{-10}$, while for
sounds loud enough to cause pain it is $10^{-4}$ to $10^{-3}$. Thus, over the whole
range of normal “sound” the approximation of small $s$ is indeed very good.
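These orders of magnitude follow from (5.11), $s = \delta p/\rho_0 c^2$, once pressure amplitudes are supplied. The sketch below is illustrative only: the amplitudes (about $2\times10^{-5}$ Pa near the threshold of hearing and about 60 Pa near the threshold of pain) and the air properties are assumed round figures, not values taken from the text.

```python
RHO0_C2 = 1.4 * 101325.0  # rho0 c^2 = gamma p0 for an adiabatic gas, Pa (assumed air values)


def condensation(pressure_amplitude):
    """s = (excess pressure) / (rho0 c^2), from (5.11)."""
    return pressure_amplitude / RHO0_C2


s_threshold = condensation(2e-5)  # assumed threshold-of-hearing amplitude, Pa
s_pain = condensation(60.0)       # assumed pain-level amplitude, Pa
print(s_threshold, s_pain)
```

Under these assumptions $s$ is of order $10^{-10}$ at the threshold of hearing and below $10^{-3}$ even for painfully loud sounds, so linearization in $s$ is well justified.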


Problem 5-1: A wind of velocity $v$ blows across a
Quonset hut in the shape of half a cylinder of radius $a$ and length $l$. If the
pressure inside is equal to that outside, show that there is a vertical force
on the hut of

$$\tfrac{5}{3}\,\rho v^2 a l$$
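As for $\mathbf{v}$, let us note that a plane wave solution for the
three-dimensional wave equation is

$$\phi = f(\mathbf{n}\cdot\mathbf{r}-ct) \tag{5.16}$$

where $f$ is an arbitrary function, and $\mathbf{n}$ is the normal to the wave
front. For if we define $u = \mathbf{n}\cdot\mathbf{r}-ct$,

$$\nabla^2\phi = (n_x^2+n_y^2+n_z^2)\,\frac{d^2 f}{du^2} = \frac{d^2 f}{du^2}$$

and

$$\frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = \frac{d^2 f}{du^2}$$

It follows that, for a plane wave

$$s = \frac{1}{c^2}\frac{\partial\phi}{\partial t} = -\frac{1}{c}\,f' \tag{5.17}$$

where $f'$ is the derivative of $f$ with respect to the argument $u$. But

$$\mathbf{v}\cdot\mathbf{n} = -\mathbf{n}\cdot\nabla f = -f' \tag{5.18}$$

Thus, $(\mathbf{v}\cdot\mathbf{n}/c)^2 = (1/c^2)f'^2 = s^2$. Consequently, the
approximation involved in (5.15) is equivalent to the approximation that $s$
is small.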
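The plane-wave statements (5.16) and (5.17) can be checked by finite differences. This is a sketch under stated assumptions: the Gaussian profile $f(u) = e^{-u^2}$, the sound speed, the unit normal, and the sample point are arbitrary illustrative choices, and the function names are hypothetical.

```python
import math

C = 340.0                  # assumed sound speed, m/s
N = (1 / 3, 2 / 3, 2 / 3)  # assumed unit normal (1/9 + 4/9 + 4/9 = 1)


def f(u):
    return math.exp(-u * u)  # assumed profile; any smooth f would do


def f_prime(u):
    return -2 * u * math.exp(-u * u)


def phi(x, y, z, t):
    """Plane wave (5.16)."""
    return f(N[0] * x + N[1] * y + N[2] * z - C * t)


def second_difference(g, h):
    """Central estimate of g''(0) for a function g of one offset variable."""
    return (g(h) - 2 * g(0.0) + g(-h)) / h**2


def wave_residual(x, y, z, t, h=1e-3):
    """Laplacian of phi minus (1/c^2) d^2 phi/dt^2, by central differences."""
    lap = (
        second_difference(lambda d: phi(x + d, y, z, t), h)
        + second_difference(lambda d: phi(x, y + d, z, t), h)
        + second_difference(lambda d: phi(x, y, z + d, t), h)
    )
    d2t = second_difference(lambda d: phi(x, y, z, t + d), h / C)
    return lap - d2t / C**2


def condensation(x, y, z, t, h=1e-3):
    """s = (1/c^2) d phi/dt, eq. (5.13), by a central difference in time."""
    dt = h / C
    return (phi(x, y, z, t + dt) - phi(x, y, z, t - dt)) / (2 * dt) / C**2


point = (0.3, -0.2, 0.5, 0.0005)
u = N[0] * point[0] + N[1] * point[1] + N[2] * point[2] - C * point[3]
print(wave_residual(*point), condensation(*point), -f_prime(u) / C)
```

The residual of the wave equation is at the level of the differencing error, and the condensation agrees with $-f'/c$ as stated in (5.17).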


3. Dynamics of Sound Waves 


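Let us first calculate the energy density (energy per unit volume) in
a sound disturbance. The kinetic energy density is of course
$\tfrac{1}{2}\rho v^2 \approx \tfrac{1}{2}\rho_0(\nabla\phi)^2$ in our approximation.
The potential energy may be calculated as follows: first we observe that the
potential energy is equal to the work done by the excess pressure in locally
compressing the gas. If we take an initial volume $\Delta V_0$, this work is

$$-\int_{\Delta V_0}^{\Delta V}\delta p(\Delta V')\,d(\Delta V')$$

Now $\rho = A/\Delta V$, where $A$ is the (constant) mass of gas in the volume,
so $d(\Delta V') = d(A/\rho) = (-A/\rho^2)\,\rho_0\,ds$. Keeping only leading
terms, the integral is

$$\frac{A}{\rho_0}\int\delta p\,ds$$

Using the fact that $\delta p = \rho_0 c^2 s$, and that $s = 0$ when the volume
is $\Delta V_0$, we obtain for the potential energy

$$\Delta V_0\int_0^{s}\rho_0 c^2 s\,ds$$

so that the potential energy per unit volume is

$$\mathcal{V} = \tfrac{1}{2}\rho_0 c^2 s^2 \tag{5.19}$$

Using this along with the fact that the kinetic energy per unit volume is

$$\mathcal{T} = \tfrac{1}{2}\rho_0(\nabla\phi)^2 \tag{5.20}$$

the energy density is then

$$\mathcal{H} = \tfrac{1}{2}\rho_0\left[(\nabla\phi)^2+c^2 s^2\right] \tag{5.21}$$

and the total energy is

$$H = \tfrac{1}{2}\rho_0\int\left[(\nabla\phi)^2+c^2 s^2\right]d^3r \tag{5.22}$$

Using the expression (5.13) for $s$ in terms of $\phi$

$$H = \tfrac{1}{2}\rho_0\int\left[(\nabla\phi)^2+\frac{1}{c^2}\left(\frac{\partial\phi}{\partial t}\right)^2\right]d^3r \tag{5.23}$$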


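An unusual feature of the use of the velocity potential $\phi$ as dynamical
variable is that the potential energy involves time derivatives and the
kinetic energy, spatial derivatives (in contrast, for instance, with the
situation in the vibrating string problem).

We can use the expression (5.21) for the energy density to obtain
one for the energy flux vector. Defining this latter quantity $\mathbf{P}$ such
that $\mathbf{P}\cdot\mathbf{n}\,dS$ is the energy flow per unit time over a
surface $dS$ with unit normal vector $\mathbf{n}$, we can write down a
continuity equation for the energy balance in a volume: rate of change of
enclosed energy = rate of influx over the surface. We express this by the
equation

$$\frac{\partial\mathcal{H}}{\partial t} = -\nabla\cdot\mathbf{P}$$

Now, using the wave equation (5.14),

$$\frac{\partial\mathcal{H}}{\partial t}
= \rho_0\left[\nabla\phi\cdot\nabla\frac{\partial\phi}{\partial t}+\frac{1}{c^2}\frac{\partial\phi}{\partial t}\frac{\partial^2\phi}{\partial t^2}\right]
= \rho_0\,\nabla\cdot\left(\frac{\partial\phi}{\partial t}\,\nabla\phi\right) \tag{5.24}$$

We may therefore identify $\mathbf{P}$ as

$$\mathbf{P} = -\rho_0\,\frac{\partial\phi}{\partial t}\,\nabla\phi = \delta p\,\mathbf{v} \tag{5.25}$$

It is evident that the energy flux over an element of surface is, then, the
rate at which the excess pressure is doing work across this surface.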


The momentum and angular momentum densities may also be written
down on the basis of simple physical considerations:

$$\text{Momentum density} = \rho_0\,s\,\mathbf{v} \tag{5.26}$$

$$\text{Angular momentum density} = \rho_0\,s\,\mathbf{r}\times\mathbf{v} \tag{5.27}$$

It is interesting to see the relationships between these quantities for the
plane disturbance (5.16). We find that

$$\mathcal{T} = \tfrac{1}{2}\rho_0 f'^2 \tag{5.28}$$
$$\mathcal{V} = \tfrac{1}{2}\rho_0 f'^2 \tag{5.29}$$
$$\mathcal{H} = \rho_0 f'^2 \tag{5.30}$$
$$\mathbf{P} = \rho_0\,\mathbf{n}\,c\,f'^2 \tag{5.31}$$

(Angular momentum is not interesting for plane waves.) Note first that
kinetic and potential energy densities are equal. The virial theorem of
classical mechanics¹ says that this must be true of time averages in the
case of quadratic potentials, that is, the case of harmonic vibration. Here it
is further true that these energies are equal at all times.

The second fact to observe is that the energy flux is $c$ times the
energy density. Since the loci of points of constant phase are the planes

$$\mathbf{n}\cdot\mathbf{r}-ct = \text{constant} \tag{5.32}$$

and the normal to these planes is $\mathbf{n}$, we see that energy is propagated
perpendicular to the wave front with velocity $c$. It is this which justifies
the statement that $c$ is the velocity of sound. From (5.32) it appears that
planes of constant phase also move in the direction of their normal with
velocity $c$. Thus, the “phase velocity” and the “group velocity” for
sound waves are equal.
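The relations (5.28)–(5.31) amount to a few lines of arithmetic, which the following sketch carries out directly from the definitions (5.11), (5.17), (5.19), (5.20), and (5.25). The density, sound speed, wave normal, and the sample value of $f'$ are assumed for illustration; the function name is hypothetical.

```python
RHO0 = 1.2           # assumed equilibrium density, kg/m^3
C = 340.0            # assumed sound speed, m/s
N = (0.0, 0.6, 0.8)  # assumed unit normal to the wave fronts


def plane_wave_energetics(f_prime):
    """Energy densities and flux for phi = f(n.r - ct), given the value of f'(u)."""
    v = tuple(-n_i * f_prime for n_i in N)              # v = -grad(phi) = -n f'
    s = -f_prime / C                                    # (5.17)
    kinetic = 0.5 * RHO0 * sum(v_i * v_i for v_i in v)  # (5.20)
    potential = 0.5 * RHO0 * C**2 * s**2                # (5.19)
    excess_pressure = RHO0 * C**2 * s                   # (5.11)
    flux = tuple(excess_pressure * v_i for v_i in v)    # (5.25)
    return kinetic, potential, flux


kinetic, potential, flux = plane_wave_energetics(0.02)
print(kinetic, potential, flux)
```

The kinetic and potential densities coincide, and the flux equals $c$ times the total energy density directed along $\mathbf{n}$, as the text asserts.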

The theory of Fourier transforms in three dimensions enables us to
find quite general solutions. If we take the Fourier transform
$\Phi(\mathbf{k}, t)$ of the Eq. (5.14) we get

$$\frac{\partial^2\Phi}{\partial t^2} = -k^2 c^2\,\Phi$$

A simple solution of this equation is

$$\Phi(\mathbf{k}, t) = \Phi(\mathbf{k}, 0)\,e^{-ikct} \tag{5.33}$$

Thus

$$\phi(\mathbf{r}, t) = \int\Phi(\mathbf{k}, t)\,e^{i\mathbf{k}\cdot\mathbf{r}}\,d^3k
= \int\Phi(\mathbf{k}, 0)\,e^{i(\mathbf{k}\cdot\mathbf{r}-kct)}\,d^3k \tag{5.34}$$

This represents a general superposition of periodic plane waves. [The
choice of the time dependence $e^{ikct}$ in (5.33) would simply produce a wave
with wave number vector $-\mathbf{k}$; since all $\mathbf{k}$'s are included in
(5.34) this would give nothing new.]


4. Lagrangian and 
Hamiltonian Formulation 


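The Lagrangian of the system is, by definition,

$$L = \int(\mathcal{T}-\mathcal{V})\,d^3r \tag{5.35}$$

$$= \tfrac{1}{2}\rho_0\int\left[(\nabla\phi)^2-\frac{1}{c^2}\left(\frac{\partial\phi}{\partial t}\right)^2\right]d^3r \tag{5.36}$$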


¹ See H. Goldstein, Classical Mechanics. Reading, Mass.: Addison-Wesley, 1957,
Chapter 3, p. 69.

In view of the observation, made above, that in this problem velocities
are represented by spatial derivatives and potential energy by time
derivatives, there would at first sight seem to be some doubt as to how we
should define generalized momenta for the problem. Should we differentiate $L$
with respect to the velocities $(-\nabla\phi)$ or the time derivatives
$(\partial\phi/\partial t)$? The answer is in fact simply based on
considerations of convenience. If we use (5.36) as the basis for an action
principle,

$$\delta\int L\,dt = 0 \tag{5.37}$$


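it is easily verified that we get as the equations of motion, the wave
equation (we use the plural term “equations of motion” rather than the
singular because, in the sense of dynamics, there is an equation for each
$\mathbf{x}$). For

$$\delta\int L\,dt = \rho_0\iint\left[\nabla\phi\cdot\delta(\nabla\phi)-\frac{1}{c^2}\frac{\partial\phi}{\partial t}\,\delta\!\left(\frac{\partial\phi}{\partial t}\right)\right]d^3r\,dt
= \rho_0\iint\left[\nabla\phi\cdot\nabla(\delta\phi)-\frac{1}{c^2}\frac{\partial\phi}{\partial t}\frac{\partial}{\partial t}(\delta\phi)\right]d^3r\,dt$$

Using, in the first term,

$$\nabla\cdot(\delta\phi\,\nabla\phi) = \nabla\phi\cdot\nabla(\delta\phi)+\delta\phi\,\nabla^2\phi \tag{5.38}$$

and noting that, on integration, the left-hand side transforms into a
vanishing surface integral at infinity, that term becomes

$$-\rho_0\iint\delta\phi\,\nabla^2\phi\;d^3r\,dt$$

The other term can be integrated by parts with respect to $t$, with the usual
understanding that we must take $\delta\phi = 0$ at the limits of integration.
It then becomes

$$\frac{\rho_0}{c^2}\iint\delta\phi\,\frac{\partial^2\phi}{\partial t^2}\;d^3r\,dt$$

The sum of these terms will then be zero for arbitrary $\delta\phi$ if and only
if the wave equation

$$\nabla^2\phi = \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} \tag{5.39}$$

is satisfied.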
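Now, if we generalize this a bit, and take for the Lagrangian

$$L = \int\mathcal{L}\left(\phi,\ \nabla\phi,\ \frac{\partial\phi}{\partial t}\right)d^3r \tag{5.40}$$

the action principle gives

$$\delta\int L\,dt = 0 = \iint\left[\frac{\partial\mathcal{L}}{\partial\phi}\,\delta\phi
+\frac{\partial\mathcal{L}}{\partial(\nabla\phi)}\cdot\nabla\delta\phi
+\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial t)}\,\frac{\partial\,\delta\phi}{\partial t}\right]d^3r\,dt$$

$$= \iint\left[\frac{\partial\mathcal{L}}{\partial\phi}
-\nabla\cdot\frac{\partial\mathcal{L}}{\partial(\nabla\phi)}
-\frac{\partial}{\partial t}\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial t)}\right]\delta\phi\;d^3r\,dt$$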


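which leads to the equation of motion

$$\frac{\partial}{\partial t}\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial t)}
+\nabla\cdot\frac{\partial\mathcal{L}}{\partial(\nabla\phi)}
-\frac{\partial\mathcal{L}}{\partial\phi} = 0 \tag{5.41}$$

The negative of the quantity on the left-hand side of this equation is
known as the variational derivative of $L$ with respect to $\phi$, and may be
written $\delta L/\delta\phi$.
By analogy with the string equation (1.6) we may define the “generalized”
momentum as

$$\pi(\mathbf{r}, t) = \frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial t)} \tag{5.42}$$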
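If we do, we may follow the usual prescription for writing the Hamiltonian
in terms of the Lagrangian, taking account of the fact that the usual
sum $\sum_i p_i\dot{q}_i$ in

$$H = \sum_i p_i\dot{q}_i - L \tag{5.43}$$

must, because the coordinates are labeled by the continuous variables
$\mathbf{r}$, be replaced by the integral
$\int\pi\,(\partial\phi/\partial t)\,d^3r$. Thus the Hamiltonian is

$$H = \int\left(\pi\,\frac{\partial\phi}{\partial t}-\mathcal{L}\right)d^3r = \int\mathcal{H}\,d^3r \tag{5.44}$$

where $\mathcal{H}$ is known as the Hamiltonian density. In the case of our
wave equation we have

$$\pi = -\frac{\rho_0}{c^2}\frac{\partial\phi}{\partial t} = -\rho_0 s \tag{5.45}$$

by (5.13), so that

$$\mathcal{H} = -\frac{\rho_0}{c^2}\left(\frac{\partial\phi}{\partial t}\right)^2
-\left[\tfrac{1}{2}\rho_0(\nabla\phi)^2-\frac{\rho_0}{2c^2}\left(\frac{\partial\phi}{\partial t}\right)^2\right]
= -\tfrac{1}{2}\rho_0(\nabla\phi)^2-\frac{\rho_0}{2c^2}\left(\frac{\partial\phi}{\partial t}\right)^2 \tag{5.46}$$

This gives a Hamiltonian which is the negative of the total energy

$$H = \int\mathcal{H}\,d^3r = -\int(\mathcal{T}+\mathcal{V})\,d^3r \tag{5.47}$$

The sign in this equation would have been reversed, and the usual result
obtained, had we, at the beginning, changed the sign of $\mathcal{L}$; that is,
if we had treated the quadratic term in $\partial\phi/\partial t$ as a kinetic
energy term. Provided we are consistent, however, the change in sign has no
consequences for the physics of the problem.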
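As a consistency check, the general formulas (5.41), (5.42), and (5.44) can be applied to the integrand of (5.36). The following worked lines are a sketch that uses only equations already stated in this section.

```latex
\begin{align*}
\mathcal{L} &= \tfrac{1}{2}\rho_0\left[(\nabla\phi)^2
  - \frac{1}{c^2}\left(\frac{\partial\phi}{\partial t}\right)^2\right]
  && \text{integrand of (5.36)} \\
\frac{\partial\mathcal{L}}{\partial(\partial\phi/\partial t)}
  &= -\frac{\rho_0}{c^2}\frac{\partial\phi}{\partial t}, \qquad
\frac{\partial\mathcal{L}}{\partial(\nabla\phi)} = \rho_0\nabla\phi, \qquad
\frac{\partial\mathcal{L}}{\partial\phi} = 0 \\
0 &= -\frac{\rho_0}{c^2}\frac{\partial^2\phi}{\partial t^2} + \rho_0\nabla^2\phi
  && \text{from (5.41), i.e. the wave equation (5.39)} \\
\mathcal{H} &= \pi\,\frac{\partial\phi}{\partial t} - \mathcal{L}
  = -\tfrac{1}{2}\rho_0(\nabla\phi)^2
    - \frac{\rho_0}{2c^2}\left(\frac{\partial\phi}{\partial t}\right)^2
  && \text{agreeing with (5.46)}
\end{align*}
```

The first derivative listed is the generalized momentum of (5.42), so the same substitution reproduces (5.45) as well.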




5. Guided Waves 


Consider a sound wave of a definite frequency propagated down a
long tube of rectangular cross section. The sides of the tube may be taken
to be the directions of the $x$ and $y$ axes and have dimensions $a$ cm × $b$ cm.
Let propagation be in the $z$ direction. If we look for a solution of the
wave equation with a circular frequency $\omega$, it will be of the form

$$\phi = X(x)\,Y(y)\,Z(z)\,e^{-i\omega t} \tag{5.48}$$

Substituting this in the wave equation, dividing through by $XYZ$, and
cancelling out $e^{-i\omega t}$, we get the equation

$$\frac{1}{X}\frac{d^2X}{dx^2}+\frac{1}{Y}\frac{d^2Y}{dy^2}+\frac{1}{Z}\frac{d^2Z}{dz^2} = -\frac{\omega^2}{c^2} \tag{5.49}$$

Each of the three terms on the left must be a constant. The first two
constants are determined by the boundary conditions, which are that
$\partial\phi/\partial x = 0$ at $x = 0$ and $x = a$, and
$\partial\phi/\partial y = 0$ at $y = 0$ and $y = b$. Thus we must have

$$X = \cos\frac{l\pi x}{a}, \qquad \frac{1}{X}\frac{d^2X}{dx^2} = -\frac{l^2\pi^2}{a^2} \tag{5.50}$$

$$Y = \cos\frac{m\pi y}{b}, \qquad \frac{1}{Y}\frac{d^2Y}{dy^2} = -\frac{m^2\pi^2}{b^2} \tag{5.51}$$

Consequently,

$$\frac{1}{Z}\frac{d^2Z}{dz^2} = \frac{l^2\pi^2}{a^2}+\frac{m^2\pi^2}{b^2}-\frac{\omega^2}{c^2} \tag{5.52}$$

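Problem 5-2: The end of a long cylindrical pipe of
radius $a$ is made to vibrate with velocity

$$V = f(r)\sin\omega t$$

Show that the velocity potential is given by

$$\phi = \mathrm{Im}\sum_n
\frac{2\,e^{i\left[\sqrt{k^2-\lambda_n^2/a^2}\;z-\omega t\right]}\,J_0(\lambda_n r/a)}
{i\,a^2\sqrt{k^2-\lambda_n^2/a^2}\;J_1^2(\lambda_n)}
\int_0^a f(r')\,J_0(\lambda_n r'/a)\,r'\,dr'$$

the quantities $\lambda_n$ being the roots of the zero-order Bessel function
$J_0(x)$, and “Im” means “imaginary part of.”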


Now unless the constant on the right-hand side is negative, the
disturbance will not be propagated but will die out exponentially with distance
down the tube from the source. The case $l = m = 0$ represents propagation
of a plane wave, and is always possible. But if $\omega$ is less than both
$c\pi/a$ and $c\pi/b$, all other components will be attenuated. Thus, for either
small enough $\omega$ or small enough tube dimensions, or both, the tube will
“filter out” all but a plane wave.
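The filtering condition can be made quantitative with (5.52): the $(l, m)$ mode propagates only when $\omega^2/c^2$ exceeds $l^2\pi^2/a^2 + m^2\pi^2/b^2$. The sketch below is an illustration, not part of the original text; the sound speed and the 0.10 m by 0.05 m cross section are assumed values (in metres rather than the centimetres used above), and the function names are hypothetical.

```python
import math

C = 340.0          # assumed sound speed, m/s
A, B = 0.10, 0.05  # assumed tube cross section, metres


def cutoff_frequency(l, m, a=A, b=B, c=C):
    """Lowest frequency (Hz) at which the (l, m) mode propagates, from (5.52)."""
    return 0.5 * c * math.sqrt((l / a) ** 2 + (m / b) ** 2)


def axial_wavenumber(l, m, omega, a=A, b=B, c=C):
    """k_z with Z ~ exp(i k_z z); an imaginary k_z means exponential attenuation."""
    kz_squared = (omega / c) ** 2 - (l * math.pi / a) ** 2 - (m * math.pi / b) ** 2
    if kz_squared >= 0:
        return complex(math.sqrt(kz_squared), 0.0)
    return complex(0.0, math.sqrt(-kz_squared))


omega = 2 * math.pi * 1000.0  # a 1000 Hz source, below the first cutoff assumed here
for mode in ((0, 0), (1, 0), (0, 1)):
    print(mode, cutoff_frequency(*mode), axial_wavenumber(*mode, omega))
```

For these assumed dimensions only the plane wave $(0, 0)$ has a real axial wave number at 1000 Hz; the other modes are attenuated.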

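The same phenomenon will hold for cylindrical guides. In this case,
the equation is

$$\frac{1}{r}\frac{\partial}{\partial r}\left(r\,\frac{\partial\phi}{\partial r}\right)
+\frac{1}{r^2}\frac{\partial^2\phi}{\partial\theta^2}
+\frac{\partial^2\phi}{\partial z^2} = \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} \tag{5.53}$$

where $r$, $\theta$, and $z$ are cylindrical coordinates. We can again use
separation of variables, putting

$$\phi = R(r)\,e^{in\theta}\,Z(z)\,e^{-i\omega t} \tag{5.54}$$

This leads to the equation

$$\frac{1}{rR}\frac{d}{dr}\left(r\,\frac{dR}{dr}\right)-\frac{n^2}{r^2}+\frac{1}{Z}\frac{d^2Z}{dz^2} = -\frac{\omega^2}{c^2} \tag{5.55}$$

The parts depending on $r$ and $z$ separately are constants, so that

$$\frac{1}{rR}\frac{d}{dr}\left(r\,\frac{dR}{dr}\right)-\frac{n^2}{r^2} = -\alpha^2 \tag{5.56}$$

and

$$\frac{1}{Z}\frac{d^2Z}{dz^2} = \alpha^2-\frac{\omega^2}{c^2} \tag{5.57}$$

Equation (5.56) is a Bessel equation for which the solution which is
regular at the origin is

$$R = J_n(\alpha r) \tag{5.58}$$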


If a is real, this can be made to satisfy the condition dR/dr=0 atr=a,a 
being the radius of the tube. This is so because J,,(ar) is oscillating, so that 
it has a zero derivative for an infinite number of values. But if a is imagin- 
ary, R will be monotonically increasing with r and its derivative will 
never be zero, so that the boundary conditions can never be satisfied. 

The boundary conditions are in fact satisfied for

$$\alpha = \xi_{ns}/a \tag{5.59}$$

$\xi_{ns}$ being the $s$th point at which $J_n'(\alpha a)$ is zero. Equation
(5.57) then becomes

$$\frac{d^2Z}{dz^2} + \left(\frac{\omega^2}{c^2} - \frac{\xi_{ns}^2}{a^2}\right)Z = 0 \tag{5.60}$$

Again, therefore, unless $\xi_{ns} = 0$ (plane wave), propagation will only
take place for sufficiently high $\omega$ or sufficiently large $a$.


244 Propagation and Scattering of Waves 


6. Wave Equation with Sources 


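Let us describe an external disturbance by a “force density”
$\mathbf{F}(\mathbf{r}, t)$. This signifies the force per unit volume acting
on the gas at point $\mathbf{r}$. This may be incorporated into the equations
by adding this quantity to the right-hand side of the Newtonian equation
(5.2), so that that equation becomes

$$\rho\left[\frac{\partial\mathbf{v}}{\partial t} + (\mathbf{v}\cdot\nabla)\mathbf{v}\right] = -\nabla p + \mathbf{F} \tag{5.61}$$

Equation (5.8) is then replaced by

$$\frac{\partial\mathbf{v}}{\partial t} = -c^2\nabla s + \frac{1}{\rho_0}\mathbf{F} \tag{5.62}$$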


Taking the divergence of this equation and using (5.9) we see that

$$\frac{1}{c^2}\frac{\partial^2 s}{\partial t^2} - \nabla^2 s = -\frac{1}{\rho_0 c^2}\nabla\cdot\mathbf{F} \tag{5.63}$$

We may alternatively obtain the equation for the velocity potential $\phi$
provided that $\mathbf{F}$ is conservative and may be written

$$\mathbf{F} = -\nabla U \tag{5.64}$$


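For, writing $\mathbf{v}$ in terms of $\phi$, (5.62) then becomes

$$\nabla\left(\frac{\partial\phi}{\partial t} - c^2 s - \frac{1}{\rho_0}U\right) = 0 \tag{5.65}$$

From this we may deduce

$$\frac{\partial\phi}{\partial t} - c^2 s - \frac{1}{\rho_0}U = 0 \tag{5.66}$$

since in any case $\phi$ is undetermined to within a constant. If we now
differentiate with respect to $t$ and substitute for $\partial s/\partial t$
from (5.9) we are led to the following equation:

$$-\nabla^2\phi + \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = \frac{1}{\rho_0 c^2}\frac{\partial U}{\partial t} \tag{5.67}$$

Either (5.63) or (5.67) will serve as starting points for the theory of sound
waves generated by forces, though (5.63) is more general since it involves
no assumptions about $\mathbf{F}$.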

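Consider now the equation

$$-\nabla^2 f + \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2} = h(\mathbf{r}, t) \tag{5.68}$$

of which (5.63) and (5.67) are special cases. Let us take Laplace
transforms, where the transform of $f(\mathbf{r},t)$ is $F(\mathbf{r},\sigma)$
and that of $h(\mathbf{r},t)$ is $H(\mathbf{r},\sigma)$. The transformed
equation is, then,

$$-\nabla^2 F + \frac{\sigma^2}{c^2}F = H \tag{5.69}$$

where $f$ and $\partial f/\partial t$ are taken to be zero at $t = 0$. We now
take Fourier transforms, $F(\mathbf{k},\sigma)$ and $H(\mathbf{k},\sigma)$
being, respectively, the transforms of $F$ and $H$:

$$(k^2 + \sigma^2/c^2)\,F = H$$

so that

$$F = \frac{1}{k^2 + \sigma^2/c^2}\,H \tag{5.70}$$

We may next use the convolution theorem (4.21) and the fact that the
inverse transform of $1/(k^2 + \sigma^2/c^2)$ is $(1/4\pi r)\,e^{-\sigma r/c}$
to obtain

$$F(\mathbf{r},\sigma) = \frac{1}{4\pi}\int H(\mathbf{r}',\sigma)\,\frac{e^{-(\sigma/c)|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.71}$$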


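We may then take the inverse Laplace transform, making use of the
“shifting theorem” (4.154) to get

$$f(\mathbf{r},t) = \frac{1}{4\pi}\int \frac{h\left(\mathbf{r}',\, t - |\mathbf{r}-\mathbf{r}'|/c\right)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.72}$$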


Having obtained this result, it is not difficult to see its physical
interpretation. Let us look at it in the following way. If the velocity of
propagation $c$ were infinitely great, Eq. (5.68) would merely be a
potential equation with a source $(1/4\pi)h$. The solution could then be
obtained, as in Chapter 3 (3.128), by adding the contributions from the
different parts of the source distribution, that is, we would have

$$f(\mathbf{r},t) = \frac{1}{4\pi}\int \frac{h(\mathbf{r}', t)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.73}$$

The feature which the finite time of propagation introduces is, however,
this: the field at time $t$ depends not on the source at time $t$, but rather
on that at a time earlier than $t$ by the time required to propagate the
effect from $\mathbf{r}'$ to $\mathbf{r}$, that is,
$(1/c)|\mathbf{r}-\mathbf{r}'|$. Consequently, the time $t$ in (5.73) must
be replaced by $t - (1/c)|\mathbf{r}-\mathbf{r}'|$, to give (5.72).
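
As an illustration of the retardation in (5.72), the following sketch evaluates the field of a few point sources, for which the volume integral collapses to a sum. The representation of a source as a `(position, strength_function)` pair and the default sound speed are assumptions of this example, not notation from the text.

```python
import math

def retarded_field(r, t, sources, c=343.0):
    """Field at point r and time t from point sources, following (5.72).

    Each source is (position, q), where q(t) is its strength as a function
    of time, i.e. h(r', t) = sum_i q_i(t) * delta(r' - r_i).
    """
    total = 0.0
    for position, q in sources:
        distance = math.dist(r, position)
        # The source is sampled at the earlier (retarded) time t - distance/c.
        total += q(t - distance / c) / (4.0 * math.pi * distance)
    return total
```

With a source switched on at $t = 0$, an observer at distance $d$ sees nothing until $t = d/c$, after which the field equals the static value $q/4\pi d$.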

It should be noted, also, that we may deduce from (5.72) the “Green’s
function” for propagation in an infinite region. The Green’s function has
the property that the solution can be written

$$f(\mathbf{r},t) = \int G(\mathbf{r},\mathbf{r}';t,t')\,h(\mathbf{r}', t')\,d^3r'\,dt' \tag{5.74}$$

The time integral is over all values $t' < t$. Thus the Green’s function is

$$G(\mathbf{r},\mathbf{r}';t,t') = \frac{1}{4\pi|\mathbf{r}-\mathbf{r}'|}\,\delta\!\left(t - t' - \frac{|\mathbf{r}-\mathbf{r}'|}{c}\right) \tag{5.75}$$

since clearly substitution of this in (5.74) gives (5.72).

The Green’s function for this problem is seen to depend only on
relative space and time coordinates; that is, on
$\mathbf{r}-\mathbf{r}'$ and $t - t'$. This is a consequence of the
invariance of the physical system under translations in space and time, so
that neither the origin of space nor that of time coordinates can enter
into the solution.

This translation invariance, along with the convolution theorems for
Fourier and Laplace transforms, enables us to relate the transforms of the
solution to those of the source. Using the subscripts $f$, $l$ to relate to
Fourier and Laplace transforms, respectively, it follows that

$$F_f(\mathbf{k}, t) = \int G_f(\mathbf{k}, t-t')\,h_f(\mathbf{k}, t')\,dt' \tag{5.76}$$

$$F_l(\mathbf{r},\sigma) = \int G_l(\mathbf{r}-\mathbf{r}',\sigma)\,h_l(\mathbf{r}',\sigma)\,d^3r' \tag{5.77}$$

and

$$F_{fl}(\mathbf{k},\sigma) = G_{fl}(\mathbf{k},\sigma)\,h_{fl}(\mathbf{k},\sigma) \tag{5.78}$$

the quantities in the last equation being transforms in both variables. It
will be useful to record here for later reference the transforms of the
Green’s function:

$$G_f(\mathbf{k}, t) = \frac{c}{k}\sin kct\;\theta(t) \tag{5.79}$$

$$G_l(\mathbf{r},\sigma) = \frac{1}{4\pi r}\,e^{-\sigma r/c} \tag{5.80}$$

[cf. Eq. (5.71)] and

$$G_{fl}(\mathbf{k},\sigma) = \frac{1}{k^2 + \sigma^2/c^2} \tag{5.81}$$

[cf. Eq. (5.70)]. $\theta(t)$ in (5.79) is 1 for $t > 0$ and 0 for $t < 0$.

It should be mentioned, finally, that the time variation can be
analyzed by using a Fourier rather than a Laplace transform (see Chapter
4, Sec. 4). If we take Fourier transforms of the original equation (5.68)
in both space and time variables, and let the Fourier transform in the
time variable of $f(\mathbf{r}, t)$ be

$$f(\mathbf{r},\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty} f(\mathbf{r},t)\,e^{i\omega t}\,dt \tag{5.82}$$

and the double transform

$$F_{ff}(\mathbf{k},\omega) = \frac{1}{(2\pi)^4}\int dt\int d^3r\; f(\mathbf{r},t)\,e^{-i(\mathbf{k}\cdot\mathbf{r}-\omega t)} \tag{5.83}$$


the transform of (5.68) is

$$\left(k^2 - \frac{\omega^2}{c^2}\right)F_{ff} = H_{ff} \tag{5.84}$$

$f$ may be written in terms of its transform $F_{ff}(\mathbf{k},\omega)$ by
the inverse theorem:

$$f(\mathbf{r},t) = \int F_{ff}(\mathbf{k},\omega)\,e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)}\,d^3k\,d\omega \tag{5.85}$$

It is therefore expressed as a superposition of plane waves [cf. (5.34)].
Since

$$F_{ff} = \frac{1}{k^2 - \omega^2/c^2}\,H_{ff} \tag{5.86}$$

the evaluation of the inverse transform is complicated by the existence of
two poles on the real axis. Because by definition $f$ is the convolution (in
both space and time coordinates) of $h(\mathbf{r},t)$ and the Green’s
function $G(\mathbf{r}-\mathbf{r}', t-t')$, $F_{ff}(\mathbf{k},\omega)$ must
be $(2\pi)^4$ times the product of the transforms of $h$ and $G$. Therefore,
the transform of $G$ is

$$G_{ff}(\mathbf{k},\omega) = \left(\frac{1}{2\pi}\right)^4\frac{1}{k^2 - \omega^2/c^2} \tag{5.87}$$
The problem of these singularities is then associated with the inversion 
of the transform of the Green’s function. The resolution of the problem is 
related to the conditions of causality which G must satisfy, and this prob- 
lem has already been discussed in the last chapter. It is instructive, how- 
ever, to look at the problem from a different, and more physical, view- 
point. The wave equation involves only second derivatives with respect 
to t, and its solutions are, therefore, time-reversible. It cannot, therefore, 
by itself, pick out a direction of time such as is required for considerations 
of causality. Nature, however, contains irreversible features because of 
the existence of damping mechanisms. These could be incorporated in 
the wave equation by adding a term $\beta\,(\partial f/\partial t)$ on the left-hand side of
(5.68). Such a term would appear, for instance, if one took account of the 
viscosity of the gas. Problems involving a linear time variation of this 
nature are fully studied in the next chapter. 
In any case, the effect of such a term is to replace the equation (5.86)
by

$$F_{ff} = \frac{1}{k^2 - \omega^2/c^2 - i\omega\beta}\,H_{ff} \tag{5.88}$$

However small $\beta$ may be, the poles in $\omega$ have now been displaced
to points below the real axis by an amount $\beta c^2/2$. Consequently, if
the inverse is calculated with the poles in these positions, the
requirements of causality will be met. After the evaluation, however, we may
let $\beta$ become as small as we choose.

It is possible now to invert (5.88) by using convolution theorems in
both the space and the time variables. The inverse transform of

$$G_{ff}(\mathbf{k},\omega) = \frac{1}{(2\pi)^4}\,\frac{1}{k^2 - \omega^2/c^2 - i\omega\beta}$$

is, as $\beta \to 0$,

$$G(\mathbf{r},t) = \lim_{\beta\to 0}\frac{1}{(2\pi)^4}\int \frac{e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)}}{k^2 - \omega^2/c^2 - i\omega\beta}\,d^3k\,d\omega \tag{5.89}$$


When $t < 0$, the $\omega$ integral can be evaluated by taking a contour
which consists of the real axis and an infinite semicircle about the upper
half-plane, on which the integrand goes exponentially to zero. Since there
are no poles in this half-plane, $G = 0$ for $t < 0$, which is, of course,
the statement of causality. When $t > 0$, the contour has to be taken around
the lower half-plane, and there are two poles. Evaluating the residues, one
obtains

$$G(\mathbf{r},t) = \frac{ic}{2(2\pi)^3}\int \frac{1}{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\left(e^{-ikct} - e^{ikct}\right)d^3k = \frac{c}{(2\pi)^3}\int \frac{1}{k}\,e^{i\mathbf{k}\cdot\mathbf{r}}\sin kct\;d^3k \tag{5.90}$$
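
The residue step can be spelled out as follows. This is a sketch of the intermediate algebra under the conventions of (5.88) and (5.89), not a passage from the original; the labels $\omega_\pm$ are introduced here for convenience.

```latex
% Poles of 1/(k^2 - \omega^2/c^2 - i\omega\beta), both in the lower half-plane:
\omega_\pm = \pm\sqrt{c^2k^2 - \tfrac{1}{4}\beta^2c^4} - \tfrac{i}{2}\beta c^2
\;\longrightarrow\; \pm ck \quad (\beta \to 0).

% For t > 0 the contour closes clockwise below the real axis, so
\int_{-\infty}^{\infty} \frac{e^{-i\omega t}}{k^2 - \omega^2/c^2}\,d\omega
  = -2\pi i \left[ -\frac{c}{2k}\,e^{-ikct} + \frac{c}{2k}\,e^{ikct} \right]
  = \frac{\pi i c}{k}\left(e^{-ikct} - e^{ikct}\right)
  = \frac{2\pi c}{k}\,\sin kct .
```

Multiplying by the prefactor $1/(2\pi)^4$ of (5.89) reproduces both forms of (5.90).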
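Taking $\mathbf{r}$ as polar axis for the $\mathbf{k}$ integration and
evaluating the angular integrals gives

$$G(\mathbf{r},t) = \frac{c}{2\pi^2 r}\int_0^\infty \sin kr\,\sin kct\;dk$$

$$= \frac{c}{4\pi^2 r}\int_0^\infty\left[\cos k(r-ct) - \cos k(r+ct)\right]dk$$

$$= \frac{c}{4\pi r}\left[\delta(r-ct) - \delta(r+ct)\right] \tag{5.91}$$

The second $\delta$ function is, however, zero, since both $t \ge 0$ and
$r \ge 0$, so its argument cannot be zero. Also

$$c\,\delta(r-ct) = c\,\delta(ct-r) = \delta\!\left(t - \frac{r}{c}\right) \tag{5.92}$$

Thus

$$G(\mathbf{r},t) = \frac{1}{4\pi r}\,\delta\!\left(t - \frac{r}{c}\right) \tag{5.93}$$

so that we have again obtained the result (5.75), but this time by using
only Fourier transforms.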

It should be noted that the Green’s function (5.93) is a spherical
wave; that is to say, a solution of the wave equation in spherical
coordinates. It corresponds to the elementary solution $1/r$ in the case of
the potential problem. Just as in that case we can, by separation of
variables, get the most general such solution. Let us now solve that problem.


7. Spherical Waves 


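The wave equation in spherical polar coordinates is

$$\frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2\frac{\partial f}{\partial r}\right) + \frac{1}{r^2}\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial f}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2 f}{\partial\varphi^2}\right] = \frac{1}{c^2}\frac{\partial^2 f}{\partial t^2} \tag{5.94}$$

Particular solutions may be found for which
$f = e^{-i\omega t}\,S(\theta,\varphi)\,R(r)$. For if we substitute this in
(5.94) and divide by $f$, we get

$$\frac{1}{r^2R}\frac{d}{dr}\left(r^2\frac{dR}{dr}\right) + \frac{1}{r^2S}\,\Lambda(S) = -\frac{\omega^2}{c^2}$$

where $\Lambda$ is the differential operator which was studied extensively
in the chapter on potential theory:

$$\Lambda = \frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2} \tag{5.95}$$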


Multiplying the separated equation by $r^2$, we see that it separates into a
part which is a function only of $r$, and another which is a function only
of the angular coordinates. Each must then be a constant. But we saw in the
discussion of the potential problem that, for the solution to be well
behaved as a function of angle,

$$\Lambda(S) = -l(l+1)\,S \tag{5.96}$$

where $l$ is an integer. Normalized solutions of this equation were found to
be

$$S = Y_l^m(\theta,\varphi) \tag{5.97}$$

as defined in Eq. (3.117).

The radial equation now takes the form

$$\frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr} + \left(\frac{\omega^2}{c^2} - \frac{l(l+1)}{r^2}\right)R = 0 \tag{5.98}$$


As noted earlier, when the velocity of propagation $c \to \infty$, the
equation becomes that of potential theory. It is homogeneous and has
solutions $r^l$ and $1/r^{l+1}$. For finite $\omega$, however, it is more
complicated.

For small $r$, however, the $\omega^2/c^2$ term is negligible, so that in
this range the solution has a behavior of this nature. Let us write

$$R(r) = r^l\,u_l(r) \tag{5.99}$$

and determine the equation for $u_l$. By straightforward calculation, we
find it to be

$$r\frac{d^2u_l}{dr^2} + 2(l+1)\frac{du_l}{dr} + k^2 r\,u_l = 0 \tag{5.100}$$

where

$$k = \omega/c \tag{5.101}$$


Since this equation has linear coefficients, we recognize it as one which
can be solved by our generalized Laplace transform method. However,
by a simple transformation, we can reduce it to a Bessel equation. If we
put $kr = \xi$ and

$$u_l = \xi^{-(l+\frac{1}{2})}\,v_l(\xi) \tag{5.102}$$

the equation for $v_l$ is found to be

$$v_l'' + \frac{1}{\xi}\,v_l' + \left(1 - \frac{(l+\frac{1}{2})^2}{\xi^2}\right)v_l = 0 \tag{5.103}$$

The solution is, therefore,

$$R_l = \frac{1}{\sqrt{kr}}\,Z_{l+\frac{1}{2}}(kr) \tag{5.104}$$

where $Z$ is any linear combination of solutions of the Bessel equation of
order $l+\frac{1}{2}$. If the solution must be regular at the origin,
$Z_{l+\frac{1}{2}}$ must be the Bessel function $J_{l+\frac{1}{2}}$.

We may use the standard notation of “spherical Bessel functions”
introduced in the last chapter [Eqs. (4.374) to (4.376)]. We shall extend
the notation introduced there by defining

$$z_l(x) = \sqrt{\frac{\pi}{2x}}\;Z_{l+\frac{1}{2}}(x) \tag{5.105}$$

It should be noticed that $h_l^{(1)}(kr)$ defines a “diverging” spherical
wave, with the property that at distances $r$ such that $kr \gg 1$ the flow
is away from the origin. $h_l^{(2)}(kr)$, on the other hand, represents a
“converging” wave, with energy flowing toward the origin. To demonstrate
these facts, we use the asymptotic forms of these functions for large
argument, which may be derived from (4.377). The asymptotic forms are

$$h_l^{(1)}(kr) \approx e^{-i(l+1)\pi/2}\,\frac{e^{ikr}}{kr} \tag{5.106}$$

and

$$h_l^{(2)}(kr) \approx e^{i(l+1)\pi/2}\,\frac{e^{-ikr}}{kr} \tag{5.107}$$


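At these distances, then, the complete solution is

$$\phi_{\rm out} = A_l\,\frac{e^{i[kr-\omega t-(l+1)\pi/2]}}{kr}\,Y_{lm}(\theta,\varphi) \tag{5.108}$$

in the one case and

$$\phi_{\rm in} = A_l\,\frac{e^{-i[kr+\omega t-(l+1)\pi/2]}}{kr}\,Y_{lm}(\theta,\varphi) \tag{5.109}$$

in the other. Let us calculate the energy flux in terms of these complex
potentials. The radial energy flux has been shown to be [Eq. (5.25)]

$$P = -\rho_0\int \frac{\partial\phi_r}{\partial t}\,\frac{\partial\phi_r}{\partial r}\;r^2\,d\Omega \tag{5.110}$$

where $\phi_r$ is the real part of the complex $\phi$, and the integral is
over solid angle at a given radius. $P$ may be written

$$P = -\tfrac{1}{4}\rho_0\int\left(\frac{\partial\phi}{\partial t} + \frac{\partial\phi^*}{\partial t}\right)\left(\frac{\partial\phi}{\partial r} + \frac{\partial\phi^*}{\partial r}\right)r^2\,d\Omega \tag{5.111}$$

Since $(\partial\phi/\partial t)(\partial\phi/\partial r)$ depends on the
azimuthal angle $\varphi$ as $e^{2im\varphi}$ and
$(\partial\phi^*/\partial t)(\partial\phi^*/\partial r)$ depends on it as
$e^{-2im\varphi}$, both of these terms integrate to zero. Consequently,

$$P = -\tfrac{1}{2}\rho_0\,\mathrm{Re}\int\frac{\partial\phi}{\partial t}\,\frac{\partial\phi^*}{\partial r}\;r^2\,d\Omega \tag{5.112}$$

Now for the case of $\phi = \phi_{\rm out}$

$$\frac{\partial\phi}{\partial t} = -i\omega A_l\,\frac{e^{i[kr-\omega t-(l+1)\pi/2]}}{kr}\,Y_{lm} \tag{5.113}$$

and

$$\frac{\partial\phi^*}{\partial r} = -ikA_l^*\,\frac{e^{-i[kr-\omega t-(l+1)\pi/2]}}{kr}\,Y_{lm}^* + O\!\left(\frac{1}{r^2}\right) \tag{5.114}$$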

By virtue of the normalization of the spherical harmonics, the integral
may be evaluated immediately to give, for $kr \gg 1$,

$$P = \tfrac{1}{2}\rho_0\,\frac{\omega}{k}\,|A_l|^2 = \tfrac{1}{2}\rho_0 c\,|A_l|^2 \tag{5.115}$$

This is positive, corresponding to a flux in the positive direction, that is,
in the direction of increasing $r$.

If we now do the same calculation for $\phi_{\rm in}$, only one thing is
changed, that is, that $k$ is replaced by $-k$. Therefore, in the light of
(5.115), the flux in this case is radially inward.

A general spherical wave with a given frequency is

$$\phi = \sum_{l,m} A_{lm}\,z_l(kr)\,Y_{lm}(\theta,\varphi)\,e^{-i\omega t} \tag{5.116}$$


For an arbitrary frequency spectrum, we have

$$\phi = \sum_{l,m}\int A_{lm}(\omega)\,z_l\!\left(\frac{\omega}{c}r\right)e^{-i\omega t}\,d\omega\;Y_{lm}(\theta,\varphi) \tag{5.117}$$

In the case of axial symmetry (5.116) simplifies to

$$\phi = \sum_l B_l\,z_l(kr)\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.118}$$


8. Expansion of a Plane Wave in 
Spherical Waves 


A plane wave solution for arbitrary frequency is
$e^{i(\mathbf{k}\cdot\mathbf{r}-\omega t)}$. Taking, for simplicity, the polar
axis as the direction of the propagation vector $\mathbf{k}$, this becomes
$e^{i(kz-\omega t)} = e^{ikr\cos\theta}\,e^{-i\omega t}$. Omitting the
time-dependent term, $e^{ikr\cos\theta}$ is an axially symmetric solution of
the wave equation. It must therefore be possible to put

$$e^{ikr\mu} = \sum_l B_l\,j_l(kr)\,P_l(\mu) \tag{5.119}$$

Here we have put $\mu = \cos\theta$ and $z_l = j_l$ because $e^{ikr\mu}$ is
finite at $r = 0$. To determine $B_n$, multiply by $P_n(\mu)$ and integrate
over $\mu$ between $-1$ and $1$. We obtain

$$B_n\,j_n(kr)\,\frac{2}{2n+1} = \int_{-1}^{1} e^{ikr\mu}\,P_n(\mu)\,d\mu \tag{5.120}$$


Problem 5-3: A gas contained in a spherical resonator of radius $R$ has an
initial velocity potential $\phi(r,t) = f_0(r)$ at time $t = 0$, and no
condensation. Show that the velocity potential at time $t$ is given by

$$\phi(r,t) = \frac{2}{R}\sum_n \frac{\sin(\alpha_n r/R)}{r}\;\frac{\cos(\alpha_n vt/R)}{1 - (\sin 2\alpha_n/2\alpha_n)}\int_0^R r' f_0(r')\,\sin\frac{\alpha_n r'}{R}\;dr'$$

the quantities $\alpha_n$ being the roots of $\tan x = x$. $v$ is the
velocity of propagation of sound in the gas.

Problem 5-4: A conical horn has a flexible base of radius $R$ and
semivertical angle $\alpha$. If the velocity of the base $r = R$ is

$$v(R, t) = v_0\sin\omega t$$

show that the velocity potential is given by

$$\phi(r,t) = \frac{v_0R^2}{r}\;\frac{\sin[\omega t - k(r-R)] - kR\cos[\omega t - k(r-R)]}{1 + k^2R^2}$$

where $k = \omega/c$ and $c$ is the velocity of sound. Why does this not
depend on $\alpha$?

$B_n$ may now be most simply obtained by equating coefficients of $(kr)^n$.
Using (4.339) this coefficient in $j_n(kr)\,[2/(2n+1)]$ is

$$\sqrt{\frac{\pi}{2}}\;\frac{1}{2^{n+\frac{1}{2}}\,\Gamma(n+\frac{3}{2})}\;\frac{2}{2n+1} = \sqrt{\frac{\pi}{2}}\;\frac{2^{2n+2}(n+1)!}{2^{n+\frac{1}{2}}\,(2n+2)!\,\sqrt{\pi}}\;\frac{2}{2n+1}$$

$$= \frac{2^{n+2}(n+1)!}{(2n+2)!\,(2n+1)} = \frac{n!\,2^{n+1}}{(2n+1)!\,(2n+1)}$$

where we have used (4.336) for $\Gamma(n+\frac{3}{2})$.
Consider now the right-hand side. The coefficient of $(kr)^n$ is

$$\frac{i^n}{n!}\int_{-1}^{1}\mu^n P_n(\mu)\,d\mu$$

This may be evaluated by using the recursion relation (3.84) for Legendre
polynomials. Each factor $\mu$ yields polynomials of one higher and one
lower order. But the integrals of all polynomials except $P_0 = 1$ are zero;
therefore, the only terms which concern us are the polynomials of lower
order in each instance. The $n$ factors of $\mu$ make it just possible in
this way to obtain a $P_0$ term. From (3.84) it is evident that the
coefficient of $P_0$ in $\mu^n P_n(\mu)$ is

$$\frac{n}{2n+1}\cdot\frac{n-1}{2n-1}\cdots\frac{1}{3} = \frac{2^n (n!)^2}{(2n+1)!}$$

$B_n$ is, therefore, determined by the equation

$$\frac{i^n\,2^{n+1}\,n!}{(2n+1)!} = B_n\,\frac{n!\,2^{n+1}}{(2n+1)!\,(2n+1)}$$

or

$$B_n = i^n(2n+1) \tag{5.121}$$

The expansion of a plane wave in spherical waves is then given by

$$e^{ikz} = \sum_{l=0}^{\infty} i^l\,(2l+1)\,j_l(kr)\,P_l(\cos\theta) \tag{5.122}$$
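
The expansion (5.122) can be checked numerically with a few lines of standard-library Python. The helper names, the truncation order `lmax`, and the use of the power series for $j_l$ are choices made for this sketch; they are not part of the text.

```python
import cmath

def spherical_jn(l, x, terms=60):
    """j_l(x) from its power series x^l * sum_k (-x^2/2)^k / (k! (2l+2k+1)!!)."""
    term = 1.0
    for n in range(1, l + 1):
        term *= x / (2 * n + 1)          # builds x^l / (2l+1)!!
    total = term
    for k in range(terms):
        term *= -(x * x / 2.0) / ((k + 1) * (2 * l + 2 * k + 3))
        total += term
    return total

def legendre(l, mu):
    """P_l(mu) by the standard three-term recurrence."""
    if l == 0:
        return 1.0
    p_prev, p = 1.0, mu
    for n in range(1, l):
        p_prev, p = p, ((2 * n + 1) * mu * p - n * p_prev) / (n + 1)
    return p

def plane_wave_sum(kr, mu, lmax=40):
    """Partial sum of (5.122): sum_l i^l (2l+1) j_l(kr) P_l(mu)."""
    return sum((1j) ** l * (2 * l + 1) * spherical_jn(l, kr) * legendre(l, mu)
               for l in range(lmax + 1))
```

For moderate arguments such as $kr = 5$ the partial sum agrees with `cmath.exp(1j * kr * mu)` to high accuracy; the power series used here loses precision for large $kr$, where a recurrence-based evaluation would be preferable.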


An interesting corollary is the expression for the spherical Bessel
function as an integral involving the Legendre polynomial of the same
order. Multiplying (5.122) on the right by $P_n(\mu)$ and integrating gives

$$2\,i^n\,j_n(kr) = \int_{-1}^{1} e^{ikr\mu}\,P_n(\mu)\,d\mu$$

so that

$$j_n(kr) = \frac{(-i)^n}{2}\int_{-1}^{1} e^{ikr\mu}\,P_n(\mu)\,d\mu \tag{5.123}$$

The expansion (5.122) plays an important role in the theory of scattering
which will be developed subsequently.


9. Radiation from a Periodic Source 


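We have shown, in Eq. (5.72), that the radiation $\phi(\mathbf{r}, t)$ from
a source $\sigma(\mathbf{r}, t)$ is given by

$$\phi(\mathbf{r},t) = \frac{1}{4\pi}\int\frac{\sigma\left(\mathbf{r}',\,t - |\mathbf{r}-\mathbf{r}'|/c\right)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.124}$$

If the time variation of the source is periodic with a frequency $\omega$,
that is, if

$$\sigma(\mathbf{r},t) = \sigma_0(\mathbf{r})\,e^{-i\omega t} \tag{5.125}$$

Eq. (5.124) becomes

$$\phi(\mathbf{r},t) = \frac{e^{-i\omega t}}{4\pi}\int\sigma_0(\mathbf{r}')\,\frac{1}{|\mathbf{r}-\mathbf{r}'|}\exp\left[i\frac{\omega}{c}|\mathbf{r}-\mathbf{r}'|\right]d^3r' \tag{5.126}$$

For regions outside the source, that is, for values of $\mathbf{r}$ such that
$\sigma_0(\mathbf{r}) = 0$, this must have the form of (5.116). In order to
obtain it in that form, however, we must expand

$$\frac{e^{i(\omega/c)|\mathbf{r}-\mathbf{r}'|}}{4\pi|\mathbf{r}-\mathbf{r}'|} \tag{5.127}$$

in terms of spherical harmonics in the angular coordinates of the point
$\mathbf{r}$.

This expansion may be conveniently carried out by writing the function
$e^{i(\omega/c)R}/4\pi R$, with $\mathbf{R} = \mathbf{r}-\mathbf{r}'$, in
terms of its Fourier transform. For, inside the transform integral, the
coordinates enter only through

$$e^{i\mathbf{k}\cdot\mathbf{R}} = e^{i\mathbf{k}\cdot\mathbf{r}}\,e^{-i\mathbf{k}\cdot\mathbf{r}'}$$

But, as we have seen in the previous section, the two terms of this
product may be expanded in terms of the spherical harmonics of the
coordinates of $\mathbf{r}$ and $\mathbf{r}'$, respectively, and we should
therefore obtain the formula sought.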

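First then, what is the Fourier transform of
$(1/4\pi)\,e^{i(\omega/c)R}/R$? Let us first calculate the transform of
$(1/4\pi)\,e^{-\beta R}/R$, where $\beta$ is some complex number with
positive real part. This transform is

$$\frac{1}{(2\pi)^3}\int\frac{e^{-\beta R}}{4\pi R}\,e^{-i\mathbf{k}\cdot\mathbf{R}}\,d^3R$$

Taking $\mathbf{k}$ to be the polar axis for the $\mathbf{R}$ integration,
and integrating over the angular coordinates, we get

$$\frac{1}{(2\pi)^3}\int_0^\infty\frac{e^{-\beta R}}{R}\,\frac{\sin kR}{kR}\,R^2\,dR = \frac{1}{(2\pi)^3}\,\frac{1}{k^2+\beta^2}$$

Therefore,

$$\frac{e^{-\beta R}}{4\pi R} = \frac{1}{(2\pi)^3}\int\frac{e^{i\mathbf{k}\cdot\mathbf{R}}}{k^2+\beta^2}\,d^3k \tag{5.128}$$

Now

$$e^{i\mathbf{k}\cdot\mathbf{r}} = 4\pi\sum_{l,m} i^l\,j_l(kr)\,Y_l^m(\Omega)\,Y_l^{-m}(\Omega_k) \tag{5.129}$$

and

$$e^{-i\mathbf{k}\cdot\mathbf{r}'} = 4\pi\sum_{l',m'} (-i)^{l'}\,j_{l'}(kr')\,Y_{l'}^{-m'}(\Omega')\,Y_{l'}^{m'}(\Omega_k) \tag{5.130}$$

Here we have made use of the formula (3.136) expressing the Legendre
polynomial of the cosine of the angle between two vectors in terms of the
polar angles of the vectors. $\Omega$, $\Omega'$, $\Omega_k$ represent the
polar angles of $\mathbf{r}$, $\mathbf{r}'$, and $\mathbf{k}$, respectively.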

If these equations are substituted in (5.128), the integral over
$\Omega_k$ may be done immediately to give, using the orthogonality of the
$Y_l^m(\Omega_k)$’s,

$$\frac{e^{-\beta R}}{4\pi R} = \frac{2}{\pi}\sum_{l=0}^{\infty}\sum_{m=-l}^{l} Y_l^m(\Omega)\,Y_l^{-m}(\Omega')\int_0^\infty\frac{j_l(kr)\,j_l(kr')}{k^2+\beta^2}\,k^2\,dk \tag{5.131}$$

If, now, $\beta$ is purely imaginary, the integral (5.131) becomes improper.
It may, however, be evaluated for $\beta = -i(\omega/c) + \epsilon$, where
$\epsilon$ is small and will ultimately be allowed to approach zero.

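Let us now suppose that $r > r'$: if the reverse is the case, the
appropriate formula may be obtained by interchanging $r$ and $r'$. The poles
of the integrand in (5.131) are at

$$k = \pm i\beta = \pm\left(\frac{\omega}{c} + i\epsilon\right)$$

We know that

$$j_l(kr) = \tfrac{1}{2}\left[h_l^{(1)}(kr) + h_l^{(2)}(kr)\right] \tag{5.132}$$

But $h_l^{(1)}(kr)$ approaches 0 on a semicircle whose radius approaches
$\infty$ surrounding the upper half-plane, while $h_l^{(2)}(kr)$ behaves
likewise in the lower half-plane. This will also be true for
$h_l^{(1)}(kr)\,j_l(kr')$ and $h_l^{(2)}(kr)\,j_l(kr')$, by virtue of the
fact that $r > r'$. Breaking up $j_l(kr)$ into its two parts, and evaluating
the integrals involving them, respectively, by treating these integrals as
parts of contours which are closed by semicircles of infinite radius
encircling the appropriate half-plane, we find, as $\epsilon \to 0$, that

$$\int_0^\infty\frac{j_l(kr')\,j_l(kr)}{k^2+\beta^2}\,k^2\,dk = \frac{1}{4}\int_{-\infty}^{\infty}\frac{1}{k^2+\beta^2}\,j_l(kr')\left[h_l^{(1)}(kr) + h_l^{(2)}(kr)\right]k^2\,dk$$

$$= \frac{\pi i}{4}\,\frac{\omega}{c}\left[j_l\!\left(\frac{\omega}{c}r'\right)h_l^{(1)}\!\left(\frac{\omega}{c}r\right) + j_l\!\left(-\frac{\omega}{c}r'\right)h_l^{(2)}\!\left(-\frac{\omega}{c}r\right)\right] \tag{5.133}$$

Now

$$j_l\!\left(-\frac{\omega}{c}r'\right) = (-1)^l\,j_l\!\left(\frac{\omega}{c}r'\right)$$

and

$$h_l^{(2)}\!\left(-\frac{\omega}{c}r\right) = (-1)^l\,h_l^{(1)}\!\left(\frac{\omega}{c}r\right)$$

The first follows from (4.377) and the second consequently from (5.132).
Therefore,

$$\int_0^\infty\frac{j_l(kr')\,j_l(kr)}{k^2+\beta^2}\,k^2\,dk = \frac{\pi i}{2}\,\frac{\omega}{c}\,j_l\!\left(\frac{\omega}{c}r'\right)h_l^{(1)}\!\left(\frac{\omega}{c}r\right) \tag{5.134}$$

Substituting this into (5.131) we obtain finally the formula we have been
seeking:

$$\frac{e^{i(\omega/c)R}}{4\pi R} = i\,\frac{\omega}{c}\sum_{l=0}^{\infty}\sum_{m=-l}^{l} Y_l^m(\Omega)\,Y_l^{-m}(\Omega')\,j_l\!\left(\frac{\omega}{c}r'\right)h_l^{(1)}\!\left(\frac{\omega}{c}r\right) \tag{5.135}$$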

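If this is now substituted in (5.126), we obtain the result

$$\phi(\mathbf{r},t) = i\,\frac{\omega}{c}\sum_{l=0}^{\infty}\sum_{m=-l}^{l}\left[\int\sigma_0(\mathbf{r}')\,j_l\!\left(\frac{\omega}{c}r'\right)Y_l^{-m}(\Omega')\,d^3r'\right]h_l^{(1)}\!\left(\frac{\omega}{c}r\right)Y_l^m(\Omega)\,e^{-i\omega t} \tag{5.136}$$

This is now in the form (5.116), as required. The integral is a sort of
radiation multipole moment; the $l$th term in the expansion is designated as
multipole radiation of the $l$th order.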

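In the case in which the dimensions of the source are small compared
with the wavelength of the radiation, i.e., in which $(\omega/c)r' \ll 1$
over the whole source, we may replace $j_l[(\omega/c)r']$ by the leading
term in its series expansion:

$$j_l\!\left(\frac{\omega}{c}r'\right) \approx \sqrt{\frac{\pi}{2(\omega/c)r'}}\;\frac{1}{\Gamma(l+\frac{3}{2})}\left(\frac{\omega r'}{2c}\right)^{l+\frac{1}{2}} = \frac{2^l\,l!}{(2l+1)!}\left(\frac{\omega}{c}r'\right)^l$$

Then

$$\phi(\mathbf{r},t) = i\sum_{l=0}^{\infty}\sum_{m=-l}^{l}\frac{2^l\,l!}{(2l+1)!}\left(\frac{\omega}{c}\right)^{l+1}\left[\int r'^{\,l}\,Y_l^{-m}(\Omega')\,\sigma_0(\mathbf{r}')\,d^3r'\right]h_l^{(1)}\!\left(\frac{\omega}{c}r\right)Y_l^m(\Omega)\,e^{-i\omega t} \tag{5.137}$$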


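This may then be written in terms of the electrostatic multipole moments
defined in (3.142) ($\sigma$ here replaces $\rho$):

$$\phi(\mathbf{r},t) = i\sum_{l,m}\frac{1}{\sqrt{4\pi(2l+1)}}\,\frac{2^l\,l!}{(2l)!}\left(\frac{\omega}{c}\right)^{l+1}Q_l^{-m}\,h_l^{(1)}\!\left(\frac{\omega}{c}r\right)Y_l^m(\Omega)\,e^{-i\omega t} \tag{5.138}$$

Let us calculate the intensity of the radiation at some distance from the
source. The Hankel function can then be replaced by its asymptotic form.
We also pick out the real (physical) part of $\phi$ to get

$$\phi \approx \sum_{l,m}\frac{1}{\sqrt{4\pi(2l+1)}}\,\frac{2^l\,l!}{(2l)!}\left(\frac{\omega}{c}\right)^{l}|Q_l^{-m}|\,|Y_{lm}(\Omega)|\;\frac{1}{r}\cos\left(\frac{\omega}{c}r - \frac{l\pi}{2} - \omega t + m\varphi - \alpha_m\right) \tag{5.139}$$

where we have written

$$Q_l^{-m} = |Q_l^{-m}|\,e^{-i\alpha_m} \tag{5.140}$$

Now, by (5.25) the radial energy flux is

$$P_r = p\,v_r = -\rho_0\,\frac{\partial\phi}{\partial t}\,\frac{\partial\phi}{\partial r} \tag{5.141}$$

where the subscript $r$ indicates radial components. But at large distances
$\partial\phi/\partial t$ decreases like $1/r$; so also does
$\partial\phi/\partial r$, terms coming from the derivative of the $r$ in the
denominator of (5.139) being negligible. Thus

$$\frac{\partial\phi}{\partial t} \approx \sum_{l,m}\frac{1}{\sqrt{4\pi(2l+1)}}\,\frac{2^l\,l!}{(2l)!}\left(\frac{\omega}{c}\right)^{l}|Q_l^{-m}|\,|Y_{lm}(\Omega)|\;\frac{\omega}{r}\sin\left(\frac{\omega}{c}r - \frac{l\pi}{2} - \omega t + m\varphi - \alpha_m\right) \tag{5.142}$$

and

$$\frac{\partial\phi}{\partial r} \approx -\sum_{l',m'}\frac{1}{\sqrt{4\pi(2l'+1)}}\,\frac{2^{l'}\,l'!}{(2l')!}\left(\frac{\omega}{c}\right)^{l'+1}|Q_{l'}^{-m'}|\,|Y_{l'm'}(\Omega)|\;\frac{1}{r}\sin\left(\frac{\omega}{c}r - \frac{l'\pi}{2} - \omega t + m'\varphi - \alpha_{m'}\right) \tag{5.143}$$

The total energy flowing from the source is

$$-\frac{dU}{dt} = \left\langle\int P_r\,r^2\,d\Omega\right\rangle \tag{5.144}$$

where $\langle\;\rangle$ designates a time average.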


If we first perform the $\varphi$ integral, we find that the only products
which do not yield zero are those for which $m = m'$; when $m = m'$, the
value of the $\varphi$ integral is $\pi$. Next, the integral over $\theta$
may be done. By virtue of the fact that

$$\int_0^\pi |Y_l^m(\Omega)|\,|Y_{l'}^m(\Omega)|\,\sin\theta\,d\theta = \frac{1}{2\pi}\,\delta_{ll'} \tag{5.145}$$

as may be seen from (3.128), the rate of radiation of energy is

$$-\frac{dU}{dt} = \frac{\rho_0 c}{2}\sum_{l,m}\frac{1}{4\pi(2l+1)}\left[\frac{2^l\,l!}{(2l)!}\right]^2\left(\frac{\omega}{c}\right)^{2l+2}|Q_{lm}|^2 \tag{5.146}$$

The terms of different $l$ correspond to what is known as radiation of
different multipole order; $l = 0$ corresponds to monopole radiation,
$l = 1$ to dipole, $l = 2$ to quadrupole, etc. Since the $l$th term may be
expressed in terms of a dimensionless quantity $(\omega a/c)^{2l+2}$, where
$a$ is the radius of the radiating source and $c/\omega = \lambda/2\pi$,
$\lambda$ being the wavelength of the radiation, the multipole expansion is
in fact an expansion in powers of $(2\pi a/\lambda)^2$. If, therefore, the
source is small compared with the wavelength, the intensity of the radiation
decreases rapidly with the multipole order. A situation of this sort will be
seen later to apply to electromagnetic radiation from atoms and from nuclei.
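
To make the scaling concrete, the dimensionless factor that multiplies the $l$th term can be tabulated. This sketch keeps only the factor $(\omega a/c)^{2l+2}$ discussed above and ignores the numerical coefficients and moments in (5.146); the function name and arguments are assumptions of the example.

```python
import math

def multipole_size_factor(l, a, wavelength):
    """Dimensionless factor (omega*a/c)^(2l+2) = (2*pi*a/lambda)^(2l+2)."""
    ka = 2.0 * math.pi * a / wavelength
    return ka ** (2 * l + 2)
```

For a source with $2\pi a/\lambda = 0.1$, each step up in multipole order reduces this factor by $10^{-2}$.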

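The formula (5.146) is valid even when $(\omega/c)a$ is not small, though
the form of $Q_l^{-m}$ is different in this case; it is, in fact,

$$Q_l^{-m} = \sqrt{4\pi(2l+1)}\;\frac{(2l)!}{2^l\,l!}\left(\frac{c}{\omega}\right)^{l}\int\sigma_0(\mathbf{r}')\,j_l\!\left(\frac{\omega}{c}r'\right)Y_l^{-m}(\Omega')\,d^3r' \tag{5.147}$$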


Problem 5-5: The center of a sphere of radius $a$ undergoes vibrations
given by $Z = Z_0\,e^{-i\omega t}$, where $Z_0$ is small. Show that the
boundary condition at its surface is given approximately by

$$\left(\frac{\partial\phi}{\partial r}\right)_{r=a} = i\omega Z_0\cos\theta\;e^{-i\omega t}$$

Problem 5-6: Show that the solution of the wave equation satisfying all the
conditions of the problem is

$$\phi = \frac{icZ_0}{h_1^{(1)\prime}(ka)}\,h_1^{(1)}(kr)\cos\theta\;e^{-i\omega t}$$

where $k = \omega/c$.

Problem 5-7: Verify that the pressure is

$$p = \frac{\rho_0 c\,\omega Z_0}{h_1^{(1)\prime}(ka)}\,h_1^{(1)}(kr)\cos\theta\;e^{-i\omega t}$$

and that the net force of reaction on the sphere is
$-2\pi a^2\int_0^\pi p\cos\theta\,\sin\theta\,d\theta$ in the $z$ direction,
that is,

$$F_z = -\frac{4}{3}\pi a^2\,\rho_0 c\,\omega Z_0\,\frac{h_1^{(1)}(ka)}{h_1^{(1)\prime}(ka)}\;e^{-i\omega t}$$

Problem 5-8: By calculating the physical (real) part of $F_z$, show that
this force has a component proportional to the velocity of the sphere and
another proportional to its acceleration. They are, respectively,

$$\frac{4}{3}\pi a^2\,\rho_0 c\,\omega Z_0\,\frac{(ka)^4}{4+(ka)^4}\,\sin\omega t$$

and

$$\frac{4}{3}\pi a^2\,\rho_0 c\,\omega Z_0\,\frac{ka\,[2+(ka)^2]}{4+(ka)^4}\,\cos\omega t$$

whereas the corresponding components of velocity and acceleration of
the center of the sphere are

$$-\omega Z_0\sin\omega t \qquad\text{and}\qquad -\omega^2 Z_0\cos\omega t$$

Problem 5-9: Prove that the total radiated energy is

$$\frac{2}{3}\pi\,\frac{\rho_0 c^3 Z_0^2}{|h_1^{(1)\prime}(ka)|^2} = \frac{2}{3}\pi\,\rho_0 c^3 Z_0^2\,\frac{(ka)^6}{4+(ka)^4}$$


10. Time-Varying Source 


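In the preceding section, we have dealt with the case of a simply
periodic source. If it has an arbitrary time variation, the problem is no
more difficult to solve. Consider, for example, (5.63), where we designate
the right-hand side as $\sigma(\mathbf{r}, t)$. We may make a Fourier
analysis

$$\sigma(\mathbf{r},t) = \int_{-\infty}^{\infty}\sigma_0(\mathbf{r},\omega)\,e^{-i\omega t}\,d\omega \tag{5.148}$$

where

$$\sigma_0(\mathbf{r},\omega) = \frac{1}{2\pi}\int_{-\infty}^{\infty}\sigma(\mathbf{r},t)\,e^{i\omega t}\,dt \tag{5.149}$$

Then, the solution may be written

$$s(\mathbf{r},t) = \frac{1}{4\pi}\int\frac{\sigma\left(\mathbf{r}',\,t - |\mathbf{r}-\mathbf{r}'|/c\right)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.150}$$

Introducing into this the Fourier transform (5.148), we find that

$$s(\mathbf{r},t) = \frac{1}{4\pi}\int\!\!\int_{-\infty}^{\infty}\frac{\sigma_0(\mathbf{r}',\omega)}{|\mathbf{r}-\mathbf{r}'|}\,e^{-i\omega(t-|\mathbf{r}-\mathbf{r}'|/c)}\,d\omega\,d^3r'$$

$$= \frac{1}{4\pi}\int_{-\infty}^{\infty}e^{-i\omega t}\,d\omega\int\sigma_0(\mathbf{r}',\omega)\,\frac{e^{(i\omega/c)|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.151}$$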




The inner integral is simply the solution for a simple periodic source, as 
calculated in the last section. The solution to the present problem, is, 
therefore, obtained by superimposing the solutions from such sources. 


Problem 5-10: Prove that the total radiated inten- 
sity is made up of separate contributions from each frequency, and that 
interference terms vanish. (Note that this is only true if the radiated 
intensity is integrated over time.) 


Problem 5-11: Determine the solution to the wave 
equation when the source is zero for negative ¢ and has a time variation 
e-“ for positive f. 


11. Radiation from a Moving Source 


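Consider a point source of constant strength moving with a uniform
velocity $\mathbf{u}$. This may be treated by the method of the previous
section. In this case

$$\sigma(\mathbf{r},t) = S\,\delta(\mathbf{r}-\mathbf{u}t) \tag{5.152}$$

and, therefore,

$$\sigma_0(\mathbf{r},\omega) = \frac{S}{2\pi}\int_{-\infty}^{\infty}\delta(\mathbf{r}-\mathbf{u}t)\,e^{i\omega t}\,dt \tag{5.153}$$

It is convenient to take $\mathbf{u}$ in the $z$ direction. Then

$$\sigma_0(\mathbf{r},\omega) = \frac{S}{2\pi u}\,\delta(x)\,\delta(y)\,e^{i\omega z/u} \tag{5.154}$$

Letting $\mathbf{z}'$ be a vector of magnitude $z'$ in the $z$ direction,
and putting

$$s(\mathbf{r},t) = \frac{S}{8\pi^2 u}\int d\omega\,e^{-i\omega t}\int\delta(x')\,\delta(y')\,e^{i\omega z'/u}\,\frac{e^{(i\omega/c)|\mathbf{r}-\mathbf{r}'|}}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.155}$$

we may do the $x'$, $y'$, and $\omega$ integrals to get

$$s(\mathbf{r},t) = \frac{S}{4\pi u}\int\frac{1}{|\mathbf{r}-\mathbf{z}'|}\;\delta\!\left(t - \frac{z'}{u} - \frac{|\mathbf{r}-\mathbf{z}'|}{c}\right)dz' \tag{5.156}$$

In the first place, then, we note that $s$ will be different from zero only
if the argument of the $\delta$ function is zero, which requires that

$$ut - z' = \frac{u}{c}\,|\mathbf{r}-\mathbf{z}'| \tag{5.157}$$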


This relation may be represented graphically, as in Fig. 5.1. For an
arbitrary point $z'$, points $\mathbf{r}$ satisfying (5.157) lie within a
sphere of radius $(c/u)\,PQ$ of $P(z')$. We consider two cases. If $u < c$,
that is, if the source is moving with a speed less than the velocity of
propagation of sound, this sphere may have arbitrarily large radius as
$z' \to -\infty$, and so $s \ne 0$ everywhere. The radiation at large
distances will, however, originate from the source when it was at large
negative $z'$. This situation is represented by the large (dotted) circle of
diameter $SPT$.

Figure 5.1

In the case $u > c$ (source moving with supersonic velocity), (5.157)
will be satisfied for a given $z'$ for $\mathbf{r}$ inside the sphere
represented by the radius $PR < PQ$. If we put

$$\frac{c}{u} = \frac{PR}{PQ} = \sin\alpha \tag{5.158}$$

we see that $s \ne 0$ only inside the cone of semivertical angle $\alpha$
with vertex at $Q$ ($z = ut$). This is the “bow wave” effect which is a
familiar phenomenon (in a two-dimensional geometry) of water waves, where
the velocity of wave propagation is very small. We discuss the analogous
situation for electromagnetic waves in a dielectric medium (Cerenkov
radiation) in the next section.
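
Relation (5.158) is easy to evaluate numerically. The following sketch, with a hypothetical function name, returns the semivertical angle of the cone for a supersonic source.

```python
import math

def mach_cone_half_angle(u, c):
    """Semivertical angle alpha of the bow-wave cone, from sin(alpha) = c/u (5.158)."""
    if u <= c:
        raise ValueError("a cone forms only for a supersonic source (u > c)")
    return math.asin(c / u)
```

A source moving at twice the speed of sound gives $\alpha = 30°$; as $u$ approaches $c$ from above the cone opens out toward $90°$.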


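To evaluate (5.156), it is necessary to find the values of $z'$ for which
the argument of the $\delta$ function is zero. Squaring (5.157) we obtain

$$z'^2\left(1 - \frac{u^2}{c^2}\right) - 2z'\left(ut - \frac{u^2}{c^2}z\right) + u^2\left(t^2 - \frac{r^2}{c^2}\right) = 0 \tag{5.159}$$

which has the solutions

$$z'_{1,2} = \left[ut - \frac{u^2}{c^2}z \pm \frac{u}{c}\sqrt{\xi^2\left(1 - \frac{u^2}{c^2}\right) + (ut-z)^2}\;\right]\Big/\left(1 - \frac{u^2}{c^2}\right) \tag{5.160}$$

where $\xi^2 = r^2 - z^2$. In the case $u > c$ this gives real solutions only
when

$$ut - z > \xi\sqrt{\frac{u^2}{c^2} - 1} = \xi\cot\alpha \tag{5.161}$$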
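which again defines the interior of the cone shown in the figure. Now

$$\int g(z')\,\delta(f(z'))\,dz' = \int g(z')\,\delta(f(z'))\,\frac{df}{|f'(z')|} = \sum_i\frac{g(z_i)}{|f'(z_i)|} \tag{5.162}$$

where $z_i$ are the zeros of $f(z')$. But

$$f'(z') = -\frac{1}{u} + \frac{z - z'}{c\,|\mathbf{r}-\mathbf{z}'|}$$

Therefore,

$$\frac{1}{|\mathbf{r}-\mathbf{z}'|\,|f'(z')|} = \frac{u}{\big|\,|\mathbf{r}-\mathbf{z}'| + (u/c)(z'-z)\big|} = \frac{u}{\big|ct - (c/u)z' + (u/c)(z'-z)\big|} \tag{5.163}$$

when $z'$ satisfies (5.157). If now $z'$ is taken to be one of the values
given by (5.160),

$$ct - \frac{c}{u}z' + \frac{u}{c}(z'-z) = ct - \frac{u}{c}z - \frac{c}{u}\left(1 - \frac{u^2}{c^2}\right)z' = \mp\sqrt{\xi^2\left(1 - \frac{u^2}{c^2}\right) + (ut-z)^2} \tag{5.164}$$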

We must now check to see whether one of the roots (5.160) might not
be an extraneous one resulting from the squaring of (5.157). It is clear
from (5.156) that for true roots $t - (z'/u)$ must be positive. Now from
(5.160) it is easily seen that

$$t - \frac{z'}{u} = \frac{1}{c}\;\frac{(u/c)(z-ut) \mp \sqrt{\xi^2\left(1 - (u^2/c^2)\right) + (ut-z)^2}}{1 - (u^2/c^2)}$$

If $u > c$ the denominator is negative. On the other hand, in the numerator
the first term is greater than the square root. Therefore, the minus sign
does correspond to an extraneous root. Likewise, if $u < c$, the
denominator is positive and the square root is greater in magnitude than the
first term, so that again we must reject the contribution from the term with
the minus sign.

It follows, then, that only one root of (5.157) gives a contribution to
(5.156); using (5.162), (5.163), and (5.164), we find that

$$s(\mathbf{r},t) = \frac{S}{4\pi}\;\frac{1}{\sqrt{\xi^2\left(1 - (u^2/c^2)\right) + (ut-z)^2}} \tag{5.165}$$

This is then the form of the solution in general when $u < c$, and inside
the cone mentioned earlier when $u > c$. It should be noted that in the
latter case $s$ becomes infinite on the surface of the cone, which violates
the basic condition (namely, that of small $s$) on which our equations were
derived. We cannot, therefore, believe the solution in detail in the
neighborhood of the surface of the cone. No such limitation appears when we
treat the corresponding electromagnetic problem, due to the presumably exact
character of the wave functions in that case.
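
The result can be evaluated directly. The sketch below implements (5.165) exactly as written in the text, together with the cone condition (5.161) for the supersonic case; the function name, argument order, and default strength are assumptions of this example.

```python
import math

def moving_source_field(xi, z, t, u, c, strength=1.0):
    """Condensation s from (5.165) for a point source moving along z with speed u.

    xi is the distance from the z axis. For u > c the field is taken to be
    zero outside the cone defined by (5.161).
    """
    gamma = 1.0 - (u / c) ** 2
    if u > c and (u * t - z) <= xi * math.sqrt(-gamma):
        return 0.0  # outside the bow-wave cone
    radicand = xi * xi * gamma + (u * t - z) ** 2
    return strength / (4.0 * math.pi * math.sqrt(radicand))
```

Setting `u = 0` recovers the static potential $S/4\pi r$, which is a convenient check on the prefactor.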


Problem 5-12: Show that the solution due to a point source of strength
$S_0$ moving according to the general law

$$\mathbf{r} = \boldsymbol{\xi}(t)$$

is

$$s(\mathbf{r},t) = \frac{S_0}{4\pi}\;\frac{1}{\left|R(\tau) - (1/c)\,\mathbf{v}(\tau)\cdot\mathbf{R}(\tau)\right|}$$

where

$$\mathbf{R} = \mathbf{r} - \boldsymbol{\xi}, \qquad \mathbf{v} = d\boldsymbol{\xi}/dt$$

and $\tau(t)$ is the time satisfying

$$|\mathbf{r} - \boldsymbol{\xi}(\tau)| = c\,(t-\tau)$$

(this is the equivalent in sound wave theory of the Lienard-Wiechert
potential of electromagnetic theory).

Use the above result to solve the problem of the source moving with
uniform velocity.


Problem 5-13: Use the result of the previous problem to solve the problem
of a sound wave generated by a point source moving periodically according to
the law

$$z = z_0\cos\omega t$$

in the $z$ direction.




12. Solution with Initial and 
Boundary Conditions 


Let us consider the problem of solving the equation for the propagation
of sound waves for given initial and boundary conditions. In particular, we
suppose that the condensation $s(\mathbf{r},t)$ is given to have the value
$s_0(\mathbf{r}, t_0)$ at an initial time $t = t_0$, and that the normal
velocity is given over given boundary surfaces which we designate as $S$.
Thus we take

$$\frac{\partial\phi}{\partial t} = c^2 s_0 \quad\text{and}\quad \mathbf{v} = 0 \quad\text{at}\quad t = t_0 \tag{5.166}$$

$$-\mathbf{n}\cdot\nabla\phi = -\frac{\partial\phi}{\partial n} = v_n \quad\text{on } S \tag{5.167}$$


Before showing that it is possible to express the solution in terms of
Green's functions, we prove the "reciprocity theorem" for those Green's
functions; that is,

$$ G(\mathbf{r}', \mathbf{r}'', t' - t'') = G(\mathbf{r}'', \mathbf{r}', t' - t'') \tag{5.168} $$

This says that the solution at $\mathbf{r}'$, $t'$ due to an instantaneous
point source at $\mathbf{r}''$, $t''$ is the same as that at $\mathbf{r}''$,
$t'$ due to an instantaneous point source at $\mathbf{r}'$, $t''$.

This is a trivial result in an infinite medium, since, due to translation
invariance, the Green's function can then only depend on
$|\mathbf{r}' - \mathbf{r}''|$. However, we want to consider Green's
functions satisfying boundary conditions of the form
$(\partial G/\partial n) - \alpha G = 0$ on prescribed surfaces. (Particular
cases are $\partial G/\partial n = 0$, corresponding to $\alpha = 0$, and
$G = 0$, corresponding to $\alpha \to \infty$.)

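The result (5.168) may be proven as follows: The Green's functions
$G(\mathbf{r}, \mathbf{r}', t' - t)$ and $G(\mathbf{r}, \mathbf{r}'', t - t'')$
satisfy the equations

$$ \frac{1}{c^2}\frac{\partial^2}{\partial t^2} G(\mathbf{r}, \mathbf{r}', t'-t) - \nabla^2 G(\mathbf{r}, \mathbf{r}', t'-t) = \delta(\mathbf{r}-\mathbf{r}')\,\delta(t'-t) $$

and

$$ \frac{1}{c^2}\frac{\partial^2}{\partial t^2} G(\mathbf{r}, \mathbf{r}'', t-t'') - \nabla^2 G(\mathbf{r}, \mathbf{r}'', t-t'') = \delta(\mathbf{r}-\mathbf{r}'')\,\delta(t-t'') $$

If we multiply each of these equations by the solution of the other,
subtract, and integrate, we obtain, on integration by parts,

$$ \frac{1}{c^2}\int d^3r \left[ G(\mathbf{r},\mathbf{r}'',t-t'')\,\frac{\partial}{\partial t} G(\mathbf{r},\mathbf{r}',t'-t) - G(\mathbf{r},\mathbf{r}',t'-t)\,\frac{\partial}{\partial t} G(\mathbf{r},\mathbf{r}'',t-t'') \right]_{t = t''-0}^{t = t'+0} $$
$$ {} + \int\!\!\int \left[ G(\mathbf{r},\mathbf{r}',t'-t)\,\frac{\partial}{\partial n} G(\mathbf{r},\mathbf{r}'',t-t'') - G(\mathbf{r},\mathbf{r}'',t-t'')\,\frac{\partial}{\partial n} G(\mathbf{r},\mathbf{r}',t'-t) \right] dS\,dt $$
$$ {} = G(\mathbf{r}',\mathbf{r}'',t'-t'') - G(\mathbf{r}'',\mathbf{r}',t'-t'') \tag{5.169} $$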


We have assumed that $t' > t''$. The time integral is taken from a time
slightly less than $t''$ to one slightly greater than $t'$; the surface
integral in the second term is over the bounding surfaces on which the
boundary conditions are prescribed. The integral at infinity is assumed to
vanish. The normal derivatives are drawn outward from the region in which
we are solving, that is, into the bounding surfaces.

If the boundary conditions are of the form 


$$ \frac{\partial G}{\partial n} - \alpha G = 0 \tag{5.170} $$


the second term on the left is zero. The first term is zero since the Green’s 
function is zero for negative time argument (causality condition). Thus, 
the right-hand side is zero and the reciprocity theorem is proven. 

We now show how the potential can be expressed in terms of Green’s 
functions and the initial and boundary conditions. 


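When there is a source $S_0(\mathbf{r}, t)$ in the potential equation we
may write that equation as

$$ -\nabla'^2 \phi(\mathbf{r}', t') + \frac{1}{c^2}\frac{\partial^2}{\partial t'^2}\phi(\mathbf{r}', t') = S_0(\mathbf{r}', t') \tag{5.171} $$

We combine with this the Green's function equation

$$ -\nabla'^2 G(\mathbf{r}, \mathbf{r}', t-t') + \frac{1}{c^2}\frac{\partial^2}{\partial t'^2} G(\mathbf{r}, \mathbf{r}', t-t') = \delta(\mathbf{r}-\mathbf{r}')\,\delta(t-t') \tag{5.172} $$

In the second equation we have used the reciprocity theorem.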

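We now multiply the first equation by $G$, the second by $\phi$, and
subtract. We then integrate as follows: with respect to $\mathbf{r}'$, over
the whole space outside the bounding surfaces, and with respect to $t'$
from $t_0$ to $t_+$ (a value slightly greater than $t$). We then use Green's
theorem to transform the volume integrals of the Laplacian terms into
surface integrals, to get

$$ \int_{t_0}^{t_+} dt' \int dS' \left[ \phi(\mathbf{r}',t')\,\frac{\partial}{\partial n'} G(\mathbf{r},\mathbf{r}',t-t') - G(\mathbf{r},\mathbf{r}',t-t')\,\frac{\partial}{\partial n'}\phi(\mathbf{r}',t') \right] $$
$$ {} + \frac{1}{c^2}\int d^3r' \left[ G(\mathbf{r},\mathbf{r}',t-t')\,\frac{\partial}{\partial t'}\phi(\mathbf{r}',t') - \phi(\mathbf{r}',t')\,\frac{\partial}{\partial t'} G(\mathbf{r},\mathbf{r}',t-t') \right]_{t'=t_0}^{t'=t_+} $$
$$ {} = \int\!\!\int d^3r'\,dt'\, S_0(\mathbf{r}',t')\,G(\mathbf{r},\mathbf{r}',t-t') - \phi(\mathbf{r},t) \tag{5.173} $$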


If we define the Green's function by the boundary condition
$\partial G/\partial n = 0$ over the bounding surfaces, substitute the
boundary condition (5.167) and the initial condition (5.166), and choose the
potential $\phi$ to be zero at $t_0$ (since the potential is in any case
only determined to within a constant), we obtain for $\phi(\mathbf{r}, t)$
the formula

$$ \phi(\mathbf{r},t) = \int\!\!\int S_0(\mathbf{r}',t')\,G(\mathbf{r},\mathbf{r}',t-t')\,d^3r'\,dt' - \int_{t_0}^{t} dt' \int dS'\, v_0(\mathbf{r}',t')\,G(\mathbf{r},\mathbf{r}',t-t') $$
$$ {} + \int G(\mathbf{r},\mathbf{r}',t-t_0)\,s_0(\mathbf{r}',t_0)\,d^3r' \tag{5.174} $$


The first term is the expected source term, the second is a contribution 
from the boundaries, showing that they act like sources, and the third is 
the contribution from the initial disturbance. All contributions are deter- 
mined in terms of the same Green’s function. 

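If the medium is taken to be infinite and source-free but initially in
motion, the corresponding terms in (5.173) may be neglected and the $G$
may be taken to be that of Eq. (5.75), namely

$$ G(\mathbf{r},\mathbf{r}',t-t') = \frac{1}{4\pi|\mathbf{r}-\mathbf{r}'|}\,\delta\!\left(t - t' - \frac{|\mathbf{r}-\mathbf{r}'|}{c}\right) $$

This may be substituted into (5.173) to give

$$ \phi(\mathbf{r},t) = \frac{1}{4\pi}\int \frac{\delta\!\left(t - t_0 - |\mathbf{r}-\mathbf{r}'|/c\right)}{|\mathbf{r}-\mathbf{r}'|}\, s(\mathbf{r}',t_0)\,d^3r' $$
$$ {} - \frac{1}{4\pi c^2}\int \frac{\phi(\mathbf{r}',t_0)}{|\mathbf{r}-\mathbf{r}'|}\,\frac{\partial}{\partial t_0}\,\delta\!\left(t - t_0 - \frac{|\mathbf{r}-\mathbf{r}'|}{c}\right) d^3r' \tag{5.175} $$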


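Putting $\mathbf{r}' - \mathbf{r} = \mathbf{R}$, the first integral may be
written

$$ \frac{1}{4\pi}\int \frac{1}{R}\,\delta\!\left(t - t_0 - \frac{R}{c}\right) s(\mathbf{r}+\mathbf{R}, t_0)\,d^3R $$

But

$$ \frac{1}{4\pi}\int s(\mathbf{r}+\mathbf{R}, t_0)\,d\Omega_R $$

where $d\Omega_R$ is the element of solid angle subtended by the vector
$\mathbf{R}$, is the average value of $s$ over a sphere of radius $R$ about
the point $\mathbf{r}$; we call it $\bar{s}_R(\mathbf{r}, t_0)$. Using this
notation, the integral becomes

$$ \int_0^\infty \delta\!\left(t - t_0 - \frac{R}{c}\right) R\,\bar{s}_R(\mathbf{r}, t_0)\,dR = c^2 (t - t_0)\,\bar{s}_{c(t-t_0)}(\mathbf{r}, t_0) $$

To evaluate the second term in (5.175), we note that

$$ \frac{\partial}{\partial t_0}\,\delta\!\left(t - t_0 - \frac{R}{c}\right) = c\,\frac{\partial}{\partial R}\,\delta\!\left(t - t_0 - \frac{R}{c}\right) $$

This term then takes the form

$$ -\frac{1}{4\pi c}\int \frac{1}{R}\,\phi(\mathbf{r}+\mathbf{R}, t_0)\,\frac{\partial}{\partial R}\,\delta\!\left(t - t_0 - \frac{R}{c}\right) d^3R $$
$$ {} = -\frac{1}{c}\int_0^\infty \bar{\phi}_R(\mathbf{r}, t_0)\,R\,\frac{\partial}{\partial R}\,\delta\!\left(t - t_0 - \frac{R}{c}\right) dR $$
$$ {} = \frac{1}{c}\int_0^\infty \delta\!\left(t - t_0 - \frac{R}{c}\right) \frac{\partial}{\partial R}\left( R\,\bar{\phi}_R(\mathbf{r}, t_0) \right) dR $$

or

$$ \left[ \frac{\partial}{\partial R}\left( R\,\bar{\phi}_R(\mathbf{r}, t_0) \right) \right]_{R = c(t-t_0)} $$

Substituting the values of the two integrals back into (5.175), we obtain
the formula

$$ \phi(\mathbf{r},t) = \left[ c R\,\bar{s}_R(\mathbf{r}, t_0) + \frac{\partial}{\partial R}\left( R\,\bar{\phi}_R(\mathbf{r}, t_0) \right) \right]_{R = c(t-t_0)} \tag{5.176} $$

which expresses the velocity potential at a point in terms of the average of
appropriate quantities (namely, $\phi$ and $s = (1/c^2)(\partial\phi/\partial t)$)
over a sphere of radius $R$ surrounding the point at a time earlier by an
amount equal to the time required for a sound disturbance to travel from any
point on this sphere to the point in question.


Problem 5-14: Solve for the velocity in the sound
wave subject to the conditions:
(a) normal components of velocity $\mathbf{V}\cdot\mathbf{n}_i = V_i(t)$ given
on the ith surface;
(b) initial condensation $s_0(\mathbf{r})$ given;
(c) initial velocity $\mathbf{V}_0(\mathbf{r})$ given.
In terms of the Green's function satisfying $G = 0$ on the bounding
surfaces, show that the velocity is

$$ \mathbf{V}(\mathbf{r},t) = -\int_0^t dt' \sum_i \int_{S_i} V_i(t')\,\mathbf{n}_i\,\frac{\partial}{\partial n'} G(\mathbf{r},\mathbf{r}',t-t')\,dS' + \int d^3r'\, s_0(\mathbf{r}')\,\nabla' G(\mathbf{r},\mathbf{r}',t) - \frac{1}{c^2}\int d^3r'\, \mathbf{V}_0(\mathbf{r}')\left[\frac{\partial}{\partial t'} G(\mathbf{r},\mathbf{r}',t-t')\right]_{t'=0} $$

the initial time being $t = 0$.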
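A small numerical check of (5.176) may help fix ideas. The sketch below assumes a medium initially at rest in the sense $s = 0$ at $t_0 = 0$, with an assumed Gaussian initial potential $\phi(\mathbf{r}', 0) = e^{-r'^2}$; (5.176) then reduces to $\phi = \partial(R\bar{\phi}_R)/\partial R$ at $R = ct$. The spherical mean is computed by quadrature and compared with the standard spherically symmetric solution $[(r-ct)f(|r-ct|) + (r+ct)f(r+ct)]/2r$. All function names and parameter values are illustrative.

```python
import math

def f(rho):
    """Assumed initial potential profile phi(r', 0) = exp(-rho^2), rho = |r'|."""
    return math.exp(-rho * rho)

def spherical_mean(r, R, n=4000):
    """Average of f over a sphere of radius R whose centre is a distance r from the origin."""
    total = 0.0
    for i in range(n):
        mu = -1.0 + (i + 0.5) * 2.0 / n        # midpoint rule in mu = cos(polar angle)
        total += f(math.sqrt(r * r + R * R + 2.0 * r * R * mu))
    return total / n                            # (1/2) * integral over mu

def phi_from_means(r, t, c=1.0, h=1e-4):
    """Second term of (5.176): d/dR [R * mean_R] at R = c t (central difference)."""
    R = c * t
    g = lambda q: q * spherical_mean(r, q)
    return (g(R + h) - g(R - h)) / (2.0 * h)

def phi_exact(r, t, c=1.0):
    """Spherically symmetric solution with phi(r, 0) = f(r) and zero initial time derivative."""
    a, b = r - c * t, r + c * t
    return (a * f(abs(a)) + b * f(b)) / (2.0 * r)

if __name__ == "__main__":
    print(phi_from_means(1.3, 0.7), phi_exact(1.3, 0.7))
```

The agreement is limited only by the quadrature and finite-difference step sizes chosen here.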


Problem 5-15: Calculate the infinite-medium
Green's function for the one-dimensional ("plane") problem

$$ \frac{\partial^2\phi}{\partial z^2} - \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = 0 $$


Problem 5-16: Using the relation (3.178) relating
"point source" to "plane source" solutions of a linear equation, obtain the
formula (5.93) for the three-dimensional Green's function.


Problem 5-17: Using (3.180), obtain the "spherical
shell" Green's function (that is, the solution for a unit spherical shell
source).


13. Waves in Guides and Enclosures 


Sound waves in gases inside an enclosed volume may only have
certain definite frequencies. Consider, for example, a box of dimensions
$a_1 \times a_2 \times a_3$ with perfectly rigid walls, so that the normal
velocity must be zero over the surface. It is most convenient to take the
boundaries to be defined by $x = 0, a_1$; $y = 0, a_2$; $z = 0, a_3$. We can
solve the wave equation by separation of variables; if the time variation is
$e^{-i\omega t}$, the equation for the velocity potential becomes

$$ \nabla^2\phi + \frac{\omega^2}{c^2}\,\phi = 0 \tag{5.177} $$

Putting $\phi = X(x)\,Y(y)\,Z(z)$, substituting in the equation, and dividing
by $XYZ$ yields

$$ \frac{1}{X}\frac{d^2X}{dx^2} + \frac{1}{Y}\frac{d^2Y}{dy^2} + \frac{1}{Z}\frac{d^2Z}{dz^2} + \frac{\omega^2}{c^2} = 0 $$
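Thus the first three terms must be constants, and their sum equal to
$-(\omega^2/c^2)$. For the boundary conditions to be satisfied, these
constants must be negative:

$$ \frac{d^2X}{dx^2} = -\alpha^2 X \tag{5.178a} $$
$$ \frac{d^2Y}{dy^2} = -\beta^2 Y \tag{5.178b} $$
$$ \frac{d^2Z}{dz^2} = -\gamma^2 Z \tag{5.178c} $$
$$ \alpha^2 + \beta^2 + \gamma^2 = \omega^2/c^2 \tag{5.179} $$

The solution to the first must be

$$ X = A\cos\alpha x $$

where $\alpha a_1 = l\pi$, $l$ being an integer. Similarly

$$ Y = B\cos\beta y, \qquad \beta a_2 = m\pi $$

and

$$ Z = C\cos\gamma z, \qquad \gamma a_3 = n\pi $$

$m$ and $n$ also being integers. Finally, substituting in (5.179) we find the
possible circular frequencies $\omega$:

$$ \omega^2 = \pi^2 c^2 \left( \frac{l^2}{a_1^2} + \frac{m^2}{a_2^2} + \frac{n^2}{a_3^2} \right) = \omega_{lmn}^2 \tag{5.180} $$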
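To make (5.180) concrete, the following sketch lists the lowest normal-mode frequencies of a rigid-walled box. The function name, the cutoff `nmax`, and the sample dimensions are illustrative choices, not taken from the text.

```python
import math
from itertools import product

def box_modes(a1, a2, a3, c, nmax=3):
    """Return sorted (omega_lmn, (l, m, n)) from (5.180) for 0 <= l, m, n <= nmax."""
    modes = []
    for l, m, n in product(range(nmax + 1), repeat=3):
        if (l, m, n) == (0, 0, 0):
            continue  # uniform potential: no motion, omega = 0
        omega = math.pi * c * math.sqrt((l / a1) ** 2 + (m / a2) ** 2 + (n / a3) ** 2)
        modes.append((omega, (l, m, n)))
    return sorted(modes)

if __name__ == "__main__":
    # Sample box of 1 x 2 x 3 length units with c = 1 (arbitrary units).
    for omega, lmn in box_modes(1.0, 2.0, 3.0, 1.0)[:5]:
        print(lmn, round(omega, 4))
```

The lowest mode always varies along the longest edge only, since that gives the smallest value of the bracket in (5.180).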


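What happens if one attempts to create a disturbance with a different
frequency $\omega_0$ inside the enclosure, with a source of frequency
$\omega_0$, let us say? The equation now becomes

$$ -\nabla^2\phi + \frac{1}{c^2}\frac{\partial^2\phi}{\partial t^2} = \delta(\mathbf{r}-\mathbf{r}_0)\,e^{-i\omega_0 t} \tag{5.181} $$

Put $\phi = \phi_0(\mathbf{r})\,e^{-i\omega_0 t}$, so that

$$ -\nabla^2\phi_0 - \frac{\omega_0^2}{c^2}\,\phi_0 = \delta(\mathbf{r}-\mathbf{r}_0) \tag{5.182} $$

It is possible to expand $\phi_0$ and the $\delta$ function in Fourier series
of the form

$$ \phi_0 = \sum_{l,m,n} A_{lmn} \cos\frac{l\pi x}{a_1}\cos\frac{m\pi y}{a_2}\cos\frac{n\pi z}{a_3} \tag{5.183} $$

$$ \delta(\mathbf{r}-\mathbf{r}_0) = \sum_{l,m,n} B_{lmn} \cos\frac{l\pi x}{a_1}\cos\frac{m\pi y}{a_2}\cos\frac{n\pi z}{a_3} \tag{5.184} $$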

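In the latter, multiplying through by one of the expansion functions,
integrating, and using orthogonality, it is found that

$$ B_{lmn} = \frac{8}{a_1 a_2 a_3}\cos\frac{l\pi x_0}{a_1}\cos\frac{m\pi y_0}{a_2}\cos\frac{n\pi z_0}{a_3} \tag{5.185} $$

where $x_0$, $y_0$, and $z_0$ are the components of $\mathbf{r}_0$.
Substituting in (5.182) and equating coefficients gives

$$ \left[ \pi^2\left( \frac{l^2}{a_1^2} + \frac{m^2}{a_2^2} + \frac{n^2}{a_3^2} \right) - \frac{\omega_0^2}{c^2} \right] A_{lmn} = \frac{8}{a_1 a_2 a_3}\cos\frac{l\pi x_0}{a_1}\cos\frac{m\pi y_0}{a_2}\cos\frac{n\pi z_0}{a_3} \tag{5.186} $$

Therefore,

$$ A_{lmn} = \frac{8c^2}{a_1 a_2 a_3}\,\frac{\cos(l\pi x_0/a_1)\cos(m\pi y_0/a_2)\cos(n\pi z_0/a_3)}{\omega_{lmn}^2 - \omega_0^2} \tag{5.187} $$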


Thus, in general, disturbances of any frequency can be sustained by a
source of that frequency. There is a "resonance," however, when
$\omega_0 = \omega_{lmn}$, that is, one of the "natural" frequencies or
normal modes. [The disturbance does not, of course, become infinite; the
equations leading to (5.187) are only valid for
$|s| = (\omega_0/c^2)\,|\phi|$ small.]

What then happens to this disturbance if the source is removed?

Clearly, the patterns corresponding to the various normal modes
having been established, each will continue to vibrate with its appropriate
frequency $\omega_{lmn}$.


Problem 5-18: Find the frequencies of the normal
modes of vibration of a gas in a cylindrical enclosure of radius b and length
a. If 2b = 1 ft and a = 1 ft, find the six lowest frequencies, and describe
the patterns of condensation in each mode.


Problem 5-19: One end of a closed cylinder of radius
b and length a undergoes small vibrations of amplitude
$\epsilon f(r)\cos\omega_0 t$. Find the resulting sound wave inside the
cylinder.


14. Spherical Enclosure 


We have already seen [Eq. (5.116)] that the solutions of the wave
equation in spherical coordinates are

$$ \phi = z_l(kr)\,Y_{lm}(\theta,\varphi)\,e^{-i\omega t} $$

where $k = \omega/c$ and $z_l$ is an arbitrary spherical Bessel function.
Since the solution must be finite at $r = 0$, $z_l$ must be the Bessel
function $j_l$. If the spherical enclosure is of radius $a$, the boundary
condition that the radial velocity is zero at $r = a$, viz.,

$$ \left(\frac{\partial\phi}{\partial r}\right)_{r=a} = 0 \tag{5.188} $$

yields for the frequencies the equation

$$ j_l'\!\left(\frac{\omega a}{c}\right) = 0 \tag{5.189} $$

Thus, the frequency for a given $l$ is independent of $m$. This is not
surprising, since if we merely rotate coordinates, $Y_{lm}(\theta,\varphi)$
goes into a linear combination of $Y_{lm'}(\theta',\varphi')$
$(m' = -l, \ldots, l)$, where $(\theta',\varphi')$ are the new
coordinates; but of course, such a rotation does not affect the frequency.
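As an illustration of (5.189), the sketch below finds the first nonzero roots of $j_0'(x) = 0$ and $j_1'(x) = 0$ by bisection, using the elementary closed forms of $j_0$ and $j_1$; the frequencies then follow from $\omega = xc/a$. The bracketing intervals and function names are assumptions chosen for this example.

```python
import math

def j0_prime(x):
    """Derivative of j_0(x) = sin(x)/x."""
    return (x * math.cos(x) - math.sin(x)) / x**2

def j1_prime(x):
    """Derivative of j_1(x) = sin(x)/x^2 - cos(x)/x."""
    return (2.0 * x * math.cos(x) + (x * x - 2.0) * math.sin(x)) / x**3

def bisect(func, lo, hi, iterations=80):
    """Plain bisection; assumes func changes sign exactly once on [lo, hi]."""
    f_lo = func(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if (func(mid) > 0.0) == (f_lo > 0.0):
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    x_l1 = bisect(j1_prime, 1.5, 2.5)   # lowest l = 1 mode
    x_l0 = bisect(j0_prime, 4.0, 5.0)   # lowest nontrivial l = 0 mode
    c, a = 1100.0, 1.0                  # ft/sec and ft, sample values
    print(x_l1, x_l0, x_l1 * c / a, x_l0 * c / a)
```

Note that the lowest mode of the sphere belongs to $l = 1$, not $l = 0$.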


15. Propagation Down a 
Cylindrical Tube 


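In cylindrical coordinates $\xi = \sqrt{x^2 + y^2}$, $z$, and
$\varphi = \tan^{-1}(y/x)$, the spatial wave equation (5.177) becomes

$$ \frac{1}{\xi}\frac{\partial}{\partial\xi}\left(\xi\,\frac{\partial\phi}{\partial\xi}\right) + \frac{1}{\xi^2}\frac{\partial^2\phi}{\partial\varphi^2} + \frac{\partial^2\phi}{\partial z^2} + \frac{\omega^2}{c^2}\,\phi = 0 \tag{5.190} $$

Putting

$$ \phi = F(\xi)\,G(\varphi)\,Z(z), \tag{5.191} $$

dividing through by $\phi$, and imposing the condition that the resulting
functions of $\xi$, $\varphi$, and $z$ must be constants, we obtain

$$ \frac{1}{\xi}\frac{d}{d\xi}\left(\xi\,\frac{dF}{d\xi}\right) - \frac{\beta^2}{\xi^2}\,F = -\alpha^2 F \tag{5.192a} $$
$$ \frac{d^2G}{d\varphi^2} = -\beta^2 G \tag{5.192b} $$
$$ \frac{d^2Z}{dz^2} = -\gamma^2 Z \tag{5.192c} $$

and

$$ \alpha^2 + \gamma^2 = \frac{\omega^2}{c^2} \tag{5.193} $$

Since $G$ must be single-valued, $\beta$ must be an integer $m$, and

$$ G = G_0 \cos m(\varphi - \varphi_0) \tag{5.194} $$

$G_0$ and $\varphi_0$ being constants. The solution of (5.192a) is then

$$ F = F_0\,J_m(\alpha\xi); \tag{5.195} $$

the second solution of the Bessel equation must be rejected because of the
requirement of finiteness at $\xi = 0$. To satisfy the boundary condition of
zero radial velocity at the radius $\xi = a$ of the tube, we must have

$$ J_m'(\alpha a) = 0 \tag{5.196} $$

which gives a set of values $\alpha = (\lambda_{mj}/a)$ which can be
obtained from standard mathematical tables.²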

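If the frequency $\omega$ is fixed, $\gamma$ is now determined by (5.193):

$$ \gamma_{mj}^2 = \frac{\omega^2}{c^2} - \frac{\lambda_{mj}^2}{a^2} \tag{5.197} $$

If the right-hand side is positive, the wave is propagated without
attenuation down the tube, the solution being

$$ \phi = A\,J_m\!\left(\lambda_{mj}\,\frac{\xi}{a}\right) \cos m(\varphi - \varphi_0)\; e^{i(\gamma_{mj} z - \omega t)} \tag{5.198} $$

If, on the other hand, the right-hand side of (5.197) is negative,

$$ \gamma_{mj} = i\kappa_{mj}, \qquad \kappa_{mj} = \sqrt{\frac{\lambda_{mj}^2}{a^2} - \frac{\omega^2}{c^2}} \tag{5.199} $$

and the solution has the form

$$ \phi = A\,J_m\!\left(\lambda_{mj}\,\frac{\xi}{a}\right) \cos m(\varphi - \varphi_0)\; e^{-\kappa_{mj} z}\, e^{-i\omega t} \tag{5.200} $$

In this case the wave is attenuated as it proceeds down the tube.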


² See, for example, Morse and Feshbach, Methods of Theoretical Physics. New
York: McGraw-Hill, 1953. p. 1576.


It may be seen from the foregoing that as $a$, the radius of the tube,
decreases, more and more of the modes of propagation are attenuated. In
fact, for sufficiently small radius, only one mode of propagation remains.
For we note that, in the case $m = 0$, (5.196) is satisfied by $\alpha = 0$;
in this case, the solution has neither radial nor angular dependence but is
simply the plane wave

$$ \phi = A\,e^{i(kz - \omega t)} \tag{5.201} $$

That this is a solution is quite evident from the fact that
$\partial\phi/\partial\xi = 0$ everywhere, so that the boundary condition is
satisfied on the sides of the tube. Thus, the propagation of a plane wave
down the tube, with wave front perpendicular to the axis, is always possible.

The following are the first few zeros of the derivatives of the Bessel
functions of low order:

    λ_01 = 3.83    λ_02 = 7.02    λ_03 = 10.17
    λ_11 = 1.84    λ_12 = 5.33    λ_13 = 8.54
    λ_21 = 3.05    λ_22 = 6.71    λ_23 = 9.97
    λ_31 = 4.20    λ_32 = 8.02


Thus, if the circular frequency is lower than $\lambda_{11}c/a = 1.84c/a$
only a plane wave will fail to be attenuated; or alternatively, for given
frequency, only a plane wave will be propagated if $a < (1.84c/\omega)$.
Taking the velocity of sound in air at normal temperature and pressure to be
c = 1100 ft/sec, and if the frequency is 500 cycles/sec, we find this limit
on $a$ to be about 7.73 in.; as the frequency changes, the limiting value of
$a$ changes in inverse proportion. Thus, for any frequency, a sufficiently
small pipe "filters out" all but the plane wave component.
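The numbers in this paragraph can be reproduced with a short script. The sketch below evaluates $J_n$ from Bessel's integral (an implementation choice made here so that only the standard library is needed), locates the first zero of $J_1'$ by bisection, and converts it to the limiting tube radius for c = 1100 ft/sec and 500 cycles/sec. Function names and the bracketing interval are assumptions.

```python
import math

def bessel_j(n, x, steps=400):
    """J_n(x) for integer n from (1/pi) * integral_0^pi cos(n*tau - x*sin(tau)) dtau."""
    h = math.pi / steps
    total = 0.5 * (1.0 + math.cos(n * math.pi))   # half-weighted end points
    for k in range(1, steps):
        tau = k * h
        total += math.cos(n * tau - x * math.sin(tau))
    return total * h / math.pi

def bessel_j_prime(n, x):
    """J_n'(x) = [J_{n-1}(x) - J_{n+1}(x)] / 2."""
    return 0.5 * (bessel_j(n - 1, x) - bessel_j(n + 1, x))

def first_zero_of_j1_prime(lo=1.0, hi=3.0, iterations=60):
    """Bisection on J_1'(x); the bracket [1, 3] contains only the first zero."""
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if bessel_j_prime(1, mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def cutoff_radius_inches(freq_hz, c_ft_per_sec=1100.0):
    """Largest radius for which only the plane wave propagates: a = lambda_11 c / omega."""
    lam11 = first_zero_of_j1_prime()
    omega = 2.0 * math.pi * freq_hz
    return 12.0 * lam11 * c_ft_per_sec / omega

if __name__ == "__main__":
    print(first_zero_of_j1_prime(), cutoff_radius_inches(500.0))
```

With the unrounded zero the limit comes out marginally above the value quoted in the text, which uses $\lambda_{11}$ rounded to 1.84.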


Problem 5-20: Calculate the limiting dimensions for 
propagation of a sound wave other than a plane wave, of circular fre- 
quency w, down a rectangular pipe. 


16. Scattering of Sound Waves 


We consider the problem of a plane wave of a given circular frequency
$\omega$ incident on a scattering object in the neighborhood of the origin
of the coordinate system. The polar axis is taken to be in the direction
of propagation of the incident wave. The velocity potential for this
incident wave is then

$$ \phi_{\rm inc} = e^{i(kz - \omega t)} \tag{5.202} $$

The amplitude may be taken to be unity, since the amplitude of the
scattered wave, and therefore of the complete disturbance, is proportional
to that of the incident wave.

In order to understand the scattering problem, it is useful to express
the plane wave in terms of spherical waves, using (5.122):

$$ \phi_{\rm inc} = \sum_{l=0}^{\infty} i^l\,(2l+1)\,j_l(kr)\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.203} $$
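The expansion (5.203) is easy to verify numerically. The sketch below sums the partial waves at one point and compares the result with $e^{ikr\cos\theta}$. The spherical Bessel functions are evaluated from their ascending power series, an implementation choice that is adequate only for moderate $kr$; the truncation order and sample values are assumptions.

```python
import cmath
import math

def sph_jn(l, x, terms=60):
    """j_l(x) from the series x^l * sum_k (-x^2/2)^k / (k! (2l+2k+1)!!)."""
    term = 1.0
    for j in range(1, l + 1):
        term *= x / (2 * j + 1)            # builds x^l / (2l+1)!!
    total = term
    for k in range(1, terms):
        term *= -(x * x / 2.0) / (k * (2 * l + 2 * k + 1))
        total += term
    return total

def legendre_list(lmax, mu):
    """[P_0(mu), ..., P_lmax(mu)] by the standard three-term recurrence."""
    P = [1.0, mu]
    for n in range(1, lmax):
        P.append(((2 * n + 1) * mu * P[n] - n * P[n - 1]) / (n + 1))
    return P[: lmax + 1]

def plane_wave_partial_sum(kr, mu, lmax=30):
    """Spatial part of (5.203) truncated at l = lmax."""
    P = legendre_list(lmax, mu)
    return sum((1j ** l) * (2 * l + 1) * sph_jn(l, kr) * P[l] for l in range(lmax + 1))

if __name__ == "__main__":
    kr, mu = 3.0, 0.4                      # sample point, mu = cos(theta)
    print(plane_wave_partial_sum(kr, mu), cmath.exp(1j * kr * mu))
```

Only partial waves with $l$ up to a little beyond $kr$ contribute appreciably, which is the basis of the cutoff argument used later in this section.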


Furthermore, as shown in connection with Eqs. (5.106) and (5.107), if
we write $j_l(kr)$ in terms of $h_l^{(1)}(kr)$ and $h_l^{(2)}(kr)$,

$$ j_l(kr) = \tfrac{1}{2}\left[ h_l^{(1)}(kr) + h_l^{(2)}(kr) \right] \tag{5.204} $$

the two Hankel functions represent waves, respectively diverging from
and converging on the scatterer.

If the complete velocity potential outside the region occupied by the
scatterer is $\phi$, we define the scattered wave as

$$ \phi_{\rm sc} = \phi - \phi_{\rm inc} \tag{5.205} $$

The scattered wave will involve only partial waves diverging from the
scatterer and the origin; it will, therefore, be of the form

$$ \phi_{\rm sc} = \sum_{l,m} c_{lm}\,h_l^{(1)}(kr)\,Y_{lm}(\theta,\varphi)\,e^{-i\omega t} \tag{5.206} $$

If we consider only the special case for which the scatterer has axial
symmetry about the polar axis, only the $m = 0$ terms appear. The
scattered wave may then be written

$$ \phi_{\rm sc} = \tfrac{1}{2}\sum_l (W_l - 1)\,i^l\,(2l+1)\,h_l^{(1)}(kr)\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.207} $$

The total velocity potential for the problem takes the form

$$ \phi = \tfrac{1}{2}\sum_l i^l\,(2l+1)\left[ W_l\,h_l^{(1)}(kr) + h_l^{(2)}(kr) \right] P_l(\cos\theta)\,e^{-i\omega t} \tag{5.208} $$


The values of the constants $W_l$ are determined by the fact that $\phi$
must satisfy boundary conditions at the "surface" of the scatterer. For
sound waves, this "surface" is generally a physical boundary. The
boundary conditions depend, as we shall see, on the physical nature of
this surface. Problems quite similar mathematically to that of the
scattering of sound waves occur in electromagnetic theory and in quantum
mechanics. In electromagnetic theory, the situation is complicated by the
fact that the wave has a vector character. Aside from that fact, the main
feature distinguishing the various scattering problems is the difference in
boundary conditions imposed by the physics of each system. In quantum
mechanics, in particular, as we shall see later, we seldom have
well-defined boundary surfaces, inside which the problem has a different
character (and the same is true of electromagnetic theory). Nevertheless,
the solution inside the scatterer can in each case be incorporated
into a boundary condition at a "surface," the definition of which may,
however, involve a certain arbitrariness.

For the moment, we shall be concerned with some results of a general
character, for which the amplitudes $W_l$ need not be specified. These
results then apply equally to the scattering of sound waves, of
electromagnetic waves, or of "quantum" (particle) waves.

These results are formulated in terms of "cross sections." The
"scattering cross section" $\sigma_{\rm sc}$ is determined by the definition

$$ \sigma_{\rm sc} = \frac{\text{scattered flux}}{\text{incident flux per unit area}} \tag{5.209} $$

The word "flux" here incorporates the idea of "per unit time." One must,
however, specify flux of what? Here again, the situation differs according
to the physics of the problem. For sound waves, we consider flux of sound
energy, so that the flux over a surface is

$$ P = \left\langle -\rho_0 \int \frac{\partial\phi_r}{\partial t}\,\nabla\phi_r\cdot\mathbf{n}\,dS \right\rangle \tag{5.210} $$

the integral being taken over the surface. $\phi_r$ means the physical, or
real, part of $\phi$. The brackets $\langle\ \rangle$ indicate a time
average over a sufficiently long period of time (for example, for a
periodic wave, over many periods). The cross section defined by (5.209) has
the meaning that the rate of energy flux of the incident wave over
$\sigma_{\rm sc}$ is equal to the total energy flux in the scattered wave;
it is as though all the energy incident on the area $\sigma_{\rm sc}$ were
scattered.

Let us calculate the flux per unit area in the incident wave (5.202). It
is

$$ \left\langle \rho_0\,\omega \sin(kz - \omega t)\; k \sin(kz - \omega t) \right\rangle = \tfrac{1}{2}\rho_0 k^2 c \tag{5.211} $$
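since $\omega = kc$. On the other hand, the scattered flux at large distances
from the origin may be calculated from the asymptotic form of
$\phi_{\rm sc}$:

$$ \phi_{\rm sc}^{\rm (as)} = \frac{1}{2}\sum_l (W_l - 1)\,i^l\,(2l+1)\,\frac{e^{i[kr - (l+1)\pi/2]}}{kr}\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.212} $$

Putting

$$ W_l - 1 = |W_l - 1|\,e^{i\beta_l} \tag{5.213} $$

the real part of this is

$$ \left(\phi_{\rm sc}^{\rm (as)}\right)_r = \frac{1}{2kr}\sum_l |W_l - 1|\,(2l+1)\,P_l(\cos\theta)\cos\!\left(\beta_l + kr - \frac{\pi}{2} - \omega t\right) \tag{5.214} $$

Then the asymptotic value for the excess pressure
$\rho_0(\partial\phi/\partial t)$ is

$$ p^{\rm (as)} = \frac{1}{2r}\,\rho_0 c \sum_l |W_l - 1|\,(2l+1)\,P_l(\cos\theta)\sin\!\left(\beta_l + kr - \frac{\pi}{2} - \omega t\right) \tag{5.215} $$

and for the radial velocity $-\partial\phi/\partial r$ is

$$ -\left(\frac{\partial\phi}{\partial r}\right)^{\rm (as)} = \frac{1}{2r}\sum_l |W_l - 1|\,(2l+1)\,P_l(\cos\theta)\sin\!\left(\beta_l + kr - \frac{\pi}{2} - \omega t\right) \tag{5.216} $$

These must now be multiplied together, and the time average of the
integral of the product calculated over a sphere of sufficiently large
radius. We may use the fact that the Legendre polynomials are mutually
orthogonal, and that the time average of
$\sin^2(\beta_l + kr - (\pi/2) - \omega t)$ is $\tfrac{1}{2}$. Therefore the
scattered flux is

$$ P_{\rm sc} = \tfrac{1}{8}\,\rho_0 c\,4\pi \sum_l |W_l - 1|^2\,(2l+1) \tag{5.217} $$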

Dividing (5.217) by (5.211) we find that

$$ \sigma_{\rm sc} = \frac{\pi}{k^2}\sum_l |W_l - 1|^2\,(2l+1) \tag{5.218} $$

We see that the scattering cross section is made up of distinct
contributions from each order of spherical harmonic. The separate terms will
be designated as the "partial cross sections" of different multipole orders.
In the electromagnetic and quantum cases, the terms with various $l$ are
associated with different angular momenta about the origin. Such an
identification is less apt in the case of sound waves.

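From (5.212), we see that

$$ \phi_{\rm sc}^{\rm (as)} = \frac{e^{i(kr - \omega t)}}{r}\,f(\theta) \tag{5.219} $$

where the "scattering amplitude" is

$$ f(\theta) = \frac{1}{2ik}\sum_l (2l+1)\,(W_l - 1)\,P_l(\cos\theta) \tag{5.220} $$

The "forward scattering amplitude" is

$$ f(0) = \frac{1}{2ik}\sum_l (2l+1)\,(W_l - 1) \tag{5.221} $$

since $P_l(1) = 1$ for all values of $l$. This formula will be found useful
shortly.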

If we look at (5.208), we see that the total velocity potential can be
split into a part converging on the origin

$$ \phi_{\rm conv} = \tfrac{1}{2}\sum_l i^l\,(2l+1)\,h_l^{(2)}(kr)\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.222} $$

and a part diverging from the origin

$$ \phi_{\rm div} = \tfrac{1}{2}\sum_l i^l\,(2l+1)\,W_l\,h_l^{(1)}(kr)\,P_l(\cos\theta)\,e^{-i\omega t} \tag{5.223} $$

If the scatterer is removed, $W_l = 1$. The energy flux in the diverging wave
is easily obtained by observing that $\phi_{\rm div}$ differs from
$\phi_{\rm sc}$ only in having $W_l$ in place of $(W_l - 1)$: thus, the flux
is

$$ P_{\rm div} = \frac{\pi}{2}\,\rho_0 c \sum_l |W_l|^2\,(2l+1) \tag{5.224} $$

On the other hand, the flux in the converging wave is in the opposite
direction [i.e., $k \to -k$ in (5.215) and (5.216)], but otherwise differs
from (5.224) only in that $W_l$ is replaced by unity:

$$ P_{\rm conv} = \frac{\pi}{2}\,\rho_0 c \sum_l (2l+1) \tag{5.225} $$


Summed over all $l$, this diverges; this is not really surprising, in that
the plane wave, strictly speaking, contains an infinite amount of energy
since it is of infinite extent perpendicular to the wave front. In practice,
however, we are only interested in that part of the plane wave which is
affected by the scatterer. Suppose, therefore, that the scatterer lies
entirely within a sphere of radius $R$ around the origin. We can then ignore
those partial waves in the expansion of $e^{ikz}$ which have negligible
amplitude for $r < R$; that is, those for which $j_l(kR)$ is negligible. From
the results of Appendix 5C, this is seen to be the case when $l$ is
substantially larger than $kR$. We, therefore, cut off all of the series at
a finite value $l = L > kR$.

If the "scatterer" cannot absorb energy from the wave, the flux in the
diverging wave must equal that in the converging wave for each $l$; that is

$$ |W_l| = 1 \tag{5.226} $$

or

$$ W_l = e^{2i\delta_l} \tag{5.227} $$


(The factor 2 in the exponent is chosen for convenience, as will be seen as
we proceed.) The angle $\delta_l$ is called the "phase shift for the $l$th
partial wave." In terms of this phase shift, it is easily seen that the
scattering cross section is

$$ \sigma_{\rm sc} = \frac{\pi}{k^2}\sum_l \left| e^{2i\delta_l} - 1 \right|^2 (2l+1) = \frac{4\pi}{k^2}\sum_l (2l+1)\sin^2\delta_l \tag{5.228} $$

From (5.213) and (5.214), the asymptotic form of the scattered velocity
potential is

$$ \left(\phi_{\rm sc}^{\rm (as)}\right)_r = \frac{1}{kr}\sum_l (2l+1)\sin\delta_l\,P_l(\cos\theta)\cos(kr + \delta_l - \omega t) \tag{5.229} $$


If the "scatterer" is also an absorber, the flux of the diverging wave is
less than that in the converging wave, and the inequality

$$ |W_l| < 1 \tag{5.230} $$

holds. The absorbed energy will be the excess of the flux in the
converging wave over that in the diverging wave:

$$ P_{\rm abs} = \frac{\pi}{2}\,\rho_0 c \sum_l (2l+1)\left(1 - |W_l|^2\right) \tag{5.231} $$

We may then define the "absorption cross section"

$$ \sigma_{\rm abs} = \frac{P_{\rm abs}}{\text{incident flux per unit area}} = \frac{\pi}{k^2}\sum_l (2l+1)\left(1 - |W_l|^2\right) \tag{5.232} $$

The difference between $|W_l|$ and 1 approaches zero very rapidly when
$l > L$. Finally, it is possible to define the "total cross section"

$$ \sigma_{\rm tot} = \sigma_{\rm sc} + \sigma_{\rm abs} \tag{5.233} $$
$$ {} = \frac{\pi}{k^2}\sum_l (2l+1)\left\{ |W_l - 1|^2 + 1 - |W_l|^2 \right\} = \frac{2\pi}{k^2}\,{\rm Re}\sum_l (2l+1)\,(1 - W_l) \tag{5.234} $$

"Re" signifying, as usual, "real part of." Comparing with (5.221), we
may write

$$ \sigma_{\rm tot} = \frac{4\pi}{k}\,{\rm Im}\,f(0) \tag{5.235} $$

This result, which relates the total cross section to the forward scattering
amplitude, is known as the "optical theorem" of Bohr, Peierls, and
Placzek.
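The optical theorem is an algebraic identity in the amplitudes $W_l$, so it can be checked directly with arbitrary values. The sketch below evaluates (5.218), (5.232), and (5.221) for randomly chosen $W_l$ with $|W_l| \le 1$ and compares $\sigma_{\rm sc} + \sigma_{\rm abs}$ with $(4\pi/k)\,{\rm Im}\,f(0)$. The function name, the random amplitudes, and the value of $k$ are illustrative assumptions.

```python
import cmath
import math
import random

def cross_sections(W, k):
    """Return (sigma_sc, sigma_abs, f(0)) from (5.218), (5.232), and (5.221).

    W is a list of complex amplitudes W_l for l = 0, 1, ..., len(W) - 1.
    """
    pref = math.pi / k**2
    sigma_sc = pref * sum((2 * l + 1) * abs(w - 1.0) ** 2 for l, w in enumerate(W))
    sigma_abs = pref * sum((2 * l + 1) * (1.0 - abs(w) ** 2) for l, w in enumerate(W))
    f0 = sum((2 * l + 1) * (w - 1.0) for l, w in enumerate(W)) / (2j * k)
    return sigma_sc, sigma_abs, f0

if __name__ == "__main__":
    random.seed(1)
    k = 2.5
    W = [random.random() * cmath.exp(2j * math.pi * random.random()) for _ in range(8)]
    sc, ab, f0 = cross_sections(W, k)
    print(sc + ab, 4.0 * math.pi / k * f0.imag)
```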


17. Inequalities Satisfied by 
the Cross Sections 


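Consider now the partial waves for each $l$, designating the partial
scattering, absorption, and total cross section, respectively, as

$$ \sigma_{\rm sc}^{l} = \frac{\pi}{k^2}\,(2l+1)\,|W_l - 1|^2 = x_l \tag{5.236} $$

$$ \sigma_{\rm abs}^{l} = \frac{\pi}{k^2}\,(2l+1)\left(1 - |W_l|^2\right) = y_l \tag{5.237} $$

and

$$ \sigma_{\rm tot}^{l} = \frac{2\pi}{k^2}\,(2l+1)\,{\rm Re}\,(1 - W_l) = z_l = x_l + y_l \tag{5.238} $$

The fact that both the scattering and total cross sections can be expressed
in terms of $(1 - W_l)$ leads to an interesting and useful inequality, namely,

$$ \frac{\left[\sigma_{\rm tot}^{l}\right]^2}{\left[(2\pi/k^2)(2l+1)\right]^2} \le \frac{\sigma_{\rm sc}^{l}}{(\pi/k^2)(2l+1)} $$

Writing

$$ \frac{\pi}{k^2}\,(2l+1) = \sigma_l \tag{5.239} $$

this may be written

$$ \left[\sigma_{\rm tot}^{l}\right]^2 \le 4\,\sigma_l\,\sigma_{\rm sc}^{l} $$

or

$$ (x_l + y_l)^2 \le 4\,\sigma_l\,x_l \tag{5.240} $$

This inequality may be represented graphically by taking $x_l$ and $y_l$ as
coordinates along orthogonal axes. The allowed values of the cross
sections are then represented by the points lying between the $x_l$ axis and
the parabola

$$ (x_l + y_l)^2 = 4\,\sigma_l\,x_l \tag{5.241} $$

as illustrated in Fig. 5.2.

[Figure 5.2: the parabola (5.241) in the $(x_l, y_l)$ plane; the marked values are $\sigma_l$ and $4\sigma_l$.]

Thus, the maximum possible value of $\sigma_{\rm sc}^{l}$ is
$4\sigma_l = (4\pi/k^2)(2l+1)$; of $\sigma_{\rm abs}^{l}$,
$\sigma_l = (\pi/k^2)(2l+1)$; furthermore, if $\sigma_{\rm abs}^{l}$ has its
maximum value, $\sigma_{\rm sc}^{l}$ must have the same value.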
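The region bounded by the parabola (5.241) can be explored numerically. The sketch below computes $x_l$, $y_l$, and $\sigma_l$ for a given $W_l$ and checks (5.240) for amplitudes sampled in the unit disk, together with the two extreme cases $W_l = -1$ (maximum scattering) and $W_l = 0$ (maximum absorption). Names and sample values are illustrative assumptions.

```python
import cmath
import math
import random

def partial_cross_sections(W_l, l, k):
    """Return (x_l, y_l, sigma_l) as defined in (5.236), (5.237), and (5.239)."""
    sigma_l = math.pi / k**2 * (2 * l + 1)
    x_l = sigma_l * abs(W_l - 1.0) ** 2
    y_l = sigma_l * (1.0 - abs(W_l) ** 2)
    return x_l, y_l, sigma_l

def satisfies_bound(W_l, l, k, tol=1e-12):
    """Check (x_l + y_l)^2 <= 4 sigma_l x_l, allowing for rounding error."""
    x_l, y_l, sigma_l = partial_cross_sections(W_l, l, k)
    return (x_l + y_l) ** 2 <= 4.0 * sigma_l * x_l + tol

if __name__ == "__main__":
    random.seed(3)
    k, l = 1.0, 2
    samples = [random.random() * cmath.exp(2j * math.pi * random.random()) for _ in range(1000)]
    print(all(satisfies_bound(w, l, k) for w in samples))
    print(partial_cross_sections(-1.0, l, k))   # x = 4 sigma_l, y = 0
    print(partial_cross_sections(0.0, l, k))    # x = y = sigma_l
```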


Somewhat less rigorously, we can obtain limits on the complete
cross sections (the sums over all values of $l$).

Assuming that the scattering of all partial waves beyond $l = L$ can
be neglected, we form L-dimensional vectors $\mathbf{Z}$, $\mathbf{X}$, and
$\mathbf{S}$ with components $\sqrt{z_l}$, $\sqrt{x_l}$, and
$\sqrt{\sigma_l}$, respectively. Since, by (5.241)

$$ z_l \le 2\sqrt{x_l}\,\sqrt{\sigma_l} \tag{5.242} $$

it follows on summing over $l$ that

$$ \mathbf{Z}^2 \le 2\,\mathbf{X}\cdot\mathbf{S} \tag{5.243} $$

where the right-hand side is a scalar product. But, therefore,

$$ \mathbf{Z}^2 \le 2\,|\mathbf{X}|\,|\mathbf{S}| \tag{5.244} $$

In terms of components this may be written as

$$ \sigma_{\rm tot} \le 2\sqrt{\sigma_{\rm sc}}\,\sqrt{\frac{\pi}{k^2}\sum_{l=0}^{L}(2l+1)} \tag{5.245} $$

Squaring both sides,

$$ \sigma_{\rm tot}^2 \le \frac{4\pi}{k^2}\,(L+1)^2\,\sigma_{\rm sc} \tag{5.246} $$


This result is exact; the question is, however, what is the smallest
value of $L$ for which it is valid? As seen in Appendix 5A, the value
should be taken somewhat larger than $kR$. However, since $l = kR$ divides
the strongly scattered partial waves from the weakly scattered ones,
and the additional contributions beyond $l = kR$ may be considered to be
compensated by a diminished scattering as $l$ approaches $kR$ from below,
the choice $L = kR$ is in fact not unreasonable. This leads to

$$ \sigma_{\rm tot}^2 \le 4\pi\left(R + \frac{1}{k}\right)^2 \sigma_{\rm sc} \tag{5.247} $$

More detailed analysis shows, in fact, that this inequality is rather well
satisfied for all values of $R$: even as $R \to 0$ it is correct, since then
only the $l = 0$ wave is important. For $kR \gg 1$, $1/k$ may be neglected
relative to $R$.

The inequality (5.247) may be represented graphically in the same
manner as were the inequalities for the partial waves. It follows, then, that
the maximum absorption cross section is, for scatterers whose dimensions
are large compared with the wavelength, equal to $\pi R^2$, that is, the
geometrical cross-sectional area of the scatterer. This seems a reasonable
result: a completely absorbing body is one which absorbs all of the wave
which impinges on it. But for maximum absorption, we have seen that the
scattering cross section must be equal to the absorption cross section. This
result is also reasonable, for we know that, for wavelength short compared
to the dimensions of the scatterer, there is negligible diffraction into the
region immediately behind the scatterer, which shadows this region. Thus,
there must be a scattered wave which interferes destructively with the
incident wave in this region. It must, therefore, have the same intensity as
the incident wave would have in this region in the absence of the scatterer.
Thus, the total cross section for a perfect absorber is $2\pi R^2$.


18. Elastic Scattering by Small 
Sphere 


Let us consider the problem of scattering by a hard sphere of radius
$a$ which does not absorb sound. By virtue of (5.208) and (5.227) the
velocity potential of the problem has the form

$$ \phi = \tfrac{1}{2}\sum_l i^l\,(2l+1)\left[ e^{2i\delta_l}\,h_l^{(1)}(kr) + h_l^{(2)}(kr) \right] P_l(\cos\theta)\,e^{-i\omega t} \tag{5.248} $$

The phase shift $\delta_l$ is determined by the boundary condition; once it
is known, the cross section may be written down immediately. But at the
surface of a hard sphere the radial component of velocity must be zero;
we must therefore have

$$ e^{2i\delta_l}\,h_l^{(1)\prime}(ka) + h_l^{(2)\prime}(ka) = 0 \tag{5.249} $$

If we write

$$ h_l^{(1)\prime}(ka) = \sqrt{[j_l'(ka)]^2 + [n_l'(ka)]^2}\;e^{-i\gamma_l} \tag{5.250} $$
$$ h_l^{(2)\prime}(ka) = \sqrt{[j_l'(ka)]^2 + [n_l'(ka)]^2}\;e^{i\gamma_l} \tag{5.251} $$

where

$$ \cos\gamma_l = \frac{j_l'(ka)}{\sqrt{[j_l'(ka)]^2 + [n_l'(ka)]^2}} \tag{5.252} $$

Eq. (5.249) takes the form

$$ e^{2i\delta_l} + e^{2i\gamma_l} = 0 $$

so that

$$ \delta_l = \frac{\pi}{2} + \gamma_l \tag{5.253} $$

By (5.228) the scattering cross section is then

$$ \sigma_{\rm sc} = \frac{4\pi}{k^2}\sum_l (2l+1)\cos^2\gamma_l = \frac{4\pi}{k^2}\sum_l \frac{(2l+1)\,[j_l'(ka)]^2}{[j_l'(ka)]^2 + [n_l'(ka)]^2} \tag{5.254} $$


When $ka$ is small, the series converges rapidly. In order to find an
approximate form, consider the behavior of $j_l'(x)$ and $n_l'(x)$ for small
values of $x$. From (4.339)

$$ j_l(x) \approx \frac{2^l\,l!}{(2l+1)!}\,x^l $$

and

$$ j_l'(x) \approx \frac{2^l\,l!\;l}{(2l+1)!}\,x^{l-1} \tag{5.255} $$

For the case $l = 0$, since the leading term in $j_0$ is a constant we must
go to the next term in its expansion. It is easily verified that

$$ j_0'(x) \approx -\tfrac{1}{3}\,x \tag{5.256} $$

The $n_l'$ terms in the denominator become very large compared with the
$j_l'$ ones. It is easily verified from (4.377) that for small $x$

$$ n_l(x) \approx -\frac{(2l)!}{2^l\,l!}\,x^{-(l+1)} $$

and

$$ n_l'(x) \approx \frac{(2l)!\,(l+1)}{2^l\,l!}\,x^{-(l+2)} \tag{5.257} $$

It follows that

$$ \cos\gamma_l \approx \text{constant} \times x^{2l+1} \tag{5.258} $$

except for $l = 0$, for which

$$ \cos\gamma_0 \approx -\tfrac{1}{3}\,x^3 \tag{5.259} $$

The $l = 1$ term also behaves like $x^3$ for small $x$. Each further term is
smaller by $(ka)^2$ and can, therefore, for small $ka$, be neglected. But

$$ \cos\gamma_1 \approx \tfrac{1}{6}\,x^3 \tag{5.260} $$

The approximate value of the cross section when the radius $a$ of the
scatterer is considerably smaller than the wavelength is

$$ \sigma_{\rm sc} \approx \frac{4\pi}{k^2}\left[ \frac{1}{9}(ka)^6 + \frac{1}{12}(ka)^6 \right] = \frac{7\pi}{9}\,k^4 a^6 \tag{5.261} $$

Thus, the cross section is proportional to the sixth power of the radius of
the scatterer (Rayleigh's sixth-power law) and as the inverse fourth power
of the wavelength of the incident wave.
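The small-$ka$ limit can be compared with the full partial-wave sum (5.254). The sketch below evaluates $j_l'$ and $n_l'$ for the first few $l$ by upward recurrence from the elementary forms of $j_0$, $j_1$, $n_0$, $n_1$ (adequate only because very few partial waves are kept), sums (5.254), and forms the ratio to $(7\pi/9)k^4a^6$. The truncation `lmax=3` and the sample values of $k$ and $a$ are assumptions.

```python
import math

def sph_bessel_derivs(lmax, x):
    """Return (jp, np_): lists of j_l'(x) and n_l'(x) for l = 0..lmax (lmax >= 1)."""
    j = [math.sin(x) / x, math.sin(x) / x**2 - math.cos(x) / x]
    n = [-math.cos(x) / x, -math.cos(x) / x**2 - math.sin(x) / x]
    for l in range(1, lmax):
        j.append((2 * l + 1) / x * j[l] - j[l - 1])
        n.append((2 * l + 1) / x * n[l] - n[l - 1])
    # f_0' = -f_1 and f_l' = f_{l-1} - (l+1)/x * f_l hold for both j and n
    jp = [-j[1]] + [j[l - 1] - (l + 1) / x * j[l] for l in range(1, lmax + 1)]
    np_ = [-n[1]] + [n[l - 1] - (l + 1) / x * n[l] for l in range(1, lmax + 1)]
    return jp, np_

def hard_sphere_cross_section(k, a, lmax=3):
    """Partial-wave sum (5.254), truncated at l = lmax."""
    jp, np_ = sph_bessel_derivs(lmax, k * a)
    total = sum((2 * l + 1) * jp[l] ** 2 / (jp[l] ** 2 + np_[l] ** 2) for l in range(lmax + 1))
    return 4.0 * math.pi / k**2 * total

def rayleigh_limit(k, a):
    """Small-ka form (5.261)."""
    return 7.0 * math.pi / 9.0 * k**4 * a**6

if __name__ == "__main__":
    k, a = 1.0, 0.05
    print(hard_sphere_cross_section(k, a) / rayleigh_limit(k, a))
```

The ratio approaches unity as $ka \to 0$, with corrections of relative order $(ka)^2$.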


19. Propagation of 
Electromagnetic Waves 


We now develop the theory of electromagnetic wave propagation on
the basis of Maxwell's equations. These equations have already been
given in Chapter 3. They are

$$ \nabla\cdot\mathbf{D} = \rho \tag{3.14} $$
$$ \nabla\cdot\mathbf{B} = 0 \tag{3.15} $$
$$ \nabla\times\mathbf{E} = -\frac{\partial\mathbf{B}}{\partial t} \tag{3.16} $$
$$ \nabla\times\mathbf{H} = \mathbf{J} + \frac{\partial\mathbf{D}}{\partial t} \tag{3.17} $$

These are to be taken in conjunction with the "constitutive equations"

$$ \mathbf{D} = \epsilon\mathbf{E} \tag{3.18} $$
$$ \mathbf{B} = \mu\mathbf{H} \tag{3.19} $$

We treat only uniform media, for which $\epsilon$ and $\mu$ are constant. The
form of the equations is that for rationalized MKS units; we do not concern
ourselves here with various systems of units which are summarized in
Appendix 5B. We note merely that it is possible to write

$$ \epsilon = \epsilon'\epsilon_0 \tag{5.262} $$
$$ \mu = \mu'\mu_0 \tag{5.263} $$

where $\epsilon_0$, $\mu_0$ are the dielectric constant and the magnetic
permeability of a vacuum, and $\epsilon'$, $\mu'$ are called the specific
dielectric constant and magnetic permeability, respectively.

The quantities $\rho$ and $\mathbf{J}$ are electric charge and current
density, respectively. A relation may be developed between them by making
use of the fact that the rate of change of amount of charge within a volume
$V$, namely,

$$ \frac{\partial}{\partial t}\int_V \rho\,d^3r $$

is equal to the rate at which it flows in over the boundary,

$$ -\int \mathbf{J}\cdot\mathbf{n}\,dS $$

the integral being over the surface surrounding $V$, and $\mathbf{n}$ being
the normal drawn outward from the surface. Using Gauss' theorem to change the
surface integral into a volume integral, the "equation of continuity" has
the form

$$ \int \left( \frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{J} \right) d^3r = 0 $$

where the integral is over an arbitrary volume; consequently

$$ \frac{\partial\rho}{\partial t} + \nabla\cdot\mathbf{J} = 0 \tag{5.264} $$


Let us first consider $\rho$ and $\mathbf{J}$ as sources of the
electromagnetic field, so that (3.14) to (3.17) are the equations determining
the field produced by a given system of charges and currents. This is the
viewpoint of the theory of radiation, where $\rho$ and $\mathbf{J}$ specify
the properties of a radiating system, an "antenna." We might then call
$\rho$ and $\mathbf{J}$ the "external" charges and currents. There is, of
course, an artificiality about this viewpoint, since charges are themselves
in turn acted upon by fields; in the radiation problem, this reaction back
upon the sources of field is known as "radiation reaction." Under
appropriate circumstances, it may be ignored; the process here is the
familiar one, employed in all areas of physics, of "isolating" a particular
system from the whole of nature for study.

On the other hand, we may be interested in electromagnetic propagation
in a medium containing charges and currents; in such a case, $\rho$ and
$\mathbf{J}$ describe the medium. In that case, in Maxwell's equations
$\epsilon$ and $\mu$ should in the first instance be taken to have their
vacuum values $\epsilon_0$ and $\mu_0$. The charge and current then not only
determine the fields but are in turn determined by them, through the
equations of motion of charged particles in a field. Thus, the problem is
one of the behavior of a closed system under interaction between its parts.

In this case the electric current is normally determined by "Ohm's law"

    \mathbf{J} = \sigma \mathbf{E}    (5.265)

In some circumstances (e.g., in an external magnetic field or in an
anisotropic medium), σ may be a tensor; that is to say, Ohm's law may take
the form

    J_i = \sigma_{ij} E_j    (5.266)

(i, j = 1, 2, 3). Once J is known, ρ can be determined from the continuity
equation (5.264).

It should be noted that Eqs. (3.14) and (3.15) are not completely 
independent of (3.16) and (3.17). Taking the divergence of (3.16) leads to 


    \frac{\partial}{\partial t} \nabla \cdot \mathbf{B} = 0    (5.267)


We may consider (3.15) simply as providing an initial condition for (3.16). 
Similarly, the divergence of (3.17) gives 


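    \nabla \cdot \mathbf{J} + \frac{\partial}{\partial t} \nabla \cdot \mathbf{D} = 0

which, along with (5.264), gives

    \frac{\partial}{\partial t} (\nabla \cdot \mathbf{D} - \rho) = 0    (5.268)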


284 Propagation and Scattering of Waves 


Equation (3.14), in turn, provides an initial condition for this equation. 
Provided that these initial conditions are satisfied, we need only use (3.16) 
and (3.17) to develop the theory of electromagnetic radiation. 
Let us now determine the equations of wave propagation. Taking the 
curl of (3.16), and using the identity 
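
    \nabla \times (\nabla \times \mathbf{v}) = \nabla(\nabla \cdot \mathbf{v}) - \nabla^2 \mathbf{v}    (5.269)

we obtain

    \nabla(\nabla \cdot \mathbf{E}) - \nabla^2 \mathbf{E} = -\frac{\partial}{\partial t} \nabla \times \mathbf{B}

Substituting from (3.14) and (3.17) this equation becomes

    \nabla\left(\frac{\rho}{\epsilon}\right) - \nabla^2 \mathbf{E} = -\mu \frac{\partial}{\partial t}\left(\mathbf{J} + \epsilon \frac{\partial \mathbf{E}}{\partial t}\right)

or

    -\nabla^2 \mathbf{E} + \epsilon\mu \frac{\partial^2 \mathbf{E}}{\partial t^2} = -\mu \frac{\partial \mathbf{J}}{\partial t} - \frac{1}{\epsilon} \nabla \rho    (5.270)

Taking the curl of (3.17) and using the other equations, we find that

    \nabla(\nabla \cdot \mathbf{H}) - \nabla^2 \mathbf{H} = \nabla \times \mathbf{J} + \frac{\partial}{\partial t} \nabla \times \mathbf{D}

or

    -\nabla^2 \mathbf{H} = \nabla \times \mathbf{J} - \epsilon\mu \frac{\partial^2 \mathbf{H}}{\partial t^2}

so that

    -\nabla^2 \mathbf{H} + \epsilon\mu \frac{\partial^2 \mathbf{H}}{\partial t^2} = \nabla \times \mathbf{J}    (5.271)

In a region free from external charges and currents, the right-hand sides
of (5.270) and (5.271) are zero, so that both E and H satisfy wave
equations of the form

    -\nabla^2 f + \epsilon\mu \frac{\partial^2 f}{\partial t^2} = 0    (5.272)

A general solution of this equation may be written [see (5.16)]

    f = F(\mathbf{n} \cdot \mathbf{r} - ut)    (5.273)

where F is an arbitrary function and

    u = \frac{1}{\sqrt{\epsilon\mu}}

is the wave velocity. From (5.262) and (5.263)

    u = \frac{1}{\sqrt{\epsilon\mu}} = \frac{1}{\sqrt{\epsilon_0 \mu_0}} \frac{1}{\sqrt{\epsilon' \mu'}} = \frac{c}{\sqrt{\epsilon' \mu'}}    (5.274)

where c is the velocity of light in vacuo, with a value very close to
3 × 10^10 cm/sec. The quantity

    n = \frac{c}{u} = \sqrt{\epsilon' \mu'}    (5.275)

is the "index of refraction" of the medium.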
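As a numerical illustration of (5.274) and (5.275), the following sketch evaluates the wave velocity and the index of refraction. The vacuum constants are standard SI reference values and the relative dielectric constant 2.25 is a hypothetical choice; neither is taken from the text.

```python
import math

# Standard SI vacuum values (assumed here; not quoted in the text).
EPS0 = 8.8541878128e-12   # dielectric constant of vacuum, F/m
MU0 = 1.25663706212e-6    # magnetic permeability of vacuum, H/m


def wave_velocity(eps_rel, mu_rel):
    """u = 1/sqrt(eps*mu) with eps = eps' * eps0 and mu = mu' * mu0, Eq. (5.274)."""
    return 1.0 / math.sqrt(eps_rel * EPS0 * mu_rel * MU0)


def refractive_index(eps_rel, mu_rel):
    """n = c/u = sqrt(eps' * mu'), Eq. (5.275)."""
    return math.sqrt(eps_rel * mu_rel)


c = wave_velocity(1.0, 1.0)     # velocity in vacuo, in m/sec
u = wave_velocity(2.25, 1.0)    # hypothetical medium with eps' = 2.25, mu' = 1
print(c, u, c / u, refractive_index(2.25, 1.0))
```

The ratio c/u printed here should agree with the value returned by `refractive_index`, which is the content of (5.275).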

Because of relations (3.16) and (3.17) which relate E and H, (5.270)
and (5.271) may not be solved independently. But just as, in electrostatics,
a vector quantity (the electric field) may be expressed in terms of a scalar
potential φ, so in this case the components of the electric and magnetic
field, six in number, can be expressed in terms of a smaller number of
quantities. Since it is a standard result of vector analysis that a vector
field whose divergence is zero can be expressed as the curl of another
vector, it follows from (3.15) that B may be written as the curl of a vector
A, known as the "vector potential":

    \mathbf{B} = \nabla \times \mathbf{A}    (5.276)

If this is now substituted in (3.16) that equation may be expressed in the
form

    \nabla \times \left(\mathbf{E} + \frac{\partial \mathbf{A}}{\partial t}\right) = 0    (5.277)

But since any irrotational vector can be expressed as the gradient of a
scalar potential, we may put

    \mathbf{E} + \frac{\partial \mathbf{A}}{\partial t} = -\nabla \phi

or

    \mathbf{E} = -\nabla \phi - \frac{\partial \mathbf{A}}{\partial t}    (5.278)

Thus, E and B can be expressed in terms of the vector potential A and the
scalar potential φ.

Equations (5.276) and (5.278) are completely equivalent to the Maxwell
equations (3.15) and (3.16).

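The equations satisfied by A and φ must be determined by the remaining
Maxwell equations (3.14) and (3.17). Substituting in them, we obtain

    -\nabla^2 \phi - \frac{\partial}{\partial t} \nabla \cdot \mathbf{A} = \frac{\rho}{\epsilon}    (5.279)

and

    \frac{1}{\mu} \nabla \times (\nabla \times \mathbf{A}) + \epsilon \frac{\partial}{\partial t}\left(\nabla \phi + \frac{\partial \mathbf{A}}{\partial t}\right) = \mathbf{J}    (5.280)

or

    -\nabla^2 \mathbf{A} + \nabla\left(\nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t}\right) + \frac{1}{u^2} \frac{\partial^2 \mathbf{A}}{\partial t^2} = \mu \mathbf{J}    (5.281)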




In these equations A and φ are coupled. The situation may, however, be
simplified by noting that (5.276) does not determine A uniquely. In fact,
the vector B may be equally well represented by the vector potential A'
where

    \mathbf{A}' = \mathbf{A} + \nabla \chi    (5.282)

where χ is an arbitrary function. If we can take advantage of this
arbitrariness to choose a vector potential such that the equation

    \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} = 0    (5.283)

(called the "Lorentz condition") is satisfied, (5.279) and (5.281) become

    -\nabla^2 \phi + \frac{1}{u^2} \frac{\partial^2 \phi}{\partial t^2} = \frac{\rho}{\epsilon}    (5.284)

and

    -\nabla^2 \mathbf{A} + \frac{1}{u^2} \frac{\partial^2 \mathbf{A}}{\partial t^2} = \mu \mathbf{J}    (5.285)

that is, inhomogeneous wave equations.

Is it then always possible to choose our potentials to satisfy (5.283)?
We note that, although the transformation (5.282) leaves B invariant, it
will not do so for E unless φ is also simultaneously changed. If, however,
φ is transformed as follows:

    \phi' = \phi - \frac{\partial \chi}{\partial t}    (5.286)

E and B will both be invariant under the simultaneous transformations
(5.282) and (5.286). Such a transformation is called a "gauge
transformation" on the potentials.
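The invariance of E and B under the simultaneous transformations (5.282) and (5.286) can be checked symbolically. The following is a minimal sketch using SymPy, with the potentials and the gauge function χ represented by arbitrary undefined functions; the helper names `grad`, `curl`, and `fields` are introduced here for illustration only.

```python
import sympy as sp

x, y, z, t = sp.symbols("x y z t", real=True)
X = (x, y, z)


def grad(f):
    return sp.Matrix([sp.diff(f, v) for v in X])


def curl(F):
    return sp.Matrix([
        sp.diff(F[2], y) - sp.diff(F[1], z),
        sp.diff(F[0], z) - sp.diff(F[2], x),
        sp.diff(F[1], x) - sp.diff(F[0], y),
    ])


def fields(A, phi):
    """E and B from the potentials, Eqs. (5.278) and (5.276)."""
    return -grad(phi) - sp.diff(A, t), curl(A)


# Arbitrary potentials and an arbitrary gauge function chi.
A = sp.Matrix([sp.Function(name)(x, y, z, t) for name in ("A_x", "A_y", "A_z")])
phi = sp.Function("phi")(x, y, z, t)
chi = sp.Function("chi")(x, y, z, t)

E, B = fields(A, phi)
E2, B2 = fields(A + grad(chi), phi - sp.diff(chi, t))   # (5.282) and (5.286)

dE = (E2 - E).applyfunc(sp.simplify)
dB = (B2 - B).applyfunc(sp.simplify)
print(dE.T, dB.T)
```

Both differences reduce to zero vectors because mixed partial derivatives of χ commute and the curl of a gradient vanishes.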

If, finally, the Lorentz condition (5.283) is to remain satisfied, that is,
if

    \nabla \cdot \mathbf{A}' + \frac{1}{u^2} \frac{\partial \phi'}{\partial t} = 0

is to follow from the corresponding equation in the unprimed quantities,
χ will have to satisfy the equation

    \nabla^2 \chi - \frac{1}{u^2} \frac{\partial^2 \chi}{\partial t^2} = 0    (5.287)

that is, χ will also have to satisfy the wave equation. This condition
defines the permissible gauge transformations which leave invariant the
wave equation for the potentials.

Equation (5.283) implies, of course, that to specify the fields in terms
of A and φ is still redundant, since φ and A are again related. If we
introduce the "Hertz vector" Π such that

    \mathbf{A} = \mu \frac{\partial \mathbf{\Pi}}{\partial t}    (5.288)

and

    \phi = -\frac{1}{\epsilon} \nabla \cdot \mathbf{\Pi}    (5.289)

equation (5.283) is satisfied automatically. If at the same time we
make use of the continuity equation (5.264) to express ρ and J in terms of
a "polarization vector" P,

    \rho = -\nabla \cdot \mathbf{P}    (5.290)

and

    \mathbf{J} = \frac{\partial \mathbf{P}}{\partial t}    (5.291)

substituting (5.288)-(5.291) into the wave equations (5.284) and (5.285)
gives

    \nabla \cdot \left\{ \left(-\nabla^2 + \frac{1}{u^2} \frac{\partial^2}{\partial t^2}\right) \mathbf{\Pi} - \mathbf{P} \right\} = 0    (5.292)

and

    \frac{\partial}{\partial t} \left\{ \left(-\nabla^2 + \frac{1}{u^2} \frac{\partial^2}{\partial t^2}\right) \mathbf{\Pi} - \mathbf{P} \right\} = 0    (5.293)

Thus, if we choose Π to satisfy the equation

    -\nabla^2 \mathbf{\Pi} + \frac{1}{u^2} \frac{\partial^2 \mathbf{\Pi}}{\partial t^2} = \mathbf{P}    (5.294)

it is necessary only to satisfy this equation to ensure, not only the
satisfaction of Maxwell's equations but also of the wave equations for φ
and A and the Lorentz condition. It is evident, then, that a single vector
is adequate to describe completely the electromagnetic field.
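That the Hertz-vector representation satisfies the Lorentz condition identically can be verified symbolically. The sketch below assumes the forms A = μ ∂Π/∂t and φ = -(1/ε) ∇·Π for (5.288) and (5.289), with Π an arbitrary vector function; the helper `div` is introduced for illustration.

```python
import sympy as sp

x, y, z, t = sp.symbols("x y z t", real=True)
eps, mu = sp.symbols("epsilon mu", positive=True)
X = (x, y, z)

# Arbitrary Hertz vector with components Pi_x, Pi_y, Pi_z.
Pi = sp.Matrix([sp.Function(f"Pi_{c}")(x, y, z, t) for c in "xyz"])


def div(F):
    return sum(sp.diff(F[i], X[i]) for i in range(3))


A = mu * sp.diff(Pi, t)      # assumed form of Eq. (5.288)
phi = -div(Pi) / eps         # assumed form of Eq. (5.289)
u2 = 1 / (eps * mu)          # u^2 = 1/(eps*mu)

# Left-hand side of the Lorentz condition (5.283).
lorentz_lhs = sp.simplify(div(A) + sp.diff(phi, t) / u2)
print(lorentz_lhs)
```

The expression vanishes for any Π, which is the sense in which (5.283) is "satisfied automatically."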

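Finally, let us express the fields in terms of Π. The electric field is

    \mathbf{E} = -\nabla \phi - \frac{\partial \mathbf{A}}{\partial t}
               = \frac{1}{\epsilon} \left[ \nabla(\nabla \cdot \mathbf{\Pi}) - \frac{1}{u^2} \frac{\partial^2 \mathbf{\Pi}}{\partial t^2} \right]

or, using (5.294),

    \mathbf{D} = \nabla \times (\nabla \times \mathbf{\Pi}) - \mathbf{P}    (5.295)

The magnetic field is

    \mathbf{H} = \frac{1}{\mu} \nabla \times \mathbf{A} = \frac{\partial}{\partial t} \nabla \times \mathbf{\Pi}    (5.296)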




20. “Spin” of Scalar and Vector 
Fields 


In this section, we develop some results concerning vector fields in 
general, with a view to the analysis of the electromagnetic field in terms 
of potentials. The ‘“‘spin” of such a field is defined, for the present heuris- 
tically, but in a manner convenient for use in connection with the quantum 
mechanics of fields, and it is related to the concept of “‘polarization’’. 

Consider first a scalar field f(r) = f(x, y, z) expressed as a function of
coordinates related to axes §. Let unit vectors along these axes be i, j,
and k. Consider now an infinitesimal rotation of the coordinates through
an angle δθ about the direction of the vector n, and designate the unit
vectors along the new axes as i', j', k'. Then we know that, to the first
order in δθ,

    \mathbf{i}' - \mathbf{i} = \delta\theta \, \mathbf{n} \times \mathbf{i}    (5.297)
    \mathbf{j}' - \mathbf{j} = \delta\theta \, \mathbf{n} \times \mathbf{j}    (5.298)
    \mathbf{k}' - \mathbf{k} = \delta\theta \, \mathbf{n} \times \mathbf{k}    (5.299)

Let us calculate next what happens to the coordinates of a fixed point
under this transformation. If the new coordinates are x', y', z', it follows
that

    x'\mathbf{i}' + y'\mathbf{j}' + z'\mathbf{k}' = x\mathbf{i} + y\mathbf{j} + z\mathbf{k}    (5.300)

Writing the function f in terms of the transformed coordinates it becomes
f'(x', y', z'). We now ask, how has the scalar field function changed under
the rotation? The rate of change is

    \frac{\partial f}{\partial \theta} = \lim_{\delta\theta \to 0} \frac{f' - f}{\delta\theta}    (5.301)

where f' and f are expressed in terms of the same coordinates. In other
words, we ask: What is the change in the function describing the field,
as a function of coordinates, when the coordinate axes are changed by a
rotation?

The problem is most simply solved by stating (5.300) in the form

    x(\theta)\mathbf{i}(\theta) + y(\theta)\mathbf{j}(\theta) + z(\theta)\mathbf{k}(\theta) = \text{constant}    (5.302)

The primed and unprimed quantities then refer to different values of θ.
Differentiating with respect to θ and using (5.297)-(5.299) in the form

    \frac{d\mathbf{i}}{d\theta} = \mathbf{n} \times \mathbf{i}, \text{ etc.}    (5.303)

we obtain

    \frac{d\mathbf{r}}{d\theta} + \mathbf{n} \times \mathbf{r} = 0    (5.304)

where

    \mathbf{r}(\theta) = x(\theta)\mathbf{i} + y(\theta)\mathbf{j} + z(\theta)\mathbf{k}    (5.305)

the axes being fixed.

But now, since the field at a fixed point is independent of the axes
used,

    f'(x', y', z') = f(x, y, z)

or

    f_\theta(x(\theta), y(\theta), z(\theta)) = \text{constant}    (5.306)

On differentiating with respect to θ we obtain

    \frac{\partial f}{\partial \theta} + \frac{d\mathbf{r}}{d\theta} \cdot \nabla f = 0    (5.307)

or

    \frac{\partial f}{\partial \theta} = \mathbf{n} \times \mathbf{r} \cdot \nabla f = \mathbf{n} \cdot \mathbf{r} \times \nabla f    (5.308)

For reasons which will become evident later, we introduce the "rotation
operator" -i(∂/∂θ) on f, so that

    -i \frac{\partial f}{\partial \theta} = \mathbf{n} \cdot \mathbf{L} f    (5.309)

where

    \mathbf{L} = -i \, \mathbf{r} \times \nabla    (5.310)

We shall see that this may be identified with the quantum "angular
momentum" for a scalar field.
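Relation (5.308) can be checked on a concrete example. The sketch below takes n along the z axis and a hypothetical test field f; it uses the fact, implied by (5.304) and (5.306), that the rotated field function is f_θ(r) = f(R(θ)r), where R(θ) rotates r through +θ about n.

```python
import sympy as sp

x, y, z, theta = sp.symbols("x y z theta", real=True)

# Hypothetical test field, chosen only for illustration.
f = x**2 * y + x * sp.sin(z)

# Rotation of r through +theta about the z axis.
xr = x * sp.cos(theta) - y * sp.sin(theta)
yr = x * sp.sin(theta) + y * sp.cos(theta)
f_theta = f.subs({x: xr, y: yr}, simultaneous=True)

# Left side of (5.308): rate of change of the field function at theta = 0.
lhs = sp.diff(f_theta, theta).subs(theta, 0)

# Right side of (5.308): n . (r x grad f) with n = k.
r = sp.Matrix([x, y, z])
grad_f = sp.Matrix([sp.diff(f, v) for v in (x, y, z)])
n = sp.Matrix([0, 0, 1])
rhs = n.dot(r.cross(grad_f))

residual = sp.simplify(lhs - rhs)
print(residual)
```

Multiplying either side by -i gives n·L f with L = -i r × ∇, as in (5.309) and (5.310).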

Let us now treat, in a similar way, the rotational properties of a
vector field A(r). In this case, the rotation of coordinates not only changes
A due to a change in coordinates, but also due to a change in its
components under rotation. By analogy with (5.306) we have

    \frac{d}{d\theta} \left[ A_{\theta x}(\mathbf{r}(\theta))\mathbf{i}(\theta) + A_{\theta y}(\mathbf{r}(\theta))\mathbf{j}(\theta) + A_{\theta z}(\mathbf{r}(\theta))\mathbf{k}(\theta) \right] = 0    (5.311)

Differentiating out we obtain

    \frac{\partial \mathbf{A}}{\partial \theta} + \left(\frac{d\mathbf{r}}{d\theta} \cdot \nabla\right)\mathbf{A} + \mathbf{n} \times \mathbf{A} = 0    (5.312)

Therefore,

    -i \frac{\partial \mathbf{A}}{\partial \theta} = (\mathbf{n} \cdot \mathbf{L})\mathbf{A} + i\mathbf{n} \times \mathbf{A}    (5.313)

The rotation operator now consists of two parts: n·L, as in the case of a
scalar field, and the operator i n×, which is known as the "spin operator."
If the components of A are represented as a column matrix, the spin
operator for the direction n becomes

    S_n = i \begin{pmatrix} 0 & -n_z & n_y \\ n_z & 0 & -n_x \\ -n_y & n_x & 0 \end{pmatrix}    (5.314)

This may also be written

    S_n = \mathbf{n} \cdot \mathbf{S}    (5.315)

where

    S_x = i \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix}    (5.316a)

    S_y = i \begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix}    (5.316b)

and

    S_z = i \begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}    (5.316c)

Consider

    S_n^3 \mathbf{A} = i^3 \, \mathbf{n} \times [\mathbf{n} \times (\mathbf{n} \times \mathbf{A})]
                     = -i^3 \, \mathbf{n} \times \mathbf{A}
                     = i\mathbf{n} \times \mathbf{A}
                     = S_n \mathbf{A}    (5.317)

If we then look for eigenvectors of S_n, i.e., vectors such that

    S_n \mathbf{A} = \lambda \mathbf{A}    (5.318)

the eigenvalues must satisfy the equation

    \lambda^3 = \lambda    (5.319)

so that

    \lambda = 0, \pm 1

For the case λ = 0,

    S_n \mathbf{A} = i\mathbf{n} \times \mathbf{A} = 0    (5.320)

so that the eigenvector is in the direction of n.
For λ = ±1, on the other hand

    i\mathbf{n} \times \mathbf{A} = \pm \mathbf{A}    (5.321)

In this case, it is clear that the vector must be complex, so we write it

    \mathbf{A} = \mathbf{A}_1 + i\mathbf{A}_2    (5.322)

where A_1 and A_2 are real. Then, equating real and imaginary parts in
(5.321),

    \mathbf{n} \times \mathbf{A}_1 = \pm \mathbf{A}_2    (5.323)

    -\mathbf{n} \times \mathbf{A}_2 = \pm \mathbf{A}_1    (5.324)

One of these equations is, of course, redundant, since they must be
consistent. The eigenvectors corresponding to λ = ±1 are, respectively, of
the form

    \mathbf{A} = \mathbf{A}_1 \pm i\mathbf{n} \times \mathbf{A}_1    (5.325)

where A_1 is an arbitrary vector perpendicular to n.

If, for instance, n is taken in the z direction and A_1 in the x direction,
these eigenvectors become

    \mathbf{A} = A_0 (\mathbf{i} \pm i\mathbf{j})

where A_0 is a scalar.

It is, alternatively, possible to express these eigenvectors in terms of
"polarization." We say that the vectors corresponding to λ = ±1 are
right (+) and left (-) circularly polarized with respect to n and that that
corresponding to λ = 0 is longitudinally polarized with respect to n.
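The properties of the spin matrices can be confirmed numerically. The following sketch builds S_x, S_y, S_z from the rule S_n A = i n × A of (5.316), then checks their commutators, the sum of their squares, the eigenvalues of S_z, and the circularly polarized eigenvectors for n along z. The helper `comm` is introduced here for illustration.

```python
import numpy as np

# Cartesian spin matrices defined by S_n A = i n x A, Eqs. (5.316a)-(5.316c).
Sx = 1j * np.array([[0, 0, 0], [0, 0, -1], [0, 1, 0]])
Sy = 1j * np.array([[0, 0, 1], [0, 0, 0], [-1, 0, 0]])
Sz = 1j * np.array([[0, -1, 0], [1, 0, 0], [0, 0, 0]])


def comm(a, b):
    """Commutator ab - ba."""
    return a @ b - b @ a


commutators_ok = (
    np.allclose(comm(Sx, Sy), 1j * Sz)
    and np.allclose(comm(Sy, Sz), 1j * Sx)
    and np.allclose(comm(Sz, Sx), 1j * Sy)
)
S_squared = Sx @ Sx + Sy @ Sy + Sz @ Sz

# S_z is Hermitian, so eigvalsh applies; eigenvalues should be -1, 0, +1.
eigenvalues = np.sort(np.linalg.eigvalsh(Sz))

# Eigenvectors A_0 (i +/- i j) for n along z, with A_0 = 1.
A_plus = np.array([1, 1j, 0])
A_minus = np.array([1, -1j, 0])

print(commutators_ok, np.allclose(S_squared, 2 * np.eye(3)), eigenvalues)
```

Since the sum of squares equals s(s + 1) = 2 times the unit matrix, the result corresponds to s = 1.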


Problem 5-21: Show that Maxwell's equations in
empty space may be written in the form

    -i S_k \frac{\partial F}{\partial x_k} = \frac{i}{c} \frac{\partial F}{\partial t}

where F is the column matrix E + iH and S_k are suitably defined matrices.
(CGS units have been used.)
Verify that

    S_1 S_2 - S_2 S_1 = i S_3
    S_2 S_3 - S_3 S_2 = i S_1
    S_3 S_1 - S_1 S_3 = i S_2

    S_1^2 + S_2^2 + S_3^2 = 2

Use this to verify that the "spin" of the electromagnetic field is 1.


21. Interaction of Fields and 
Particles and the Dynamics of 
the Electromagnetic Field 


The most direct physical approach to the dynamics of the electro- 
magnetic field is through its interaction with charged particles. This 
approach rests on the assumption of conservation laws for energy and 




momentum (or, in the relativistic case, on the equivalent hypothesis of 
covariance). Thus, if a charged particle gains or loses energy, it is assumed 
to be acquired by the field. In this way, the dynamical variables of fields 
may be identified. 

The fundamental empirical fact is that the force on a particle of 
charge e is 


    \mathbf{F} = e(\mathbf{E} + \mathbf{v} \times \mathbf{B})    (5.326)


If one is concerned with a continuous (or quasicontinuous) distribution 
of charges, it is convenient to define instead a “‘force density”’ 


    \mathbf{F}(\mathbf{r}) = \rho \mathbf{E} + \mathbf{J} \times \mathbf{B}    (5.327)


whose integral over a volume gives the force on that volume. 
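As a small numerical illustration of (5.326) and (5.327), the sketch below evaluates the force on a point charge and the force density for hypothetical field values (SI units assumed). The function names are introduced here for illustration.

```python
import numpy as np


def lorentz_force(e, E, v, B):
    """Force on a particle of charge e moving with velocity v, Eq. (5.326)."""
    E, v, B = (np.asarray(a, dtype=float) for a in (E, v, B))
    return e * (E + np.cross(v, B))


def force_density(rho, J, E, B):
    """Force per unit volume on a charge distribution, Eq. (5.327)."""
    J, E, B = (np.asarray(a, dtype=float) for a in (J, E, B))
    return rho * E + np.cross(J, B)


# Hypothetical values, chosen only to make the arithmetic easy to follow.
F = lorentz_force(1.0, [1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0])

# Purely magnetic force density: it is perpendicular to J, so it does no work.
J = np.array([0.0, 2.0, 0.0])
f_mag = force_density(0.0, J, [0.0, 0.0, 0.0], [0.0, 0.0, 1.0])

print(F, f_mag, np.dot(f_mag, J))
```

The vanishing scalar product of J × B with J is the reason the magnetic term drops out of energy balances for the particles.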

We may now use this force equation, in conjunction with Maxwell’s 
equations, to calculate the various dynamical variables of the field. To 
determine the energy density of the field, we use the relation 


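    \frac{d}{dt} \int (\mathcal{U}_p + \mathcal{U}_f) \, d^3r = -\int (\mathbf{P}_p + \mathbf{P}_f) \cdot \mathbf{n} \, dS    (5.328)

where 𝒰_p, 𝒰_f are the energy densities, respectively, of the particles and
the field and P_p, P_f are their energy fluxes, and the integrals are taken
over an arbitrary volume and its bounding surface. But

    \frac{d}{dt} \int \mathcal{U}_p \, d^3r = -\int \mathbf{P}_p \cdot \mathbf{n} \, dS + \int \mathbf{E} \cdot \mathbf{J} \, d^3r    (5.329)

The last term represents the rate at which the particles gain energy from
the field; the second term of (5.327) does not contribute because the
force is perpendicular to J. From (5.328) and (5.329) it follows that

    \frac{d}{dt} \int \mathcal{U}_f \, d^3r + \int \mathbf{P}_f \cdot \mathbf{n} \, dS = -\int \mathbf{E} \cdot \mathbf{J} \, d^3r    (5.330)

We may substitute from (3.17) in the last integral:

    \int \mathbf{E} \cdot \mathbf{J} \, d^3r = \int \mathbf{E} \cdot \left(\nabla \times \mathbf{H} - \frac{\partial \mathbf{D}}{\partial t}\right) d^3r
        = -\int \nabla \cdot (\mathbf{E} \times \mathbf{H}) \, d^3r + \int \mathbf{H} \cdot \nabla \times \mathbf{E} \, d^3r - \frac{1}{2} \frac{\partial}{\partial t} \int \mathbf{D} \cdot \mathbf{E} \, d^3r

where we have used the identity

    \nabla \cdot (\mathbf{E} \times \mathbf{H}) = -\mathbf{E} \cdot \nabla \times \mathbf{H} + \mathbf{H} \cdot \nabla \times \mathbf{E}

Substituting for ∇ × E from (3.16) and using Gauss' theorem to convert
the first integral on the right to a surface integral, we find that

    \int \mathbf{E} \cdot \mathbf{J} \, d^3r = -\int \mathbf{E} \times \mathbf{H} \cdot \mathbf{n} \, dS - \frac{1}{2} \frac{\partial}{\partial t} \int (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H}) \, d^3r    (5.331)

Finally, therefore,

    \frac{d}{dt} \int \mathcal{U}_f \, d^3r + \int \mathbf{P}_f \cdot \mathbf{n} \, dS = \frac{1}{2} \frac{\partial}{\partial t} \int (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H}) \, d^3r + \int \mathbf{E} \times \mathbf{H} \cdot \mathbf{n} \, dS    (5.332)

It is, therefore, reasonable to interpret the energy density of the field as

    \mathcal{U}_f = \frac{1}{2} (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H})

and the energy flux as

    \mathbf{P}_f = \mathbf{E} \times \mathbf{H}    (5.333)

The latter is known as the "Poynting vector."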
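The vector identity used in the derivation of the Poynting vector, ∇·(E × H) = -E·∇×H + H·∇×E, can be checked symbolically for arbitrary fields. This is a minimal SymPy sketch; the helpers `vec`, `div`, and `curl` are introduced here for illustration.

```python
import sympy as sp

x, y, z = sp.symbols("x y z", real=True)
X = (x, y, z)


def vec(name):
    """Vector field with arbitrary (undefined) component functions."""
    return sp.Matrix([sp.Function(f"{name}_{c}")(x, y, z) for c in "xyz"])


def div(F):
    return sum(sp.diff(F[i], X[i]) for i in range(3))


def curl(F):
    return sp.Matrix([
        sp.diff(F[2], y) - sp.diff(F[1], z),
        sp.diff(F[0], z) - sp.diff(F[2], x),
        sp.diff(F[1], x) - sp.diff(F[0], y),
    ])


E, H = vec("E"), vec("H")
lhs = div(E.cross(H))
rhs = -E.dot(curl(H)) + H.dot(curl(E))
residual = sp.expand(lhs - rhs)
print(residual)
```

A zero residual confirms the identity term by term, independently of any field equations.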

Similar arguments may be used for momentum density and flux.
Since momentum flux density is a tensor, it is convenient to use an index
notation. We call Q^{(p)} and Q^{(f)} the momentum densities and T_{ij}^{(p)},
T_{ij}^{(f)} the momentum flux tensors. Then, by conservation,

    \frac{d}{dt} \int (Q_i^{(p)} + Q_i^{(f)}) \, d^3r = -\int (T_{ij}^{(p)} + T_{ij}^{(f)}) n_j \, dS    (5.334)

where we have used on the right-hand side the summation convention of
tensor calculus, according to which we sum over the repeated index j.

Now, once again, we write down the momentum conservation law
for the particles in the presence of the field. The rate at which the
particles gain momentum from the field is given by the Lorentz force F, so
we have

    \frac{d}{dt} \int Q_i^{(p)} \, d^3r = -\int T_{ij}^{(p)} n_j \, dS + \int [\rho E_i + (\mathbf{J} \times \mathbf{B})_i] \, d^3r    (5.335)

Therefore,

    \frac{d}{dt} \int Q_i^{(f)} \, d^3r + \int T_{ij}^{(f)} n_j \, dS = -\int [\rho E_i + (\mathbf{J} \times \mathbf{B})_i] \, d^3r    (5.336)


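We must now substitute for ρ and J and simplify the right-hand side:

    \rho \mathbf{E} + \mathbf{J} \times \mathbf{B} = \mathbf{E}(\nabla \cdot \mathbf{D}) + (\nabla \times \mathbf{H}) \times \mathbf{B} - \frac{\partial \mathbf{D}}{\partial t} \times \mathbf{B}
        = \mathbf{E}(\nabla \cdot \mathbf{D}) - \nabla(\tfrac{1}{2} \mathbf{B} \cdot \mathbf{H}) + (\mathbf{B} \cdot \nabla)\mathbf{H} - \frac{\partial \mathbf{D}}{\partial t} \times \mathbf{B}

Now

    \mathbf{D} \times (\nabla \times \mathbf{E}) = \nabla(\tfrac{1}{2} \mathbf{D} \cdot \mathbf{E}) - (\mathbf{D} \cdot \nabla)\mathbf{E}

and in terms of indices

    \partial_j (D_j E_i) = (\mathbf{D} \cdot \nabla) E_i + E_i (\nabla \cdot \mathbf{D})

Also

    (\mathbf{B} \cdot \nabla) H_i = \partial_j (H_i B_j)

since

    \nabla \cdot \mathbf{B} = 0

Therefore,

    [\rho \mathbf{E} + \mathbf{J} \times \mathbf{B}]_i = \partial_j (E_i D_j) - (\mathbf{D} \cdot \nabla) E_i - \partial_i (\tfrac{1}{2} \mathbf{B} \cdot \mathbf{H}) + \partial_j (H_i B_j) - \left(\frac{\partial \mathbf{D}}{\partial t} \times \mathbf{B}\right)_i
        = \partial_j (E_i D_j) + [\mathbf{D} \times (\nabla \times \mathbf{E})]_i - \partial_i (\tfrac{1}{2} \mathbf{D} \cdot \mathbf{E}) - \partial_i (\tfrac{1}{2} \mathbf{B} \cdot \mathbf{H}) + \partial_j (H_i B_j) - \left(\frac{\partial \mathbf{D}}{\partial t} \times \mathbf{B}\right)_i

Substituting ∇ × E = -(∂B/∂t) in the second term and rearranging terms,
we find

    [\rho \mathbf{E} + \mathbf{J} \times \mathbf{B}]_i = \partial_j (E_i D_j + H_i B_j) - \partial_i (\tfrac{1}{2} \mathbf{D} \cdot \mathbf{E} + \tfrac{1}{2} \mathbf{B} \cdot \mathbf{H}) - \frac{\partial}{\partial t} (\mathbf{D} \times \mathbf{B})_i    (5.337)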
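Substituting in (5.336) leads to the equation

    \frac{d}{dt} \int Q_i^{(f)} \, d^3r + \int T_{ij}^{(f)} n_j \, dS = \frac{\partial}{\partial t} \int (\mathbf{D} \times \mathbf{B})_i \, d^3r
        + \int \partial_j \left[ \tfrac{1}{2} (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H}) \delta_{ij} - (E_i D_j + H_i B_j) \right] d^3r

The last term may be converted into the surface integral

    \int \left[ \tfrac{1}{2} (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H}) \delta_{ij} - E_i D_j - H_i B_j \right] n_j \, dS

We may, therefore, identify the momentum density and flux, respectively,
as

    \mathbf{Q}^{(f)} = \mathbf{D} \times \mathbf{B}    (5.338)

and

    T_{ij}^{(f)} = \tfrac{1}{2} (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H}) \delta_{ij} - E_i D_j - H_i B_j    (5.339)

It should be noted that

    \mathbf{Q}^{(f)} = \epsilon\mu \mathbf{P}_f    (5.340)

The above results are valid only if ε and μ are constant. If they are
variable, there are additional terms involving their variations.


Problem 5-22: Calculate the formulas for energy
and momentum density and flux in the case of varying ε and μ.


Finally, an expression may be derived from (5.338) for the angular
momentum of the field:

    \mathbf{L}^{(f)} = \int \mathbf{r} \times \mathbf{Q}^{(f)} \, d^3r
                     = \int \mathbf{r} \times (\mathbf{D} \times \mathbf{B}) \, d^3r    (5.341)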



Problem 5-23: Verify directly the constancy of the 
total angular momentum, that of the field plus that of the particles, in a 
given volume. 


22. Lagrangian and Hamiltonian 
of the Field 


We show first that the Lagrangian density of the free electromagnetic
field may be written in the form

    \mathcal{L} = \frac{1}{2} \left( \epsilon E^2 - \frac{1}{\mu} B^2 \right) - \frac{1}{2\mu} \left( \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} \right)^2    (5.342)

This is to be considered as a function of the potentials and their space and
time derivatives, which are treated for the present as independent. The
Lagrangian is then the space integral of 𝓛:

    L = \int \mathcal{L}\left( A_{i,j}, \phi_{,i}, \frac{\partial A_i}{\partial t}, \frac{\partial \phi}{\partial t} \right) d^3r    (5.343)

and the action principle, from which the equations of motion may be
derived, is

    \delta \int_{t_1}^{t_2} L \, dt = \delta \int_{t_1}^{t_2} dt \int \mathcal{L} \, d^3r = 0    (5.344)

Proceeding now exactly as in the derivation of Eq. (5.41) for sound
waves, but making independent variations of A_i and φ, it is found that the
equations of motion are

    \frac{\delta L}{\delta A_i} = 0    (5.345)

and

    \frac{\delta L}{\delta \phi} = 0    (5.346)

Particular derivatives which are needed and may be readily verified are

    \frac{\partial E_i}{\partial \dot{A}_j} = -\delta_{ij}, \qquad \frac{\partial E_i}{\partial \phi_{,j}} = -\delta_{ij}

    \frac{\partial B_j}{\partial A_{k,l}} = \epsilon_{jlk}, \qquad \frac{\partial}{\partial A_{k,l}} \nabla \cdot \mathbf{A} = \delta_{kl}    (5.347)

where ε_{jlk} is the Kronecker permutation symbol whose value is zero
unless j, l, and k are all different, 1 if they constitute an even permutation
of 1, 2, 3, and -1 if they constitute an odd permutation, and ",i" signifies
differentiation with respect to x_i.




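Equations (5.345) now become

    -\frac{\partial}{\partial t} (\epsilon E_i) + \frac{\partial}{\partial x_j} \left( \frac{1}{\mu} \epsilon_{ijk} B_k \right) - \frac{1}{\mu} \frac{\partial}{\partial x_i} \left( \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} \right) = 0

or

    -\epsilon \frac{\partial \mathbf{E}}{\partial t} + \frac{1}{\mu} \nabla \times \mathbf{B} - \frac{1}{\mu} \nabla(\nabla \cdot \mathbf{A}) - \epsilon \frac{\partial}{\partial t} \nabla \phi = 0    (5.348)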


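Writing E and B in terms of the potentials, this reduces to

    -\nabla^2 \mathbf{A} + \frac{1}{u^2} \frac{\partial^2 \mathbf{A}}{\partial t^2} = 0    (5.349)

Similarly, (5.346) becomes

    \frac{\partial}{\partial x_i} (\epsilon E_i) + \epsilon \frac{\partial}{\partial t} \left( \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} \right) = 0

or

    -\nabla^2 \phi + \frac{1}{u^2} \frac{\partial^2 \phi}{\partial t^2} = 0    (5.350)

If there are sources specified by charge and current densities ρ and
J, respectively, the Lagrangian density may be modified by adding to 𝓛
the term

    \mathcal{L}' = -\rho \phi - \mathbf{J} \cdot \mathbf{A}    (5.351)

This adds μJ on the right-hand side of (5.349), and ρ/ε to the right-hand
side of (5.350), as required.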
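We may now define generalized momenta in the usual way:

    \pi_i = \frac{\partial \mathcal{L}}{\partial \dot{A}_i} = -\epsilon E_i    (5.352)

    \pi_0 = \frac{\partial \mathcal{L}}{\partial \dot{\phi}} = -\epsilon \left( \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} \right)    (5.353)

We see, then, that the Lorentz condition may be formulated as

    \pi_0 = 0    (5.354)

From the generalized momenta, it is possible to calculate the Hamiltonian
density:

    \mathcal{H} = \pi_i \dot{A}_i + \pi_0 \dot{\phi} - \mathcal{L}    (5.355)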
Eliminating the time derivatives we obtain 
] 


ss Ne oat oh th 2 
H=7 0 aa PPG a +pot+J-A 


—7:Vb—wWaryV - A+ u?7,2/2€ (5.356) 




From this the equations of motion may be calculated, and are found to be 


» OH ae. ue 
. OH 
Ts age ee (5.357b) 
- CO 1 
A= 77 7 Vo (5.357c) 
and 
OH | 
aaa Vas J-ZV xX (V X A) + u°V 275 (5.357d) 


Equations (5.357a) and (5.357c) are definitions of π_0 and π respectively,
while (5.357b) and (5.357d) are true dynamical equations.

There is an additional restriction upon these equations, arising
from the fact that the continuity equation

    \nabla \cdot \mathbf{J} + \frac{\partial \rho}{\partial t} = 0    (5.358)

must be satisfied. From Eqs. (5.357b) and (5.357d) it follows that

    \frac{\partial^2 \pi_0}{\partial t^2} - u^2 \nabla^2 \pi_0 = 0    (5.359)

We note that Eqs. (5.357) lead to Maxwell's equations only if π_0 = 0.
Thus, from all the solutions arising from our Lagrangian, physical reality
is represented only by those which satisfy this condition. Since this is
only one of the possible solutions of (5.359), the restriction is a real one.
A sufficient hypothesis would be that at some initial time π_0 and ∂π_0/∂t
were zero everywhere. It would then follow that they were zero at all
times.


23. Normal Modes of the 
Electromagnetic Field 


Consider the problem of expanding the electromagnetic field in
normal modes, so as to introduce normal coordinates. If we consider first
the vector field A, we note that any vector field can be broken into two
parts:

    \mathbf{A} = \mathbf{A}_t + \mathbf{A}_l    (5.360)

one of which is solenoidal:

    \nabla \cdot \mathbf{A}_t = 0    (5.361)

and the other irrotational:

    \nabla \times \mathbf{A}_l = 0    (5.362)




[This is true for the source-free field. If there is a part of the field
satisfying both (5.361) and (5.362) it can be written as ∇f, where f
satisfies ∇²f = 0. The only source-free solution of this equation is f = 0.]

Since an irrotational field can be expressed in terms of a potential,
it is possible to write

    \mathbf{A}_l = \nabla \psi    (5.363)

If we define orthogonality of vector fields A_i and A_j by the condition

    \int \mathbf{A}_i^* \cdot \mathbf{A}_j \, d^3r = 0, \qquad i \neq j    (5.364)

the integral being taken over the region of definition of the problem, then
A_t and A_l as defined above are mutually orthogonal. For

    \int \mathbf{A}_l^* \cdot \mathbf{A}_t \, d^3r = \int \nabla \psi^* \cdot \mathbf{A}_t \, d^3r
        = -\int \psi^* \nabla \cdot \mathbf{A}_t \, d^3r
        = 0    (5.365)

where we have used Gauss' theorem in the second line. We note that this
orthogonality condition does not imply that the vectors A_t, A_l are
orthogonal at each point: only the integral of the scalar product is
involved in the definition.

Since normal modes are source-free solutions of the wave equations
for which the time dependence is e^{-iωt}, they satisfy the equations

    -\nabla^2 \boldsymbol{\xi} - \epsilon\mu\omega^2 \boldsymbol{\xi} = 0    (5.366)

or

    -\nabla^2 v - \epsilon\mu\omega^2 v = 0    (5.367)

For brevity we write

    \epsilon\mu\omega^2 = k^2    (5.368)

Let the two independent sets of mutually orthogonal solenoidal vector
solutions satisfying the boundary conditions of the problem be ξ_{nα}, where
α = 1, 2 and n runs from 1 to ∞ [the third independent vector solution is
of the form ∇v_n, where v_n are solutions of (5.367)]. By their definition

    \nabla \cdot \boldsymbol{\xi}_{n\alpha} = 0    (5.369)

The ξ's may also be taken to be real. The eigenvalues are taken to be
k_{0n}^2 for v_n and k_{nα}^2 for ξ_{nα}, and both v_n and the ξ_{nα}'s are
assumed to be normalized.


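We now write

    \mathbf{A} = \sum_{n\alpha} a_{n\alpha}(t) \boldsymbol{\xi}_{n\alpha}(\mathbf{r}) + \sum_n b_n(t) \nabla v_n(\mathbf{r})    (5.370)

    \phi = \sum_n c_n(t) v_n(\mathbf{r})    (5.371)

We then calculate in terms of these normal modes the Lagrangian

    L = \int \mathcal{L} \, d^3r    (5.372)

where 𝓛 is given by (5.342). Now

    \mathbf{E} = -\sum_n c_n \nabla v_n - \sum_{n\alpha} \dot{a}_{n\alpha} \boldsymbol{\xi}_{n\alpha} - \sum_n \dot{b}_n \nabla v_n

By orthogonality, and using (5.365),

    \frac{1}{2} \epsilon \int E^2 \, d^3r = \frac{1}{2} \epsilon \left[ \sum_{n\alpha} \dot{a}_{n\alpha}^2 + \sum_n k_{0n}^2 (c_n + \dot{b}_n)^2 \right]    (5.373)

since

    \int \nabla v_n \cdot \nabla v_m \, d^3r = -\int v_n \nabla^2 v_m \, d^3r
        = k_{0m}^2 \int v_n v_m \, d^3r
        = 0 \quad \text{if } n \neq m

The integral of the magnetic term is

    -\frac{1}{2\mu} \sum a_{n\alpha} a_{n'\alpha'} \int (\nabla \times \boldsymbol{\xi}_{n\alpha}) \cdot (\nabla \times \boldsymbol{\xi}_{n'\alpha'}) \, d^3r    (5.374)

To evaluate this, consider first the vector identity

    \nabla \cdot (\mathbf{a} \times \mathbf{b}) = -\mathbf{a} \cdot \nabla \times \mathbf{b} + \mathbf{b} \cdot \nabla \times \mathbf{a}

Putting b = ∇ × a', this becomes

    \nabla \cdot [\mathbf{a} \times (\nabla \times \mathbf{a}')] = -\mathbf{a} \cdot \nabla \times (\nabla \times \mathbf{a}') + (\nabla \times \mathbf{a}) \cdot (\nabla \times \mathbf{a}')

Now put a = ξ_{nα}, a' = ξ_{n'α'}, and integrate. The integral of the
divergence on the left-hand side will be zero, since it may be transformed
into a surface integral at infinity. Also

    \nabla \times (\nabla \times \mathbf{a}') = \nabla \times (\nabla \times \boldsymbol{\xi}_{n'\alpha'})
        = \nabla(\nabla \cdot \boldsymbol{\xi}_{n'\alpha'}) - \nabla^2 \boldsymbol{\xi}_{n'\alpha'}
        = k_{n'\alpha'}^2 \boldsymbol{\xi}_{n'\alpha'}

Therefore (5.374) is

    -\frac{1}{2\mu} \int B^2 \, d^3r = -\frac{1}{2\mu} \sum_{n\alpha} k_{n\alpha}^2 a_{n\alpha}^2    (5.375)

where we have made use of the normalization of the ξ's.
Finally, look at

    \frac{1}{2\mu} \int \left( \nabla \cdot \mathbf{A} + \frac{1}{u^2} \frac{\partial \phi}{\partial t} \right)^2 d^3r
        = \frac{1}{2\mu} \int \left[ \sum_n \left( -k_{0n}^2 b_n + \frac{1}{u^2} \dot{c}_n \right) v_n \right]^2 d^3r
        = \frac{1}{2\mu} \sum_n \left( \frac{1}{u^2} \dot{c}_n - k_{0n}^2 b_n \right)^2    (5.376)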



We thus obtain the Lagrangian in terms of the amplitudes of the normal
modes and their derivatives:

    L = \frac{1}{2} \epsilon \sum_{n\alpha} \dot{a}_{n\alpha}^2 - \frac{1}{2\mu} \sum_{n\alpha} k_{n\alpha}^2 a_{n\alpha}^2 + L_0    (5.377)

where

    L_0 = \frac{1}{2} \epsilon \sum_n k_{0n}^2 (c_n + \dot{b}_n)^2 - \frac{1}{2\mu} \sum_n \left( \frac{1}{u^2} \dot{c}_n - k_{0n}^2 b_n \right)^2    (5.378)

L_0 combines the scalar and longitudinal vector fields. The first two terms
of (5.377) describe the "radiation" field. Generalized momenta can be
defined; the problem of the dynamics of this field is that of a collection of
oscillators, one for each normal mode.
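The oscillator character of a single radiation mode can be made explicit by deriving its equation of motion from the first two terms of (5.377). The sketch below does this with SymPy's `euler_equations`; the single-mode Lagrangian `L_mode` and the symbol names are introduced here for illustration, with the mode indices dropped.

```python
import sympy as sp
from sympy.calculus.euler import euler_equations

t = sp.symbols("t", real=True)
eps, mu, k = sp.symbols("epsilon mu k", positive=True)
a = sp.Function("a")(t)

# One transverse mode of (5.377): L = (eps/2) adot^2 - k^2 a^2 / (2 mu).
L_mode = sp.Rational(1, 2) * eps * sp.diff(a, t) ** 2 - k**2 * a**2 / (2 * mu)

# Euler-Lagrange equation for the amplitude a(t).
eq = euler_equations(L_mode, [a], [t])[0]

# Expected oscillator form: a'' + (u k)^2 a = 0 with u = 1/sqrt(eps*mu).
u = 1 / sp.sqrt(eps * mu)
oscillator = sp.diff(a, t, 2) + (u * k) ** 2 * a
residual = sp.simplify(eq.lhs + eps * oscillator)

print(eq)
print(residual)
```

The amplitude therefore oscillates with circular frequency ω = uk, the same relation between frequency and wave number that holds for the plane-wave solutions.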
If we use (5.378) to calculate the equations of motion for b_n and c_n we
find

    \ddot{b}_n + u^2 k_{0n}^2 b_n = 0    (5.379)

and

    \ddot{c}_n + u^2 k_{0n}^2 c_n = 0    (5.380)


Since the only physically acceptable solutions are those which satisfy the 
Lorentz condition, however, these equations are not to be solved inde- 
pendently, but are related by the equation 


c, +b, =0 (5.381) 


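It is straightforward to generalize the foregoing formalism to take
account of that part of the Lagrangian density describing the interaction
of the field with particles [Eq. (5.351)]

    \mathcal{L}' = -\rho \phi - \mathbf{J} \cdot \mathbf{A}

ρ and J must be expanded in the same way as φ and A:

    \rho = \sum_n \rho_n(t) v_n    (5.382)

and

    \mathbf{J} = \sum_n J_{ln} \nabla v_n + \sum_{n\alpha} J_{n\alpha} \boldsymbol{\xi}_{n\alpha}    (5.383)

where the subscript "l" designates the longitudinal current. The
coefficients may be calculated by making use of the orthonormality of the
v's and ξ's:

    \rho_n(t) = \int \rho \, v_n \, d^3r    (5.384)

    J_{ln}(t) = -\frac{1}{k_{0n}^2} \int (\nabla \cdot \mathbf{J}) v_n \, d^3r
              = \frac{1}{k_{0n}^2} \int \frac{\partial \rho}{\partial t} v_n \, d^3r    (5.385)

and

    J_{n\alpha}(t) = \int \mathbf{J} \cdot \boldsymbol{\xi}_{n\alpha} \, d^3r    (5.386)

Note that

    \dot{\rho}_n - k_{0n}^2 J_{ln} = 0    (5.387)

The additional terms in the Lagrangian are then

    L' = \int \mathcal{L}' \, d^3r
       = -\left[ \sum_n \rho_n c_n + \sum_n k_{0n}^2 J_{ln} b_n + \sum_{n\alpha} J_{n\alpha} a_{n\alpha} \right]    (5.388)

The additional terms in the Hamiltonian are given by the square bracket,
without the minus sign. Equations (5.379) and (5.380) are now modified to

    \ddot{b}_n + u^2 k_{0n}^2 b_n = \frac{1}{\epsilon} J_{ln} = \frac{1}{\epsilon k_{0n}^2} \dot{\rho}_n    (5.389)

and

    \ddot{c}_n + u^2 k_{0n}^2 c_n = u^2 \rho_n / \epsilon    (5.390)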


Problem 5-24: Verify that the Lorentz condition
(5.381) is consistent with Eqs. (5.389) and (5.390).


24. Coulomb Gauge and Pure 
Radiation Field 


The irrotational part of the vector potential, written in the last section
as ∇ψ, can obviously be eliminated by the gauge transformation (5.282);
one need only choose χ = -ψ. The resulting gauge, in which

    \nabla \cdot \mathbf{A} = 0,    (5.391)

is known as the "Coulomb gauge." The resulting vector potential then
describes the pure radiation field. Since, from (5.283), the scalar potential
φ must then not depend on the time, it satisfies the equation

    -\nabla^2 \phi = \rho / \epsilon    (5.392)

and describes the "longitudinal" part of the field. This is, therefore, a very
convenient gauge for the treatment of radiation problems.


25. Plane Waves 


It is easily verified that 


    \phi = f(\mathbf{n} \cdot \mathbf{r} - ut)    (5.393)

and

    \mathbf{A} = \mathbf{F}(\mathbf{n} \cdot \mathbf{r} - ut)    (5.394)

are solutions of the source-free wave equation, n being an arbitrary unit
vector and f and F arbitrary functions. These are plane wave solutions in
the sense that the potentials are constant over the planes n·r - ut =
constant; furthermore, the plane of constant phase, which is perpendicular
to n, moves along n with a constant speed ("phase velocity") u.
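The statement that an arbitrary profile f(n·r - ut) solves the source-free wave equation can be checked symbolically. The sketch below uses SymPy with an undefined function f and one particular unit vector n = (2/3, 1/3, 2/3), chosen here only because its components are rational; any unit vector would serve.

```python
import sympy as sp

x, y, z, t = sp.symbols("x y z t", real=True)
u = sp.symbols("u", positive=True)

# A unit vector with rational components: (2/3)^2 + (1/3)^2 + (2/3)^2 = 1.
n = sp.Matrix([sp.Rational(2, 3), sp.Rational(1, 3), sp.Rational(2, 3)])
s = n.dot(sp.Matrix([x, y, z])) - u * t   # the argument n.r - ut

f = sp.Function("f")                       # arbitrary wave profile
phi = f(s)

# Source-free wave equation in the form -del^2 phi + (1/u^2) d^2 phi/dt^2 = 0.
laplacian = sum(sp.diff(phi, v, 2) for v in (x, y, z))
residual = sp.simplify(-laplacian + sp.diff(phi, t, 2) / u**2)
print(residual)
```

The cancellation depends only on n being a unit vector, which is why the direction of propagation is otherwise arbitrary.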

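If these solutions are to satisfy the Lorentz condition (5.283), the
relation between f and F must be

    f' = u \, \mathbf{n} \cdot \mathbf{F}'    (5.395)

where the primes designate derivatives of the functions with respect to
their arguments. Since constants are of no importance in the potentials,
we may take f = u n·F.

It is now easily verified that the fields are given by

    \mathbf{E} = -\nabla \phi - \frac{\partial \mathbf{A}}{\partial t}
               = -u \mathbf{n} (\mathbf{n} \cdot \mathbf{F}') + u \mathbf{F}'
               = -u \, \mathbf{n} \times (\mathbf{n} \times \mathbf{F}')    (5.396)

and

    \mathbf{B} = \nabla \times \mathbf{A}
               = \mathbf{n} \times \mathbf{F}'    (5.397)

The Poynting vector, describing the energy flux, is

    \mathbf{P}_f = \mathbf{E} \times \mathbf{H}
                 = -\frac{u}{\mu} [\mathbf{n} \times (\mathbf{n} \times \mathbf{F}')] \times (\mathbf{n} \times \mathbf{F}')
                 = \frac{u}{\mu} \mathbf{n} (\mathbf{n} \times \mathbf{F}')^2    (5.398)

The energy density is given by

    \mathcal{U} = \frac{1}{2} (\mathbf{D} \cdot \mathbf{E} + \mathbf{B} \cdot \mathbf{H})
                = \frac{1}{2} \epsilon u^2 (\mathbf{n} \times \mathbf{F}')^2 + \frac{1}{2\mu} (\mathbf{n} \times \mathbf{F}')^2
                = \frac{1}{\mu} (\mathbf{n} \times \mathbf{F}')^2    (5.399)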


From these results we make the following observations: 




(a) The electric field E and the magnetic induction B are at right 
angles to each other and to the vector n. 

(b) The energy flux in the wave is in the direction of the vector n, that 
is, perpendicular to the planes of constant phase. 


(c) We may define a velocity of energy flow u (a vector) by the equation

    \mathbf{u} \, \mathcal{U} = \mathbf{P}_f    (5.400)

It follows from (5.398) and (5.399) that

    \mathbf{u} = u \mathbf{n}    (5.401)

Thus, the velocity of propagation of energy is equal to the phase
velocity.

(d) The energy density in the electric field and that in the magnetic 
field are equal. 
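Observations (a) to (d) can be confirmed numerically from (5.396) to (5.399). The sketch below evaluates the fields at a single point for a hypothetical value of F' and an assumed medium with vacuum constants (SI units); all numerical values are illustrative only.

```python
import numpy as np

# Assumed medium: vacuum values in SI units (not specified in the text).
eps, mu = 8.8541878128e-12, 1.25663706212e-6
u = 1.0 / np.sqrt(eps * mu)

n = np.array([1.0, 2.0, 2.0]) / 3.0     # unit propagation vector
Fp = np.array([0.3, -1.2, 0.7])         # hypothetical value of F' at one point

B = np.cross(n, Fp)                      # Eq. (5.397)
E = -u * np.cross(n, B)                  # Eq. (5.396): -u n x (n x F')
H, D = B / mu, eps * E

P = np.cross(E, H)                       # Poynting vector, Eq. (5.398)
U_electric = 0.5 * np.dot(D, E)
U_magnetic = 0.5 * np.dot(B, H)
U = U_electric + U_magnetic              # energy density, Eq. (5.399)

scale = np.linalg.norm(E) * np.linalg.norm(B)
print(np.dot(E, B) / scale, np.dot(E, n), np.dot(B, n))
print(P / U, u * n)
print(U_electric, U_magnetic)
```

The ratio P/U reproduces the vector u n of (5.401), and the two contributions to the energy density agree to rounding error.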

Let us consider the question of the polarization of plane waves. It
was shown above [Eq. (5.321)] that for waves polarized about the
direction n

    i\mathbf{n} \times \mathbf{A} = \pm \mathbf{A}

the + and - signs corresponding to right and left circular polarization,
respectively. If F_± represent polarized waves,

    i\mathbf{n} \times \mathbf{F}_\pm = \pm \mathbf{F}_\pm    (5.402)

Substituting in (5.396) and (5.397) we obtain

    \mathbf{E}_\pm = u \mathbf{F}'_\pm    (5.403)

    \mathbf{B}_\pm = \mp i \mathbf{F}'_\pm    (5.404)

We must not, however, apply (5.398) and (5.399) to these solutions, since
these latter equations are valid only for F real, whereas the polarized
solutions are, by (5.325), complex.


Problem 5-25: Writing F'_± = (i ± ij) λ e^{iθ}, show that
the real fields are

    \mathbf{E}_\pm = u\lambda (\cos\theta \, \mathbf{i} \mp \sin\theta \, \mathbf{j})

    \mathbf{B}_\pm = \lambda (\pm \sin\theta \, \mathbf{i} + \cos\theta \, \mathbf{j})

Hence calculate the energy flux and density.


Problem 5-26: Earlier in the chapter we discussed 
“guided” sound waves. Similar problems may be studied for electro- 
magnetic waves. It is found that in guides of uniform cross section there 
exist transverse electric modes (no electric field along the guide) and trans- 
verse magnetic modes (no magnetic field along the guide). 




(a) Calculate the electromagnetic fields in the transverse electric 
(TE) and transverse magnetic (TM) modes, in a rectangular guide of 
dimensions a Xb. (The walls of the guide are assumed perfectly con- 
ducting.) 

(b) What is the maximum wavelength which can be propagated in 
each case? 


Problem 5-27: Solve the corresponding problem for 
a guide of circular cross section. 


Problem 5-28: A problem on spherical waves. Find
the electromagnetic waves in a spherical resonator excited by an
oscillating dipole of moment P and circular frequency ω located at its center.


26. Plane Waves as Normal 
Modes 


If the above functions of the form f(n·r - ut) are expressed as a
Fourier integral:

    f(\mathbf{n} \cdot \mathbf{r} - ut) = \int F(k) \, e^{ik(\mathbf{n} \cdot \mathbf{r} - ut)} \, dk    (5.405)

where

    F(k) = \frac{1}{2\pi} \int f(s) \, e^{-iks} \, ds    (5.406)

we see that these solutions may be expressed in terms of the functions
e^{i(k·r - ωt)} where

    \mathbf{k} = k\mathbf{n}    (5.407)

and

    \omega = ku    (5.408)


In fact, the general solution may be so expressed since this is merely 
its expression as a three-dimensional Fourier transform. The normal 
modes are now characterized by the vector k. The theory of the previous 
section may be applied if we make the identification 


3 
v,(r) > (=) eitk-ror) (5.409) 


3 
Una(r) > (5) Cio € (kro) (5.410) 


The fact that the ξ_{nα}'s satisfy (5.369) implies that

    \mathbf{k} \cdot \mathbf{e}_{\mathbf{k}\alpha} = 0    (5.411)

so that the vectors e_{kα} are transverse (orthogonal) to the propagation
vector k. The quantities k_{0n}^2, k_{nα}^2 are now simply k². The vectors
e_{kα} must also be mutually orthogonal, that is,

    \mathbf{e}_{\mathbf{k}\alpha} \cdot \mathbf{e}_{\mathbf{k}\alpha'} = 0 \qquad (\alpha \neq \alpha')    (5.412)


27. Radiation from Given 
Sources: Multipole Fields 


Equations (5.284) and (5.285) relate the fields φ and A to their
sources, the charge density ρ, and current density J. The solutions may
be written down by analogy with that for sound waves [Eqs. (5.68) and
(5.72)]:

    \phi = \frac{1}{4\pi\epsilon_0} \int \frac{\rho(\mathbf{r}', t - R/c)}{R} \, d^3r'    (5.413)

and

    \mathbf{A} = \frac{\mu_0}{4\pi} \int \frac{\mathbf{J}(\mathbf{r}', t - R/c)}{R} \, d^3r'    (5.414)

where

    R = |\mathbf{r} - \mathbf{r}'|


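It is possible, without difficulty, to generalize these equations to take
account of a source of a different character, namely, a time-dependent
external magnetization. The intrinsic magnetization is in general defined
by

    \mathbf{M} = \mathbf{B} - \mu_0 \mathbf{H}    (5.415)

so that

    \mathbf{B} = \mu_0 \mathbf{H} + \mathbf{M}

This intrinsic magnetization is, for isotropic media, proportional to the
magnetic field, the proportionality factor being μ_0 χ, where χ is known as
the magnetic susceptibility. Thus

    \mathbf{B} = \mu_0 (1 + \chi) \mathbf{H}    (5.416)

The external magnetization M_e must be added to the M above. Thus, the
equation

    \mathbf{B} = \nabla \times \mathbf{A}

must be replaced by

    \mathbf{B} = \nabla \times \mathbf{A} - \mathbf{M}_e

If this is now substituted into the equation

    \nabla \times \mathbf{B} = \mu_0 \mathbf{J} + \epsilon_0 \mu_0 \frac{\partial \mathbf{E}}{\partial t}

and B, E are expressed in terms of potentials by the above and the
equation (5.278), we are led to the equation

    \nabla(\nabla \cdot \mathbf{A}) - \nabla^2 \mathbf{A} + \frac{1}{c^2} \left( \nabla \frac{\partial \phi}{\partial t} + \frac{\partial^2 \mathbf{A}}{\partial t^2} \right) = \mu_0 \mathbf{J} + \nabla \times \mathbf{M}_e    (5.417)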




Therefore, the external magnetization may be taken into account throughout
by adding to the current a "magnetization current" (1/μ_0) ∇ × M_e.

We treat, in particular, the problem of radiation in empty space 
from sources with a given frequency: 


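$$\rho(\mathbf{r},t) = \rho_0(\mathbf{r})\,e^{-i\omega t} \tag{5.418}$$

$$\mathbf{J}(\mathbf{r},t) = \mathbf{J}_0(\mathbf{r})\,e^{-i\omega t} \tag{5.419}$$

where, by virtue of the equation of continuity,

$$\nabla\cdot\mathbf{J}_0 - i\omega\rho_0 = 0 \tag{5.420}$$

The potentials then become

$$\phi = \frac{1}{4\pi\epsilon_0}\int \rho_0(\mathbf{r}')\,g(\mathbf{r}-\mathbf{r}')\,d^3r'\,e^{-i\omega t} \tag{5.421}$$

$$\mathbf{A} = \frac{\mu_0}{4\pi}\int \mathbf{J}_0(\mathbf{r}')\,g(\mathbf{r}-\mathbf{r}')\,d^3r'\,e^{-i\omega t} \tag{5.422}$$

where

$$g(R) = \frac{e^{i(\omega/c)R}}{R} \tag{5.423}$$

is the spherical "Green's function" of the radiation field. We note that

$$\nabla^2 g = -\frac{\omega^2}{c^2}\,g \tag{5.424}$$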
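As a quick numerical illustration of (5.424), the following sketch checks that $g(R) = e^{ikR}/R$ with $k = \omega/c$ satisfies $\nabla^2 g = -k^2 g$ away from $R = 0$. It uses the radial form of the Laplacian, $(1/R)\,d^2(Rg)/dR^2$, and central differences; the values of `k`, `R` and the step `h` are arbitrary choices made only for this example.

```python
import cmath

def g(R, k):
    """Spherical Green's function g(R) = exp(i k R) / R, with k = omega / c."""
    return cmath.exp(1j * k * R) / R

def radial_laplacian(f, R, h=1e-4):
    """Laplacian of a spherically symmetric f(R): (1/R) d^2(R f)/dR^2,
    approximated by a central second difference."""
    u = lambda x: x * f(x)
    return (u(R + h) - 2.0 * u(R) + u(R - h)) / (h * h * R)

k = 2.0   # illustrative wavenumber (assumption)
R = 1.5   # any point away from the singularity at R = 0
lhs = radial_laplacian(lambda x: g(x, k), R)
rhs = -k**2 * g(R, k)
print(abs(lhs - rhs))  # small: limited only by the finite-difference error
```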


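We now show how to represent the fields in terms of a vector potential
alone. To begin, we note the identity

$$(\mathbf{J}_0\times\nabla')\times\nabla' g = -\mathbf{J}_0\nabla'^2 g + (\mathbf{J}_0\cdot\nabla')\nabla' g$$

so that, using (5.424),

$$\mathbf{J}_0\,g = \frac{c^2}{\omega^2}\left[(\mathbf{J}_0\times\nabla')\times\nabla' g - (\mathbf{J}_0\cdot\nabla')\nabla' g\right] \tag{5.425}$$

Here $\nabla'$ represents the gradient taken with respect to the coordinates $\mathbf{r}'$.
Substituting this in $\mathbf{A}$ above, we get

$$\mathbf{A} = \frac{1}{4\pi\epsilon_0\omega^2}\int\left[(\mathbf{J}_0\times\nabla')\times\nabla' g - (\mathbf{J}_0\cdot\nabla')\nabla' g\right]d^3r'\,e^{-i\omega t}$$

But

$$\nabla' g = -\nabla g \tag{5.426}$$

so it is possible to write

$$\mathbf{A} = (\mathbf{A}_1 + \mathbf{A}_2)\,e^{-i\omega t} \tag{5.427}$$

where

$$\mathbf{A}_1 = \frac{1}{4\pi\epsilon_0\omega^2}\,\nabla\times\int(\mathbf{J}_0\times\nabla')\,g\,d^3r' \tag{5.428}$$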




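and

$$\begin{aligned}
\mathbf{A}_2 &= -\frac{1}{4\pi\epsilon_0\omega^2}\int(\mathbf{J}_0\cdot\nabla')\nabla' g\,d^3r' \\
&= -\frac{1}{4\pi\epsilon_0\omega^2}\,\nabla\int(\nabla'\cdot\mathbf{J}_0)\,g\,d^3r' \\
&= -\frac{i}{4\pi\epsilon_0\omega}\,\nabla\int\rho_0\,g\,d^3r'
\end{aligned} \tag{5.429}$$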


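Clearly, $\mathbf{A}_2$ contributes nothing to the magnetic field. Also, $\mathbf{A}_2$,
along with $\phi$, contributes nothing to $\mathbf{E}$. For

$$-\nabla\phi - \frac{\partial}{\partial t}\left(\mathbf{A}_2 e^{-i\omega t}\right) = \left[-\frac{1}{4\pi\epsilon_0}\nabla\int\rho_0\,g\,d^3r' + \frac{1}{4\pi\epsilon_0}\nabla\int\rho_0\,g\,d^3r'\right]e^{-i\omega t} = 0$$

Thus, $\mathbf{A}_1$ alone completely describes the electromagnetic field of the
sources. By virtue of (5.426), it may equally well be written as

$$\mathbf{A}_1 = \frac{1}{4\pi\epsilon_0\omega^2}\,\nabla\times\left[\nabla\times\int\mathbf{J}_0\,g\,d^3r'\right] \tag{5.430}$$

Having already shown that $\mathbf{A}_1$ and $\mathbf{A}_2$ are orthogonal, we now show
that $\mathbf{A}_1$ itself may be broken into two orthogonal parts. To do so, we first
observe that if $\Phi'(\mathbf{r}, t)$ is a solution of the wave equation, so is

$$\mathbf{A}_1' = \mathbf{L}\Phi' = i\nabla\times(\mathbf{r}\,\Phi') \tag{5.431}$$

where $\mathbf{L}$ is the operator $-i\,\mathbf{r}\times\nabla$ defined in (5.310). In quantum mechanics,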


this is the angular momentum operator. Its most important property 
here is that it commutes with the Laplacian operator; that is, 


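$$\nabla^2\mathbf{L}\Phi' = \mathbf{L}\nabla^2\Phi' \tag{5.432}$$

From this result, which is simple to verify directly, it follows that

$$\nabla^2(\mathbf{L}\Phi') = \mathbf{L}\,\frac{1}{u^2}\frac{\partial^2\Phi'}{\partial t^2} = \frac{1}{u^2}\frac{\partial^2}{\partial t^2}(\mathbf{L}\Phi') \tag{5.433}$$

If the time dependence $e^{-i\omega t}$ is substituted,

$$\Phi' = \Phi'_1\,e^{-i\omega t} \tag{5.434}$$

the equation becomes

$$\nabla^2(\mathbf{L}\Phi') + k^2\,\mathbf{L}\Phi' = 0 \tag{5.435}$$

where $k^2 = \omega^2/u^2 = \epsilon\mu\omega^2$ was defined in (5.368). Being a curl, $\mathbf{A}_1'$ is
also orthogonal to $\mathbf{A}_2$.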


Finally, we show that

$$\mathbf{A}_1'' = \frac{i}{k}\,\nabla\times\mathbf{L}\Phi'' \tag{5.436}$$

is also a solution of the wave equation and is orthogonal to both $\mathbf{A}_1'$
and $\mathbf{A}_2$. Its orthogonality to $\mathbf{A}_2$ follows again from the fact that it is a curl.
That it is a solution of the wave equation follows from the fact that $\nabla^2$
commutes with both $\nabla$ and $\mathbf{L}$; consequently

$$\nabla^2(\nabla\times\mathbf{L})\Phi'' = (\nabla\times\mathbf{L})\nabla^2\Phi''$$


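The proof that $\mathbf{A}_1''$ is orthogonal to $\mathbf{A}_1'$ [in the sense of (5.364)] makes
use of the fact that the operators $i\nabla$ and $\mathbf{L}$ are Hermitian, that is, satisfy
the relations

$$\int f_1^*\,i\nabla f_2\,d^3r = \int f_2\,(i\nabla f_1)^*\,d^3r \tag{5.437}$$

$$\int f_1^*\,\mathbf{L}f_2\,d^3r = \int f_2\,(\mathbf{L}f_1)^*\,d^3r \tag{5.438}$$

The first results from a simple integration by parts (we assume, as always,
that $f_1$ and $f_2$ vanish sufficiently rapidly at infinity that the surface integral
$\oint f_1^* f_2\,\mathbf{n}\,dS$ is zero there). As for the second, it is sufficient to prove it for
one component of $\mathbf{L}$; let us choose $L_z$. Now if we use spherical polar
coordinates, we see that, since $x = r\sin\theta\cos\phi$, $y = r\sin\theta\sin\phi$, and
$z = r\cos\theta$,

$$\begin{aligned}
-i\frac{\partial f}{\partial\phi} &= -i\left(\frac{\partial f}{\partial x}\frac{\partial x}{\partial\phi} + \frac{\partial f}{\partial y}\frac{\partial y}{\partial\phi}\right) \\
&= -i\left(x\frac{\partial f}{\partial y} - y\frac{\partial f}{\partial x}\right) \\
&= L_z f
\end{aligned} \tag{5.439}$$

Thus,

$$\begin{aligned}
\int f_1^*\,L_z f_2\,d^3r &= -i\int f_1^*\,\frac{\partial f_2}{\partial\phi}\,d^3r \\
&= i\int f_2\,\frac{\partial f_1^*}{\partial\phi}\,d^3r \\
&= \int f_2\,(L_z f_1)^*\,d^3r
\end{aligned} \tag{5.440}$$

The transition from the first to the second line is accomplished by partial
integration with respect to $\phi$, making use of the fact that $f_1$, $f_2$ are
single-valued.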

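Having thus shown that the operators $i\nabla$ and $\mathbf{L}$ are Hermitian, we
prove that $\mathbf{A}_1'$, $\mathbf{A}_1''$ are orthogonal as follows:

$$\begin{aligned}
\int \mathbf{A}_1'^{*}\cdot\mathbf{A}_1''\,d^3r &= \int(\mathbf{L}\Phi')^*\cdot\frac{i}{k}\,\nabla\times\mathbf{L}\Phi''\,d^3r \\
&= \frac{1}{k}\int\Phi'^{*}\,\mathbf{L}\cdot i\nabla\times\mathbf{L}\Phi''\,d^3r
\end{aligned} \tag{5.441}$$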




But 
L-VxXL=(LXV)-L 
=[(rxV)xV]-L 
=(r-V)V-L—-r:-V2L 
= 0 
since both V -Landr-L=0. 

Thus, we have succeeded in proving that $\mathbf{A}_1'$ and $\mathbf{A}_1''$ are two solutions
of the wave equation which are orthogonal to each other and to $\mathbf{A}_2$. Before
proceeding with a multipole expansion, let us note the nature of the
fields generated by $\mathbf{A}_1'$, $\mathbf{A}_1''$, respectively. The fields generated by $\mathbf{A}_1'$ are

$$\mathbf{E}' = i\omega\,\mathbf{L}\Phi' \tag{5.442}$$

$$\mathbf{H}' = \frac{1}{\mu_0}\,\nabla\times\mathbf{L}\Phi' \tag{5.443}$$

$\mathbf{E}'$ has no radial component, since $\mathbf{r}\cdot\mathbf{L} = 0$. The radial component of $\mathbf{H}'$ is
on the other hand

$$\frac{\mathbf{r}}{r}\cdot\mathbf{H}' = \frac{1}{\mu_0 r}\,\mathbf{r}\cdot\nabla\times\mathbf{L}\Phi' = \frac{i}{\mu_0 r}\,L^2\Phi' \tag{5.444}$$

$$\mathbf{P} = \operatorname{Re}\mathbf{E}\times\operatorname{Re}\mathbf{H} \tag{5.445}$$


since the real parts represent the physical fields. The real part of the 
product of two complex numbers Z; and Z, is 


2( zz + 2%) = Re (zz) 


Thus the energy flux contains both radial and transverse parts. The radial
part, which describes the radiation of energy from the source, is


1 p=— Reior : [(L®’) x (VXLO’)*] 
r Mor 


—~ © ReiL®’ -rx (VxXL®’)* 
Mol 
—— © ReiL®’ - (r- V)LO’* 
[Mol 


_ a. Re iL®’ ‘Loo (5.446) 




Finally, we can calculate the angular momentum of the radiation; this is 


= aa rxP (5.447) 


= w ° , , 
hoc? Re ir X [L®’ x (VX L®’)*] 


=—". Re L®’ (—ir -V XL®’)* 
Moc 


= ad Re L®’ (L20’)* (5.448) 


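[The momentum density was shown in (5.338) and (5.340) to be $(1/c^2)\mathbf{P}$.]
Let us turn next to $\mathbf{A}_1''$. The corresponding fields are

$$\mathbf{E}'' = -\frac{\omega}{k}\,\nabla\times\mathbf{L}\Phi'' \tag{5.449}$$

and

$$\mathbf{H}'' = \frac{i}{\mu_0 k}\,\nabla\times(\nabla\times\mathbf{L})\Phi'' = \frac{ik}{\mu_0}\,\mathbf{L}\Phi'' \tag{5.450}$$

It is now the magnetic field which has no radial component: the radiation
is designated "electric" radiation.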

The energy flux in the radial direction, as well as the angular momen- 
tum, are given by exactly the same formula as in the case of magnetic 
radiation. 

We know that the potential (5.430) can be written as the sum of
terms of the form (5.431) and (5.436). The problem remains, however, to
find the functions $\Phi'$, $\Phi''$. We do this in the following way: we expand the
$\Phi$'s in spherical harmonics, and use the orthogonality of the terms in the
expansion to determine the coefficients.

Let us first demonstrate this orthogonality. We write

$$\Phi' = \sum_{l,m} z_{lm}(kr)\,Y_{lm}(\theta,\phi) \tag{5.451}$$

where $Y_{lm}$ is a spherical harmonic. (It is convenient to write $m$ as a
subscript rather than a superscript from this point forward.) We now show
that

$$\text{(a)}\quad \int(\mathbf{L}Y_{l'm'})^*\cdot\mathbf{L}Y_{lm}\,d\Omega = l(l+1)\,\delta_{ll'}\delta_{mm'} \tag{5.452}$$

and that

$$\text{(b)}\quad \int\left(i\nabla\times\mathbf{L}\,z_{l'm'}(kr)Y_{l'm'}\right)^*\cdot i\nabla\times\mathbf{L}\,z_{lm}(kr)Y_{lm}\,d\Omega = l(l+1)\,k^2\,z^*_{l'm'}(kr)\,z_{lm}(kr)\,\delta_{ll'}\delta_{mm'} \tag{5.453}$$

The $z$'s could be ignored in case (a) because $\mathbf{L}z_{lm}(kr) = 0$. The integrals
are over solid angle.


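To prove (a), we first make use of the Hermitian property of the
operator $\mathbf{L}$ to write the integral as

$$\int Y^*_{l'm'}\,L^2\,Y_{lm}\,d\Omega$$

Now

$$L^2 = -(\mathbf{r}\times\nabla)\cdot(\mathbf{r}\times\nabla) = -\mathbf{r}\cdot\nabla\times(\mathbf{r}\times\nabla)$$

In index notation, we can write this

$$\begin{aligned}
L^2 &= -\left(x_i\frac{\partial}{\partial x_j}\,x_i\frac{\partial}{\partial x_j} - x_i\frac{\partial}{\partial x_j}\,x_j\frac{\partial}{\partial x_i}\right) \\
&= -\left[r^2\nabla^2 + \mathbf{r}\cdot\nabla - 3\,\mathbf{r}\cdot\nabla - (\mathbf{r}\cdot\nabla)^2 + \mathbf{r}\cdot\nabla\right] \\
&= -r^2\nabla^2 + r\frac{\partial}{\partial r}\left(r\frac{\partial}{\partial r}\right) + r\frac{\partial}{\partial r} \\
&= -r^2\nabla^2 + r^2\frac{\partial^2}{\partial r^2} + 2r\frac{\partial}{\partial r} \\
&= -\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\phi^2}\right]
\end{aligned}$$

from (3.28), $\theta$ and $\phi$ being polar coordinates. But from (3.45) and (3.62) it
follows that

$$L^2 Y_{lm} = l(l+1)\,Y_{lm} \tag{5.454}$$

Our integral is then equal to

$$l(l+1)\,\delta_{ll'}\delta_{mm'} \tag{5.455}$$

as we wished to prove.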
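The eigenvalue relation (5.454) can be illustrated numerically for the axially symmetric case $m = 0$, where $Y_{l0}$ is proportional to the Legendre polynomial $P_l(\cos\theta)$ and only the $\theta$ part of the angular operator acts. The sketch below applies that operator by nested central differences; the test angle and step size are arbitrary choices for the example, and the normalization of $Y_{l0}$ is dropped because it cancels in the ratio.

```python
import math

def L2_axial(f, theta, h=1e-4):
    """Apply L^2 = -(1/sin t) d/dt (sin t d/dt) to an m = 0 function f(theta),
    using central differences for both derivatives."""
    flux = lambda t: math.sin(t) * (f(t + h) - f(t - h)) / (2.0 * h)
    return -(flux(theta + h) - flux(theta - h)) / (2.0 * h * math.sin(theta))

# Legendre polynomials P_l(cos theta), proportional to Y_l0
legendre = {
    1: lambda t: math.cos(t),
    2: lambda t: 0.5 * (3.0 * math.cos(t) ** 2 - 1.0),
    3: lambda t: 0.5 * (5.0 * math.cos(t) ** 3 - 3.0 * math.cos(t)),
}

theta = 0.4  # arbitrary angle at which none of the P_l above vanishes
ratios = {l: L2_axial(P, theta) / P(theta) for l, P in legendre.items()}
for l, value in ratios.items():
    print(l, value, l * (l + 1))  # the ratio should be close to l(l+1)
```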


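Turning to (b), and making use first of the Hermitian character of $i\nabla$
and then that of $\mathbf{L}$, we obtain

$$\begin{aligned}
&\int\left(\mathbf{L}\,z_{l'm'}(kr)Y_{l'm'}\right)^*\cdot\nabla\times\left(\nabla\times\mathbf{L}\,z_{lm}Y_{lm}\right)d\Omega \\
&\qquad= -\int\left(\mathbf{L}\,z_{l'm'}Y_{l'm'}\right)^*\cdot\nabla^2\,\mathbf{L}\,z_{lm}Y_{lm}\,d\Omega \\
&\qquad= k^2\,z^*_{l'm'}z_{lm}\int Y^*_{l'm'}\,L^2\,Y_{lm}\,d\Omega \\
&\qquad= k^2\,l(l+1)\,z^*_{l'm'}z_{lm}\,\delta_{ll'}\delta_{mm'}
\end{aligned}$$

as required.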
We are now in a position to complete the evaluation of the potential
in terms of an expansion in harmonics; that is, a multipole expansion. We
start from the fact that

$$\mathbf{A}_1 = \frac{1}{4\pi\epsilon_0\omega^2}\,\nabla\times\left[\nabla\times\int\mathbf{J}_0\,g\,d^3r'\right] = \mathbf{L}\sum_{l,m} z'_{lm}(kr)\,Y_{lm}(\Omega) + \frac{i}{k}\,\nabla\times\mathbf{L}\sum_{l,m} z''_{lm}(kr)\,Y_{lm}(\Omega) \tag{5.456}$$

where $z'_{lm}$, $z''_{lm}$ are to be determined. We first multiply on both sides of
this equation by $(\mathbf{L}Y_{lm})^*$ and integrate over solid angle. This gives

$$\frac{1}{4\pi\epsilon_0\omega^2}\int(\mathbf{L}Y_{lm})^*\cdot\nabla\times\left[\nabla\times\int\mathbf{J}_0\,g\,d^3r'\right]d\Omega = l(l+1)\,z'_{lm}(kr) \tag{5.457}$$

We now use the expansion for $g$ given in (5.135),

$$g(\mathbf{r}-\mathbf{r}') = 4\pi i k\sum_{l,m} j_l(kr')\,h_l^{(1)}(kr)\,Y_{lm}(\Omega)\,Y^*_{lm}(\Omega') \tag{5.458}$$

In using this expansion, we use primes to designate coordinates within
the source, and unprimed symbols to designate the coordinates of the
(outside) point at which the field is being calculated. Thus $r' < r$.
Equation (5.457) may now be written
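The expansion (5.458) can be checked numerically. The sketch below is only an illustration under stated assumptions: it uses the addition theorem $\sum_m Y_{lm}(\Omega)Y^*_{lm}(\Omega') = \frac{2l+1}{4\pi}P_l(\cos\gamma)$, with $\gamma$ the angle between the two directions, so that the sum reduces to $ik\sum_l (2l+1)\,j_l(kr')\,h_l^{(1)}(kr)\,P_l(\cos\gamma)$. The helper names (`sph_j`, `sph_y_list`, `legendre_list`, `g_series`) and the sample values are choices made for this example, not notation from the text.

```python
import cmath
import math

def sph_j(l, x, terms=40):
    """Spherical Bessel function j_l(x) from its power series
    j_l(x) = x^l * sum_s (-x^2/2)^s / (s! (2l+2s+1)!!)."""
    term = x ** l
    for odd in range(1, 2 * l + 2, 2):   # divide by (2l+1)!!
        term /= odd
    total = 0.0
    for s in range(terms):
        total += term
        term *= -(x * x / 2.0) / ((s + 1) * (2 * l + 2 * s + 3))
    return total

def sph_y_list(lmax, x):
    """Spherical Neumann functions y_0..y_lmax by upward recurrence."""
    ys = [-math.cos(x) / x, -math.cos(x) / x ** 2 - math.sin(x) / x]
    for l in range(1, lmax):
        ys.append((2 * l + 1) / x * ys[l] - ys[l - 1])
    return ys[: lmax + 1]

def legendre_list(lmax, c):
    """Legendre polynomials P_0(c)..P_lmax(c) by the standard recurrence."""
    ps = [1.0, c]
    for l in range(1, lmax):
        ps.append(((2 * l + 1) * c * ps[l] - l * ps[l - 1]) / (l + 1))
    return ps[: lmax + 1]

def g_series(k, r, r_src, cos_gamma, lmax=30):
    """Partial sum of the multipole expansion of exp(ikR)/R, valid for r_src < r."""
    ys = sph_y_list(lmax, k * r)
    ps = legendre_list(lmax, cos_gamma)
    total = 0.0 + 0.0j
    for l in range(lmax + 1):
        h1 = sph_j(l, k * r) + 1j * ys[l]          # h_l^(1) = j_l + i y_l
        total += (2 * l + 1) * sph_j(l, k * r_src) * h1 * ps[l]
    return 1j * k * total

k, r, r_src, cos_gamma = 1.0, 3.0, 0.5, 0.3       # illustrative values
R = math.sqrt(r * r + r_src * r_src - 2.0 * r * r_src * cos_gamma)
g_exact = cmath.exp(1j * k * R) / R
g_sum = g_series(k, r, r_src, cos_gamma)
print(abs(g_sum - g_exact))
```

The terms fall off roughly like $(r'/r)^l$, so a modest `lmax` already suffices when the field point lies well outside the source.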


| 


Ew 


Zim(kr) =a | Yi,L - VX |v | Joeer’ | dQ 
K+ } 7™ WSO | oe 
_ Ko | * 7! ’ 
=ty | vial [a L' gd’ | dO 


ip 
= rs D Y%, | | v -(r' Xx Jed | dQ (5.459) 


In going from the second to the third line, we have used the fact, which is 
easily verified, that 
Lg=L’g (5.460) 


Substituting for g from (5.458) we find that 


Zim(kr) = wey h{(kr) | V' or’ XJuidkr’ YEO’) Br’ (5.461) 


We may proceed similarly to find Z,,,(kr). We multiply (5.456) by 
(1/k)V X LY#, and integrate over solid angle. Proceeding on the right- 
hand side as in the derivation of (5.453), we obtain the result that 


: Siete Seen a (a 
Zim(kr) = eo? KI+ 1) | rau X LYi,)* «VX (v | Jog@r } dQ, 


ee ee oe 
= Sat | LY vx( | Jeger’) do 


=o | Yin | | Jo: V’ Liga | dO, (5.462) 


The inner integral may be transformed into a more convenient form: 




J Jo V' XL gd'r! =i f Jy: [er +t 8" + VY] gr’ 


= ik? [r+ Sogdr' +o f ptr’: V’) gd?’ 
(5.463) 


To obtain the last term we have integrated by parts and used the conti- 
nuity equation. Thus, 


5 — Mo * | 7 ane Sr’ 
Zim(kr) K+ 5 | Yi, GE Jog d*r 
+2 | p(l +r’ +’) gar’ | dO (5.464) 
Substituting again for g this takes the form 


Zink) = petits hi Ukr | {pol +e" V) 


+E «Joh kr inl) oe (5.465) 


where $c$, the velocity of light, is equal to $1/\sqrt{\epsilon_0\mu_0}$ and $k = \omega/c$.
Let us now introduce 


fies ~ae Wr! X Joker’) Yin(O") ar’ (5.466) 


and 


ou 7 , ik t ° v , ? 
din = TET) | {po(1 +r -V ar Soh jkr \Yin(Q’) dr’ (5.467) 


Problem 5-29: Show that, if there is also a source 
consisting of a magnetic moment, it contributes to d;,, an additional term 


“eal {V! > Mo(1 +4! = V!)— Her’ + Mo} kr’ )Yim(Q') dr’ 


and to d;, a term 


~ Wis N) ay | V! = (re! X Mo) jkr’ )¥ in") dr" 


If we now use (5.431), (5.436), and (5.451), we can write the magnetic 
and electric radiation, respectively, in the following form: 


Magnetic

$$\mathbf{A}_1' = \mathbf{L}\Phi'$$

where

$$\Phi' = i\sum_{l,m} d'_{lm}\,h_l^{(1)}(kr)\,Y_{lm}(\Omega) \tag{5.468}$$

Electric

$$\mathbf{A}_1'' = \frac{i}{k}\,\nabla\times\mathbf{L}\Phi''$$

$$\Phi'' = i\sum_{l,m} d''_{lm}\,h_l^{(1)}(kr)\,Y_{lm}(\Omega) \tag{5.469}$$

The fields are of course given by (5.442) and (5.443) for the magnetic case
and (5.449) and (5.450) for the electric. The terms of a given $l$ in each case
describe $2^l$-pole radiation.

To calculate the intensity of radiation, we use the forms of these
functions in the asymptotic region $kr \gg 1$. There

$$\Phi' \simeq \frac{e^{ikr}}{kr}\sum_{l,m}(-i)^l\,d'_{lm}\,Y_{lm} \tag{5.470}$$

and

$$\Phi'' \simeq \frac{e^{ikr}}{kr}\sum_{l,m}(-i)^l\,d''_{lm}\,Y_{lm} \tag{5.471}$$


Substituting in (5.446) and integrating over a sphere of large radius 
surrounding the source, we get the integrated radial energy flux 


Ze 2— © 2 
Pf, fhe (1+ 1) |dun| ae L(1+1)|din| (5.472) 


l,m l,m 


where the appropriate $d_{lm}$ is used for each sort of radiation.
If the energy of the radiation per unit range of $r$ is $U$, then


U =P, => M+ Ven? (5.473) 


From (5.447), it is also straightforward to calculate the angular 
momentum of the radiation of each multipole order. The z component of 
angular momentum in a shell of unit thickness dr at distance r in the 
asymptotic region is 


1 
m, = — ml(1[+1)|din|? 5.474 
me (I+ 1) |din| ( ) 
Therefore, 
U 
m,=m— (5.475) 
@ 


In problems of radiation from atoms and nuclei, the size of the radiat- 
ing system is generally small compared to the wavelength of the emitted 
radiation; that is, ka < 1, where a is the radius of the emitter. By virtue of 




this fact, in calculating the radiation of a given multipole order from
(5.466) and (5.467) the spherical Bessel functions may be replaced to a
good approximation by their value for small $r'$:

$$j_l(kr') \simeq \frac{2^l\,l!}{(2l+1)!}\,(kr')^l \tag{5.476}$$
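To give a feeling for the quality of the small-argument replacement of $j_l(kr')$, the sketch below compares the leading term $2^l\,l!\,(kr')^l/(2l+1)!$ with a series evaluation of the spherical Bessel function. The function names and the sample value $kr' = 0.1$ are assumptions made for this illustration; the relative error is expected to be close to $(kr')^2/[2(2l+3)]$, which is the next term of the series.

```python
import math

def sph_j(l, x, terms=30):
    """Spherical Bessel function j_l(x) from its power series
    j_l(x) = x^l * sum_s (-x^2/2)^s / (s! (2l+2s+1)!!)."""
    term = x ** l
    for odd in range(1, 2 * l + 2, 2):   # divide by (2l+1)!!
        term /= odd
    total = 0.0
    for s in range(terms):
        total += term
        term *= -(x * x / 2.0) / ((s + 1) * (2 * l + 2 * s + 3))
    return total

def small_argument(l, x):
    """Leading small-x behaviour: 2^l l! x^l / (2l+1)!."""
    return 2 ** l * math.factorial(l) / math.factorial(2 * l + 1) * x ** l

x = 0.1  # plays the role of k r' for a source small compared with the wavelength
rel_err = {l: 1.0 - sph_j(l, x) / small_argument(l, x) for l in range(5)}
for l, err in rel_err.items():
    print(l, err, x * x / (2 * (2 * l + 3)))
```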
In the electric multipole expansion, we note also that 


Bieye “Jo _ ~ (ka)? 


so that the second term in (5.467) can be neglected. Then


ns 2'(1+1)! yo ' ' 
d' Ne ESI LACIESIY Po(r' )r"Yim(O") ar (5.477) 


Recalling the definition of the electric multipole moment in Eq. (3.142) of 


Chapter 3, 
m= VIF i| p(t’ )r"Yim(Q') dr’ 


qv =—_1 _, ote ta (5.478) 
™ MVan(21l+1) cf (20! 


The intensity of the corresponding electric $2^l$-pole radiation is then
(eD — Cc 2l+2 I(l+1) 2)'(1—1)!]? 2 
Pim Eo £(2) 9i+1 (21)! t |Qimn| (5.479) 


Similarly, in the case of magnetic $2^l$-pole radiation, we can define the
magnetic multipole moment


Om = req | Viet XJor'"Yin(O) de’ (5.480) 


in terms of which we may write


. ] w\!*} 1 wn 
 VanQl+h “ (2) Ti+) (pig —« 5.481) 


Therefore the intensity of magnetic $2^l$-pole radiation is


Pimav) — @\?!+2 l pone 2 = 
cp (®) (+1 (2i+1)| (2)! | [Qim|? (5.482) 


r 
dim = — 


Problem 5-30: Calculate the contribution to the 
multipole moments of various orders from an oscillating magnetic 
moment M = Moe ~%. 




28. Radiation from a Uniformly 
Charged Vibrating Liquid Drop 


The problem of radiation from a vibrating liquid drop carrying a 
charge is of interest because the liquid drop provides under certain 
conditions a model of an atomic nucleus. The model is a suitable one when 
the currents which are the source of the radiation are primarily collective, 
that is, are carried by the whole charge of the nucleus, rather than being 
associated with specific particles. The model we treat is a completely 
classical one. The charge is assumed to be uniformly distributed. The 
undisturbed shape of the drop is taken to be spherical. The dynamics of 
the drop are that of an incompressible fluid acted upon by its internal 
Coulomb forces and by surface tension. 

The Lagrangian therefore contains three terms: the kinetic energy,
the Coulomb energy, and the surface energy. If $\rho$ is the charge density and
$\mathbf{J}$ the current, there is a continuity equation

$$\nabla\cdot\mathbf{J} + \frac{\partial\rho}{\partial t} = 0 \tag{5.483}$$

Assuming the motion to be irrotational, $\mathbf{J}$ may be expressed in terms of a
velocity potential

$$\mathbf{J} = -\nabla\phi \tag{5.484}$$

Since the charge density is constant inside the drop,

$$\nabla^2\phi = 0 \tag{5.485}$$


If the surface of the drop is oscillating, the boundary conditions are
time-dependent, so we can write

$$\phi = \sum_{l,m} a_{lm}(t)\,r^l\,Y_{lm}(\theta,\varphi), \qquad a_{l,-m} = a^*_{lm} \tag{5.486}$$

We now assume the equilibrium radius to be $R_0$. The shape of the
surface may in general be expressed as

$$r = r_0(t) = R_0 + \sum_{l,m} c_{lm}(t)\,Y_{lm}(\theta,\varphi) \tag{5.487}$$

We assume that the distortion is small, so that the $c_{lm}$'s are small
compared with $R_0$.

The boundary conditions now enable us to express the $a_{lm}$'s in terms
of the $c_{lm}$'s. The kinetic energy can be calculated in terms of $\phi$ and
therefore of the $a$'s; the Coulomb and surface energies depend on the
droplet shape and therefore on the $c$'s.

The boundary condition is, to the first order in small quantities,

$$-\left(\frac{\partial\phi}{\partial r}\right)_{r=R_0} = \dot{r}_0 \tag{5.488}$$

On equating coefficients of each harmonic, this yields

$$\dot{c}_{lm} = -l\,R_0^{\,l-1}\,a_{lm} \tag{5.489}$$


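It should be noted that, for the volume to remain constant, $R_0$ must
itself be a function of $t$. If $R$ is the radius of the drop at rest, so that the
mass is

$$M = \tfrac{4}{3}\pi R^3\eta \tag{5.490}$$

where $\eta$ is the mass density, then

$$\begin{aligned}
\tfrac{4}{3}\pi R^3 &= \int d\Omega\int_0^{r_0} r^2\,dr \\
&= \int d\Omega\,\tfrac{1}{3}\Big[R_0 + \sum c_{lm}Y_{lm}\Big]^3 \\
&= \tfrac{4}{3}\pi R_0^3 + R_0\sum|c_{lm}|^2
\end{aligned} \tag{5.491}$$

if we omit as negligible terms cubic in the $c$'s. Therefore,

$$R_0(t) = R\left[1 - \frac{1}{4\pi R^2}\sum|c_{lm}|^2\right] \tag{5.492}$$

to the same approximation.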
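The volume condition can be illustrated numerically for a single axially symmetric distortion. The sketch below assumes a pure $l = 2$, $m = 0$ deformation $r_0(\theta) = R_0 + c\,Y_{20}(\theta)$ with illustrative values of $R_0$ and $c$; it integrates the volume by Simpson's rule, extracts the radius $R$ of the sphere of equal volume, and compares $R_0$ with $R\,[1 - c^2/(4\pi R^2)]$. The two should agree up to terms cubic in $c$.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def Y20(theta):
    """Normalized spherical harmonic Y_20 (real, independent of azimuth)."""
    return math.sqrt(5.0 / (16.0 * math.pi)) * (3.0 * math.cos(theta) ** 2 - 1.0)

R0, c = 1.0, 0.05                      # illustrative values (assumption)
r0 = lambda th: R0 + c * Y20(th)

# Volume = integral over solid angle of r0^3 / 3; the azimuthal integral gives 2*pi
volume = simpson(lambda th: (2.0 * math.pi / 3.0) * r0(th) ** 3 * math.sin(th),
                 0.0, math.pi)
R = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)     # radius of the drop at rest
R0_predicted = R * (1.0 - c ** 2 / (4.0 * math.pi * R ** 2))
print(R, R0_predicted, R0)
```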
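Let us now calculate the terms in the energy one at a time. The
kinetic energy is

$$T = \tfrac{1}{2}\eta\int(\nabla\phi)^2\,d^3r \tag{5.493}$$

where $\eta$ is the mass density. Since $\nabla^2\phi = 0$ inside the drop, $T$ may be
transformed to a surface integral

$$T = \tfrac{1}{2}\eta\oint\phi\,\nabla\phi\cdot\mathbf{n}\,dS \tag{5.494}$$

where the integral is over the boundary surface of the drop, and $\mathbf{n}$ is the
unit normal vector drawn outward. To a good approximation $\mathbf{n}\cdot\nabla\phi = \partial\phi/\partial r$, and

$$T = \tfrac{1}{2}\eta\sum_l l\,R_0^{2l+1}\sum_{m=-l}^{l}|a_{lm}|^2 \tag{5.495}$$

If we write

$$c_{lm} = \alpha_{lm} + i\beta_{lm} \tag{5.496}$$

where $\alpha_{lm}$, $\beta_{lm}$ are real and use (5.489), the kinetic energy becomes

$$\begin{aligned}
T &= \tfrac{1}{2}\eta\sum_{l=1}^{\infty} l\,R_0^{2l+1}\sum_{m=-l}^{l}\left[(\operatorname{Re}a_{lm})^2 + (\operatorname{Im}a_{lm})^2\right] \\
&= \tfrac{1}{2}\eta R_0^3\sum_{l,m}\frac{1}{l}\left(\dot{\alpha}_{lm}^2 + \dot{\beta}_{lm}^2\right) \\
&= \frac{3M}{8\pi}\sum_{l,m}\frac{1}{l}\left(\dot{\alpha}_{lm}^2 + \dot{\beta}_{lm}^2\right)
\end{aligned} \tag{5.497}$$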



where M is the total mass of the drop (replacing Ry by R here involves 
only omitting fourth powers of the distortion parameters). 

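Next, consider the surface energy. The elements of solid angle and
of surface area are connected by the relation

$$d\Omega = \frac{dS\,\cos\vartheta}{r^2}$$

or

$$dS = \frac{r^2\,d\Omega}{\cos\vartheta} \tag{5.498}$$

where $\cos\vartheta$ is the cosine of the angle between the radial direction and
the normal to the surface. To calculate $\cos\vartheta$, we note that the unit normal
vector to the surface

$$r = R_0 + \sum c_{lm}Y_{lm} = f(\theta,\varphi)$$

has components

$$\frac{1}{\sqrt{1 + \dfrac{1}{r^2}\left[\left(\dfrac{\partial f}{\partial\theta}\right)^2 + \dfrac{1}{\sin^2\theta}\left(\dfrac{\partial f}{\partial\varphi}\right)^2\right]}}\left(1,\; -\frac{1}{r}\frac{\partial f}{\partial\theta},\; -\frac{1}{r\sin\theta}\frac{\partial f}{\partial\varphi}\right) \tag{5.499}$$

Thus, to the usual approximation,

$$\frac{1}{\cos\vartheta} = \sqrt{1 + \frac{1}{r^2}\left[\left(\frac{\partial f}{\partial\theta}\right)^2 + \frac{1}{\sin^2\theta}\left(\frac{\partial f}{\partial\varphi}\right)^2\right]} \simeq 1 + \frac{1}{2R^2}\left[\left(\frac{\partial f}{\partial\theta}\right)^2 + \frac{1}{\sin^2\theta}\left(\frac{\partial f}{\partial\varphi}\right)^2\right] \tag{5.500}$$

The surface area is therefore

$$\begin{aligned}
A &= \int r_0^2\,d\Omega\left\{1 + \frac{1}{2r_0^2}\left[\left(\frac{\partial f}{\partial\theta}\right)^2 + \frac{1}{\sin^2\theta}\left(\frac{\partial f}{\partial\varphi}\right)^2\right]\right\} \\
&= 4\pi R_0^2 + \sum|c_{lm}|^2 + \frac{1}{2}\int d\Omega\left[\left(\frac{\partial f}{\partial\theta}\right)^2 + \frac{1}{\sin^2\theta}\left(\frac{\partial f}{\partial\varphi}\right)^2\right] \\
&= 4\pi R^2 - \sum|c_{lm}|^2 - \frac{1}{2}\int f\left[\frac{1}{\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\frac{\partial}{\partial\theta}\right) + \frac{1}{\sin^2\theta}\frac{\partial^2}{\partial\varphi^2}\right]f\,d\Omega
\end{aligned} \tag{5.501}$$

In obtaining the first two terms, use has been made of (5.492). The last
follows on integrating by parts the angular integral. But the operator in
the square bracket, operating on $Y_{lm}$, was shown in Chapter 3 to give
$-l(l+1)Y_{lm}$. Thus, substituting for $f$ and integrating, we find that the
surface area is

$$A = 4\pi R^2 + \sum_{l,m}|c_{lm}|^2\left[\tfrac{1}{2}l(l+1) - 1\right] \tag{5.502}$$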




It should be noted that the /= 1 contribution is zero. This is due to the 
fact that the dipole ‘‘distortion” is simply a displacement of the center of 
the drop. 
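The second-order area formula can be tested numerically in the same spirit. The sketch below assumes an axially symmetric $l = 2$, $m = 0$ distortion of amplitude $c$ (illustrative values), computes the exact area of the surface of revolution and the radius $R$ of the equal-volume sphere by Simpson's rule, and compares the excess area $A - 4\pi R^2$ with the prediction $c^2\,[\tfrac{1}{2}l(l+1) - 1] = 2c^2$. Agreement is expected only up to terms of higher order in $c$.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

NORM = math.sqrt(5.0 / (16.0 * math.pi))
Y20 = lambda th: NORM * (3.0 * math.cos(th) ** 2 - 1.0)
dY20 = lambda th: NORM * (-6.0 * math.cos(th) * math.sin(th))   # d(Y20)/d(theta)

R0, c, l = 1.0, 0.05, 2                 # illustrative values (assumption)
r = lambda th: R0 + c * Y20(th)
dr = lambda th: c * dY20(th)

# Area element of a surface of revolution r(theta): r * sqrt(r^2 + r'^2) sin(theta) dtheta dphi
area = simpson(lambda th: 2.0 * math.pi * r(th)
               * math.sqrt(r(th) ** 2 + dr(th) ** 2) * math.sin(th), 0.0, math.pi)
volume = simpson(lambda th: (2.0 * math.pi / 3.0) * r(th) ** 3 * math.sin(th),
                 0.0, math.pi)
R = (3.0 * volume / (4.0 * math.pi)) ** (1.0 / 3.0)

excess_numeric = area - 4.0 * math.pi * R ** 2
excess_formula = c ** 2 * (0.5 * l * (l + 1) - 1.0)
print(excess_numeric, excess_formula)
```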

Taking the surface tension to be $\sigma$, the surface energy associated
with the distortion is

$$V_{\text{surface}} = \sigma\sum_{l,m}\left(\alpha_{lm}^2 + \beta_{lm}^2\right)\left[\tfrac{1}{2}l(l+1) - 1\right] \tag{5.503}$$


Finally, we must calculate the dependence of the Coulomb energy on
the distortion. This calculation is simplified by noting that we may
consider the charge distribution to consist of a uniform spherical distribution
of radius $R_0$, plus a surface distribution of surface density
$\rho_0(r_0 - R_0) = \rho_0\sum c_{lm}Y_{lm}$, $\rho_0$ being the volume density. We then calculate separately
the self-energies of the volume and surface distributions, and their
interaction energy. It is, however, easily seen that this latter is zero, since the
potential of the volume distribution is spherically symmetrical, whereas
the surface charge distribution has no $l = 0$ component. Their integral is
therefore zero.

The volume contribution is

$$\rho_0^2\int_0^{R_0}\frac{1}{r}\left[\int_0^{r}4\pi r'^2\,dr'\right]4\pi r^2\,dr = \frac{16}{15}\pi^2R_0^5\rho_0^2$$

Now $\rho_0$ is given by

$$\rho_0 = \frac{Ze}{\tfrac{4}{3}\pi R^3}$$

where $Ze$ is the total charge, so the Coulomb energy of the volume
distribution is

$$\frac{3}{5}\frac{Z^2e^2R_0^5}{R^6} = \frac{3}{5}\frac{Z^2e^2}{R} - \frac{3Z^2e^2}{4\pi R^3}\sum|c_{lm}|^2 \tag{5.504}$$

on using (5.492). The first term is just the Coulomb energy of the
undistorted distribution and is therefore neglected.
As for the surface contribution, it is 


3 | Pe AD cim¥im(O) 5, crm Vem (YR a0 a0 


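But

$$\frac{1}{|\mathbf{r}-\mathbf{r}'|} = \frac{1}{R_0}\sum_l P_l(\cos\Theta)$$

where $\Theta$ is the angle between $\Omega$ and $\Omega'$ and by (3.136)

$$P_l(\cos\Theta) = \frac{4\pi}{2l+1}\sum_m Y_{lm}(\Omega)\,Y^*_{lm}(\Omega')$$

Consequently the Coulomb energy of the surface distribution is

$$4\pi\rho_0^2R_0^3\sum_{l,m}\frac{1}{2l+1}\,|c_{lm}|^2$$

which is approximately, to the lowest order in the $c$'s,

$$\frac{9Z^2e^2}{4\pi R^3}\sum_{l,m}\frac{1}{2l+1}\,|c_{lm}|^2$$

Hence the complete Coulomb energy is

$$V_{\text{Coulomb}} = -\frac{3Z^2e^2}{4\pi R^3}\sum_{l,m}\left(\alpha_{lm}^2 + \beta_{lm}^2\right)\left(1 - \frac{3}{2l+1}\right) \tag{5.505}$$

Once again, we note that there is no $l = 1$ contribution.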
Collecting the various contributions to the energy, (5.497) and 
(5.503) and (5.505), we get a Lagrangian 


3M 5! 3 (+1 
L=37 37 (Qin + Bim) +o >) (Qin + Bin) ( 5 | 
3722 3 
—7R > (im + Bim) (1 -shT) (5.506) 


Provided that the potential energy terms are positive, we see that the
motion is that of periodic surface waves corresponding to $l = 2$ and higher,
with circular frequencies $\omega_l$ given by

$$\omega_l^2 = \frac{8\pi l}{3M}\left\{\sigma\left[\frac{l(l+1)}{2} - 1\right] - \frac{3Z^2e^2}{4\pi R^3}\left(1 - \frac{3}{2l+1}\right)\right\} \tag{5.507}$$

If, on the other hand, 


2 2 
zoe > 4m Ro 


there is instability. In the case in which the liquid drop is an atomic 
nucleus, this instability is related to the phenomenon of nuclear fission; 
however, the fission problem is in fact greatly complicated by the quantum 
mechanics of the nuclear particles. Furthermore, of course, one cannot 
discuss fission in terms of small distortions. 
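As a hedged illustration of how the mode frequencies follow from the three energy terms, the sketch below assumes that the kinetic energy is $(3M/8\pi l)\,\dot{q}^2$ per mode, the surface energy is $\sigma[\tfrac{1}{2}l(l+1) - 1]\,q^2$, and the Coulomb energy is $-(3Z^2e^2/4\pi R^3)(1 - 3/(2l+1))\,q^2$, with $Z^2e^2$ in the same (Gaussian-type) units as the Coulomb energy $3Z^2e^2/5R$ of the sphere. The function names and the numerical values are invented for the example. For an uncharged drop the result is compared with Rayleigh's classical capillary formula $\omega_l^2 = l(l-1)(l+2)\,\sigma/(\eta R^3)$.

```python
import math

def omega_sq(l, sigma, M, R, Z2e2):
    """Squared circular frequency of mode l from the quadratic energies assumed above."""
    surface = sigma * (0.5 * l * (l + 1) - 1.0)
    coulomb = 3.0 * Z2e2 / (4.0 * math.pi * R ** 3) * (1.0 - 3.0 / (2 * l + 1))
    return 8.0 * math.pi * l / (3.0 * M) * (surface - coulomb)

def critical_charge_sq(l, sigma, R):
    """Value of Z^2 e^2 at which mode l becomes unstable (omega_l^2 = 0)."""
    return (sigma * (0.5 * l * (l + 1) - 1.0) * 4.0 * math.pi * R ** 3
            / (3.0 * (1.0 - 3.0 / (2 * l + 1))))

sigma, M, R = 1.3, 2.0, 0.7                   # arbitrary illustrative values
eta = M / (4.0 / 3.0 * math.pi * R ** 3)      # mass density of the drop

for l in (2, 3, 4):
    rayleigh = l * (l - 1) * (l + 2) * sigma / (eta * R ** 3)
    print(l, omega_sq(l, sigma, M, R, 0.0), rayleigh)

# The l = 2 mode is the first to become unstable as the charge is increased
print(critical_charge_sq(2, sigma, R), 20.0 * math.pi / 3.0 * sigma * R ** 3)
```

Under these assumptions the $l = 2$ mode goes soft when $Z^2e^2$ reaches $(20\pi/3)\,\sigma R^3$; a negative `omega_sq` signals the instability discussed in the text.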

What we have dealt with up to this point is the motion of the charges 
in the drop. These determine the charge and current system which gives 
rise to the radiation. We now calculate this radiation. For this purpose 
we note that the current distribution is given by 


J(r) =—poVoS(r) (5.508) 




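where $S$ is a function which is 1 inside the drop and zero outside. Now

$$\begin{aligned}
\phi &= \sum_{l,m} a_{lm}(t)\,r^l\,Y_{lm}(\theta,\varphi) \\
&= -\sum_{l,m}\frac{1}{l\,R_0^{\,l-1}}\,\dot{c}_{lm}\,r^l\,Y_{lm}(\theta,\varphi) \\
&= i\sum_{l,m}\frac{\omega_l}{l\,R_0^{\,l-1}}\,c_{lm}(0)\,r^l\,Y_{lm}(\theta,\varphi)\,e^{-i\omega_l t}
\end{aligned} \tag{5.509}$$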


We have taken advantage of the fact, which is evident from (5.506), that 
the oscillation frequencies do not depend on the quantity m. 

The radiation is now determined by the multipole moments. The 
electric multipole moment is 


Om = Va 2 ‘Po al V’ - (SV'dbo)r'Yim(O') dr’ (5.510) 


where we have substituted for p in terms of J from the continuity equation 
and qo is @ without the time-dependent part. Since V’*¢,)= 0, the surviving 
term is the one involving V’S. Keeping only the lowest-order term in the 
c’s, V'S =—8(r—R,) (r/r). Thus 


= TET | (SS) Rel Vam(") Ra? dO 
= po spr Re!**cim (0) (5.511) 


and the intensity of the electric $2^l$-pole radiation is


1 (/')22%7 1+1 
Ame, (2/41) 1 


Turning to the magnetic multipole moment, and using (5.487), we 
see that we must first calculate 
V'-r' X(V'd-S)=—r' XV' -(SV'd) 


We note that there is in this no term of first order in the c’s; thus, unless 
there is a distribution of magnetic moments, there is no magnetic multi- 
pole radiation from a charged liquid drop. 

This problem was first solved by W. Fierz.* 


P2 1m = Z— -3Ze Se — Re"*|cim(0)|? (5.512) 


Problem 5-31: Using the results of Problems 5-18 
and 5-19, calculate the contribution to the radiation from the liquid drop if 
it contains a uniformly distributed magnetic moment, the total moment 
being M. 


3W. Fierz, Helv. Phys. Acta 16, 365 (1943). 




Problem 5-32: We suggest here a problem involving 
some similar considerations, but without electromagnetic fields; it is 
merely a problem in gravitation and hydrodynamics. 

Suppose one had a planet of the same size as the earth and made 
entirely of water, and it vibrated under the influence of its internal 
gravitational forces. Show that the frequencies of its normal modes of 
vibration would be 


$$\omega_l^2 = \frac{8\pi\rho G}{3}\,\frac{l(l-1)}{2l+1}$$


Describe the oscillation of lowest frequency, and show that its period is a 
bit more than an hour and a half. 


29. Radiation from an 
Accelerating Point Charge 


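If a charge $e$ moves along a curve

$$\mathbf{r} = \mathbf{r}_0(t) \tag{5.513}$$

the charge density may be expressed as

$$\rho = e\,\delta(\mathbf{r} - \mathbf{r}_0(t)) \tag{5.514}$$

and the current as

$$\mathbf{J} = e\,\dot{\mathbf{r}}_0\,\delta(\mathbf{r} - \mathbf{r}_0(t)) = e\,\mathbf{v}_0(t)\,\delta(\mathbf{r} - \mathbf{r}_0(t)) \tag{5.515}$$

Let us consider this motion in a dielectric medium, in which the velocity
of light is given by $u^2 = 1/\epsilon\mu$. The particle emits radiation for which the
scalar and vector potentials are given by (5.413) and (5.414):

$$\phi = \frac{e}{4\pi\epsilon}\int\frac{\delta\!\left(\mathbf{r}' - \mathbf{r}_0\!\left(t - \dfrac{|\mathbf{r}-\mathbf{r}'|}{u}\right)\right)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.516}$$

and

$$\mathbf{A} = \frac{\mu e}{4\pi}\int\frac{\mathbf{v}_0\!\left(t - \dfrac{|\mathbf{r}-\mathbf{r}'|}{u}\right)\delta\!\left(\mathbf{r}' - \mathbf{r}_0\!\left(t - \dfrac{|\mathbf{r}-\mathbf{r}'|}{u}\right)\right)}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' \tag{5.517}$$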
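This may be put in an instructive form by introducing as alternative
coordinates

$$\mathbf{r}'' = \mathbf{r}' - \mathbf{r}_0\!\left(t - \frac{|\mathbf{r}-\mathbf{r}'|}{u}\right) \tag{5.518}$$

It is straightforward to show that the Jacobian $J$ defined by

$$d^3r'' = J\,d^3r' \tag{5.519}$$

is given by

$$J = 1 - \frac{1}{uR}\,\mathbf{R}\cdot\mathbf{v}_0\!\left(t - \frac{R}{u}\right) \tag{5.520}$$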




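where

$$\mathbf{R} = \mathbf{r} - \mathbf{r}' \tag{5.521}$$

It follows that

$$\phi = \frac{e}{4\pi\epsilon}\,\frac{1}{R - (1/u)\,\mathbf{R}\cdot\mathbf{v}_0(t - R/u)} \tag{5.522}$$

$$\mathbf{A} = \frac{e\mu}{4\pi}\,\frac{\mathbf{v}_0(t - R/u)}{R - (1/u)\,\mathbf{R}\cdot\mathbf{v}_0(t - R/u)} \tag{5.523}$$

These are known as the Liénard-Wiechert potentials. The sense in which
they are defined as functions of $\mathbf{r}$ and $t$ is the following: $\mathbf{r}'$ is a function of $\mathbf{r}$
and $t$ defined by the equation

$$\mathbf{r}' = \mathbf{r}_0\!\left(t - \frac{|\mathbf{r}-\mathbf{r}'|}{u}\right) \tag{5.524}$$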


Its position may be seen graphically as follows: Let the curve $C$ in Fig.
5.3 represent the trajectory of the particle, which is traversed in the
direction indicated by the arrow. $P$ is the point $\mathbf{r}$. The point $\mathbf{r}'$ is at $Q$ on
the trajectory $C$. $Q$ is the point on the trajectory corresponding to the
position of the particle at a time earlier than $t$ by the time required for
radiation, with velocity $u$, to travel from $Q$ to $P$.


[Figure 5.3: the trajectory $C$ of the particle, the field point $P$, and the retarded position $Q$.]




The formulas (5.522) and (5.523) for the potentials are very general, 
but their actual calculation in a given case may be quite involved. 


30. Motion in a Straight Line: 
Cerenkov Radiation 


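If the particle moves in a straight line with constant velocity, $\mathbf{v}_0$
becomes a constant. The equation for $\mathbf{r}'$ becomes

$$\mathbf{r}' = \mathbf{v}_0\left(t - \frac{|\mathbf{r}-\mathbf{r}'|}{u}\right) \tag{5.525}$$

Let us take the $z$ axis in the direction of motion, and put

$$\beta = \frac{v_0}{u} \tag{5.526}$$

Since

$$\mathbf{R} = \mathbf{r} - \mathbf{v}_0\left(t - \frac{R}{u}\right) \tag{5.527}$$

we can solve for $R$ and $(1/u)\,\mathbf{R}\cdot\mathbf{v}_0$ and hence obtain the potential. It is
necessary to distinguish two cases (i) $\beta < 1$ and (ii) $\beta > 1$.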

In the first case, the velocity of the particle is less than the velocity 
of propagation in the medium. In this case, the field can be propagated 
ahead of the particle. But in the case in which vy > u, we meet the sort of 
‘“‘bow wave’’ effect discussed earlier for sound waves. 


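(i) $\beta < 1$

Putting $s^2 = r^2 - z^2$, we find that

$$R = \frac{\beta(z - v_0t) + \sqrt{s^2(1-\beta^2) + (z - v_0t)^2}}{1 - \beta^2} \tag{5.528}$$

and the potentials are

$$\phi = \frac{e}{4\pi\epsilon}\,\frac{1}{\sqrt{s^2(1-\beta^2) + (z - v_0t)^2}} \tag{5.529}$$

and

$$\mathbf{A} = \frac{e\mu}{4\pi}\,\frac{\mathbf{v}_0}{\sqrt{s^2(1-\beta^2) + (z - v_0t)^2}} \tag{5.530}$$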
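The retarded-position condition can also be solved numerically, which gives a check on the closed forms for uniform motion. The sketch below assumes a charge moving along the $z$ axis with $r_0(t) = v_0 t$ and $\beta < 1$, solves $R = |\mathbf{r} - \mathbf{r}_0(t - R/u)|$ by fixed-point iteration (a contraction when $v_0 < u$), and compares the Liénard-Wiechert denominator $R - (1/u)\,\mathbf{R}\cdot\mathbf{v}_0$ with $\sqrt{s^2(1-\beta^2) + (z - v_0t)^2}$. The function name and the sample numbers are choices made for this illustration.

```python
import math

def retarded_distance(s, z, t, v0, u, tol=1e-13, max_iter=10000):
    """Solve R = |r - r0(t - R/u)| for a charge at r0(t) = (0, 0, v0 t), v0 < u,
    by fixed-point iteration starting from the instantaneous distance."""
    R = math.hypot(s, z - v0 * t)
    for _ in range(max_iter):
        z_src = v0 * (t - R / u)          # position of the charge at the retarded time
        R_new = math.hypot(s, z - z_src)
        if abs(R_new - R) < tol:
            return R_new
        R = R_new
    return R

s, z, t = 1.0, 2.0, 0.5                   # field point (distance s from the axis) and time
v0, u = 0.6, 1.0                          # illustrative speeds, beta = 0.6
beta = v0 / u

R_ret = retarded_distance(s, z, t, v0, u)
z_src = v0 * (t - R_ret / u)
denominator = R_ret - (z - z_src) * v0 / u            # R - (1/u) R . v0

D = math.sqrt(s * s * (1.0 - beta ** 2) + (z - v0 * t) ** 2)
R_closed = (beta * (z - v0 * t) + D) / (1.0 - beta ** 2)
print(R_ret, R_closed, denominator, D)
```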

The fields may then be calculated to be as follows: the component of
electric field radially outward from the line of motion is

$$E_s = \frac{e}{4\pi\epsilon}\,\frac{s(1-\beta^2)}{D^3} \tag{5.531}$$

while that in the direction of the line of motion is

$$E_z = \frac{e}{4\pi\epsilon}\,\frac{(z - v_0t)(1-\beta^2)}{D^3} \tag{5.532}$$

The magnetic field is totally transverse (around the direction of motion)
and is

$$H_\phi = \frac{ev_0}{4\pi}\,\frac{s(1-\beta^2)}{D^3} \tag{5.533}$$

In these formulas

$$D = \sqrt{s^2(1-\beta^2) + (z - v_0t)^2} \tag{5.534}$$


There is no radiation from the charge because all components of the 
Poynting vector drop off as the inverse fourth power of distance at large 
distance; the integral over a distant surface then approaches zero. 
(ii) $\beta > 1$

We now find that

$$R = \frac{-\sqrt{(z - v_0t)^2 - s^2(\beta^2 - 1)} - \beta(z - v_0t)}{\beta^2 - 1} \tag{5.535}$$

There is, of course, only a nonzero solution for the potentials at points $\mathbf{r}'$
satisfying (5.525). This in turn is the case only if the expression for $R$ is
positive and real. This is so if $v_0t - z > 0$ and

$$v_0t - z \geq s\sqrt{\beta^2 - 1} \tag{5.536}$$

Thus, there is radiation only in a cone extending backwards from $v_0t$
with semivertical angle

$$\alpha = \tan^{-1}\frac{1}{\sqrt{\beta^2 - 1}} = \sin^{-1}\frac{1}{\beta} \tag{5.537}$$
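A short numerical sketch of the cone condition may help fix ideas. It evaluates the semivertical angle from both forms, $\sin^{-1}(1/\beta)$ and $\tan^{-1}(1/\sqrt{\beta^2-1})$, and tests whether a field point lies inside the cone using the condition $v_0t - z \geq s\sqrt{\beta^2 - 1}$. The function names and the sample value $\beta = 2$ are assumptions made for the example.

```python
import math

def cone_half_angle(beta):
    """Semivertical angle of the cone behind the particle; defined only for beta > 1."""
    if beta <= 1.0:
        raise ValueError("no cone exists for beta <= 1")
    return math.asin(1.0 / beta)

def inside_cone(s, z, t, v0, u):
    """True if the point at distance s from the axis and height z lies in the
    region v0*t - z >= s*sqrt(beta^2 - 1). Requires v0 > u."""
    beta = v0 / u
    return (v0 * t - z) >= s * math.sqrt(beta * beta - 1.0)

beta = 2.0                                    # particle twice as fast as the wave speed
alpha = cone_half_angle(beta)
alpha_tan = math.atan(1.0 / math.sqrt(beta ** 2 - 1.0))
print(math.degrees(alpha), math.degrees(alpha_tan))

print(inside_cone(1.0, -2.0, 0.0, 2.0, 1.0))  # well behind the particle: inside
print(inside_cone(1.0, -1.0, 0.0, 2.0, 1.0))  # too far off axis for this z: outside
```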


[Figure 5.4: the cone behind the particle, within which $\mathbf{E}, \mathbf{H} \neq 0$.]




The potentials become

$$\phi = \frac{e}{4\pi\epsilon}\,\frac{1}{\sqrt{(v_0t - z)^2 - s^2(\beta^2 - 1)}} \tag{5.538}$$

and

$$\mathbf{A} = \frac{e\mu}{4\pi}\,\frac{\mathbf{v}_0}{\sqrt{(v_0t - z)^2 - s^2(\beta^2 - 1)}} \tag{5.539}$$

in the cone in question, and zero outside. The potentials, and the fields, 
become singular on the bounding cone. 

The fields are most conveniently expressed in terms of spherical
polar coordinates, with origin at the particle; the only nonzero
components are the radial component of $\mathbf{E}$

$$E_r = -\frac{e}{4\pi\epsilon r^2}\,\frac{\beta^2 - 1}{(1 - \beta^2\sin^2\theta)^{3/2}} \tag{5.540}$$

and the transverse component of $\mathbf{H}$

$$H_\phi = -\frac{ev_0}{4\pi r^2}\,\frac{\sin\theta\,(\beta^2 - 1)}{(1 - \beta^2\sin^2\theta)^{3/2}} \tag{5.541}$$

These are the fields of the Cerenkov radiation. The situation
differs from that in the case $\beta < 1$, in that the fields become large, even
at large distances, near the surface of the cone

$$\beta\sin\theta = 1$$

The energy flux is clearly in the $\theta$ direction in the wake of the "bow wave."


31. Plasmas in a Magnetic Field 


Let us now consider a problem which arises in connection with the 
propagation of an electromagnetic field through a solid (for instance, a 
metal). We take a simplified model of the metal, according to which it is 
made up of positively charged ions which may oscillate about equilibrium, 
and “‘free’’ electrons. If now an electromagnetic field is propagated in this 
system, the oscillations of the field set up oscillating charge and current 
distributions. Since these are determined by the fields, we do not consider 
them as external. Rather, if one substitutes for p and J in terms of the 
fields in (3.14) and (3.17) we have a system of equations for those fields; 
their solution, if it exists, describes a wave maintained in the system 
without sources. If the solution is periodic, that is, if the fields have a
time variation $e^{-i\omega t}$, it represents a normal mode of the electromagnetic
field in the system.

Alternatively, we might talk of a normal mode of the system of 




charges. If the periodic motion of the charges is first assumed, these
produce fields. Those fields in turn determine the motion of the charges; 
classically, this motion is given by the Lorentz force equation (5.326). If 
this reproduces the originally assumed charge distributions, we will have 
found normal modes for the particles interacting with each other through 
their electromagnetic fields. If we call this system of charged particles a 
plasma, we shall have found the normal modes of the plasma. 

A particularly interesting case is that in which the metal is in a 
constant external magnetic field. Since this field is constant, it affects the 
oscillating system only through its effect on the electron motions; that is, 
it affects the distributions of charge and current. 

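As far as the ions are concerned, let the dipole moment per unit
volume associated with their displacement be $\mathbf{P}$. The potential produced
by this polarization is then

$$\frac{1}{4\pi\epsilon_0}\int\frac{\mathbf{P}(\mathbf{r}')\cdot(\mathbf{r}-\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|^3}\,d^3r' = \frac{1}{4\pi\epsilon_0}\int\mathbf{P}(\mathbf{r}')\cdot\nabla'\frac{1}{|\mathbf{r}-\mathbf{r}'|}\,d^3r' = -\frac{1}{4\pi\epsilon_0}\int\frac{\nabla'\cdot\mathbf{P}(\mathbf{r}')}{|\mathbf{r}-\mathbf{r}'|}\,d^3r'$$

on integration by parts. But this is the potential of a charge distribution
$-\nabla\cdot\mathbf{P}$. We may then write (3.14) as

$$\nabla\cdot(\epsilon_0\mathbf{E}) = \rho - \nabla\cdot\mathbf{P} \tag{5.542}$$

where $\rho$ is now simply the electron density. But if the polarization is
proportional to the field producing it, we may put $\mathbf{P} = \chi\mathbf{E}$. Putting
$\epsilon_0 + \chi = \epsilon_0\epsilon_L$, to define the specific dielectric constant $\epsilon_L$ of the ions (the "lattice")
we have

$$\nabla\cdot\epsilon_0\epsilon_L\mathbf{E} = \rho$$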


We have shown, then, that the ions of the lattice modify the dielectric 
constant of the medium through which the electrons interact. 

Strictly speaking, the charge distribution p should include the static 
potential of the ions in their undisturbed positions. This, however, clearly 
makes no contribution to the oscillating fields which we are considering. 

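It is convenient to look for normal modes of a given frequency and
wave number; that is, to assume that all quantities depend on space and
time coordinates as

$$e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)}$$

The coefficients will be functions of $\mathbf{q}$ and $\omega$; for example,

$$\mathbf{E} = \mathbf{E}_q(\omega)\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)} \tag{5.543}$$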


and similarly for all other quantities appearing in the equations. Maxwell’s 




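equations then take the form

$$i\epsilon_0\epsilon_L\,\mathbf{q}\cdot\mathbf{E}_q = \rho_q \tag{5.544}$$

$$i\mu_0\,\mathbf{q}\cdot\mathbf{H}_q = 0 \tag{5.545}$$

$$i\mathbf{q}\times\mathbf{E}_q = i\omega\mu_0\mathbf{H}_q \tag{5.546}$$

$$i\mathbf{q}\times\mathbf{H}_q = \mathbf{J}_q - i\omega\epsilon_0\epsilon_L\mathbf{E}_q \tag{5.547}$$

We now assume that the current $\mathbf{J}_q$ depends linearly on the field $\mathbf{E}_q$.
However, the relation between them may be, as we see below, a tensor
one in the presence of an external constant magnetic field. Therefore, we
write

$$\mathbf{J}_q = \boldsymbol{\sigma}\mathbf{E}_q \tag{5.548}$$

in (5.547). This equation then becomes

$$i\mathbf{q}\times\mathbf{H}_q = \left[-i\omega\epsilon_0\epsilon_L + \boldsymbol{\sigma}\right]\cdot\mathbf{E}_q \tag{5.549}$$

where $\boldsymbol{\sigma}$ is a dyadic.

It is now possible to define the specific dielectric constant of the
medium as

$$\boldsymbol{\epsilon} = \epsilon_L + \frac{i\boldsymbol{\sigma}}{\epsilon_0\omega} \tag{5.550}$$

in terms of which the last of Maxwell's equations (5.547) takes the form

$$i\mathbf{q}\times\mathbf{H}_q = -i\omega\epsilon_0\,\boldsymbol{\epsilon}\,\mathbf{E}_q \tag{5.551}$$

Similarly, since the equation of continuity

$$\nabla\cdot\mathbf{J} + \frac{\partial\rho}{\partial t} = 0$$

takes the form

$$\mathbf{q}\cdot\mathbf{J}_q = \omega\rho_q \tag{5.552}$$

the first Maxwell equation (5.544) becomes

$$i\mathbf{q}\cdot\mathbf{D}_q = 0 \tag{5.553}$$

where

$$\mathbf{D}_q = \epsilon_0\,\boldsymbol{\epsilon}\,\mathbf{E}_q \tag{5.554}$$

Elimination of $\mathbf{H}_q$ from the last two of Maxwell's equations now
gives as the condition for a self-sustaining field in the medium

$$\mathbf{q}\times(\mathbf{q}\times\mathbf{E}_q) + \frac{\omega^2}{c^2}\,\boldsymbol{\epsilon}\,\mathbf{E}_q = 0 \tag{5.555}$$

The determinant of the coefficients gives the general form of the dispersion
relation $\omega = \omega(\mathbf{q})$.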




32. The Dielectric Tensor 


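Let us suppose that the external magnetic field is in the direction
of the unit vector $\mathbf{n}$, and that the dielectric tensor can be expressed in
terms of this vector as


\[ \epsilon_{ij} = \epsilon_1\delta_{ij} + (\epsilon_3 - \epsilon_1)\,n_in_j + i\epsilon_2\,\eta_{ijk}n_k \tag{5.556} \]


$\eta_{ijk}$ is used to designate the permutation symbol, to distinguish it from the
dielectric constants $\epsilon$. Then


\[ \mathbf{D} = \boldsymbol{\epsilon}\cdot\mathbf{E} = \epsilon_1\mathbf{E} + (\epsilon_3 - \epsilon_1)\,\mathbf{n}(\mathbf{n}\cdot\mathbf{E}) + i\epsilon_2\,\mathbf{E}\times\mathbf{n} \tag{5.557} \]


Consider now components which are, respectively, right and left trans-
versely polarized and linearly polarized with respect to $\mathbf{n}$, designating
them by the subscripts $+$, $-$, and 3.
Then
\[ D_\pm = (\epsilon_1 \pm \epsilon_2)E_\pm = \epsilon_\pm E_\pm \tag{5.558} \]
and
\[ D_3 = \epsilon_3E_3 \tag{5.559} \]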
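
The decomposition into circular components can be checked numerically. The following is a minimal sketch, assuming the convention $E_\pm = E_1 \pm iE_2$ with $\mathbf{n}$ along the 3-axis (the text does not state the convention explicitly); the function names and the numerical values are illustrative only.

```python
def cross(a, b):
    """Vector product of two 3-component sequences (complex entries allowed)."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def displacement(E, n, eps1, eps2, eps3):
    """D = eps1*E + (eps3 - eps1)*n*(n.E) + i*eps2*(E x n), following (5.557)."""
    n_dot_E = sum(ni * Ei for ni, Ei in zip(n, E))
    E_cross_n = cross(E, n)
    return [eps1 * E[i] + (eps3 - eps1) * n[i] * n_dot_E + 1j * eps2 * E_cross_n[i]
            for i in range(3)]


def circular(F):
    """Return (F_plus, F_minus, F_3), ASSUMING F_pm = F_1 +/- i*F_2."""
    return F[0] + 1j * F[1], F[0] - 1j * F[1], F[2]


n = [0.0, 0.0, 1.0]                      # field direction along the 3-axis
eps1, eps2, eps3 = 2.0, 0.7, 1.5         # arbitrary illustrative values
E = [0.3 + 0.1j, -0.8 + 0.4j, 0.5 - 0.2j]

D = displacement(E, n, eps1, eps2, eps3)
D_plus, D_minus, D_3 = circular(D)
E_plus, E_minus, E_3 = circular(E)
```

With this convention the three components decouple, each circular or parallel component of $\mathbf{D}$ being a scalar multiple of the corresponding component of $\mathbf{E}$.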


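To see how a tensor such as (5.556) may arise, let us calculate the
dielectric tensor (or rather, the conductivity tensor, from which it may
be derived) for free electrons by means of classical mechanics. Suppose
that the position vectors of the electrons are $\mathbf{r}_i(t)$. Then the electron
density is


\[ \rho = \sum_i \delta(\mathbf{r} - \mathbf{r}_i(t)) \tag{5.560} \]
and the current is
\[ \mathbf{J} = -e\sum_i \mathbf{V}_i(t)\,\delta(\mathbf{r} - \mathbf{r}_i(t)) \tag{5.561} \]
where
\[ \mathbf{V}_i = \frac{d\mathbf{r}_i}{dt} \]


Let us expand this current in a Fourier series in a unit volume, the expan-
sion functions being $e^{i\mathbf{q}\cdot\mathbf{r}}$, which satisfy


\[ \int e^{i\mathbf{q}\cdot\mathbf{r}}\,e^{-i\mathbf{q}'\cdot\mathbf{r}}\,d\mathbf{r} = \delta_{\mathbf{q}\mathbf{q}'} \tag{5.562} \]
Then we may write, on multiplying through by $e^{-i\mathbf{q}'\cdot\mathbf{r}}$ and integrating,
\[ \mathbf{J}_{q'} = \int \mathbf{J}(\mathbf{r})\,e^{-i\mathbf{q}'\cdot\mathbf{r}}\,d\mathbf{r} = -e\sum_i \mathbf{V}_i(t)\,e^{-i\mathbf{q}'\cdot\mathbf{r}_i} \tag{5.563} \]


On differentiating, we obtain


\[ \frac{d\mathbf{J}_{q'}}{dt} = -e\sum_i\left[\dot{\mathbf{V}}_i - i(\mathbf{q}'\cdot\mathbf{V}_i)\mathbf{V}_i\right]e^{-i\mathbf{q}'\cdot\mathbf{r}_i} \tag{5.564} \]


Making use of the Lorentz equation of motion,


\[ \frac{d\mathbf{V}_i}{dt} = -\frac{e}{m}\left[\mathbf{E}(\mathbf{r}_i) + \mathbf{V}_i\times\mathbf{B}_0\right] \]


Therefore,


\[ \frac{d\mathbf{J}_{q'}}{dt} = \frac{e^2}{m}\sum_i \mathbf{E}(\mathbf{r}_i)\,e^{-i\mathbf{q}'\cdot\mathbf{r}_i} + \frac{e^2}{m}\sum_i \mathbf{V}_i\times\mathbf{B}_0\,e^{-i\mathbf{q}'\cdot\mathbf{r}_i} + ie\sum_i(\mathbf{q}'\cdot\mathbf{V}_i)\mathbf{V}_i\,e^{-i\mathbf{q}'\cdot\mathbf{r}_i} \]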


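Consider the terms on the right one at a time. We calculate the current for
a field of the form
\[ \mathbf{E} = \mathbf{E}_0\,e^{i(\mathbf{q}\cdot\mathbf{r} - \omega t)} \tag{5.566} \]
Therefore,
\[ \sum_i \mathbf{E}(\mathbf{r}_i)\,e^{-i\mathbf{q}'\cdot\mathbf{r}_i} = \mathbf{E}_0\,e^{-i\omega t}\sum_i e^{i(\mathbf{q}-\mathbf{q}')\cdot\mathbf{r}_i} \]


It will be shown in Chapter 7 that the value of the sum is much smaller
for $\mathbf{q}' \neq \mathbf{q}$ than for $\mathbf{q}' = \mathbf{q}$, the ratio approaching zero as the number of
particles approaches infinity. We therefore write the sum approximately
as $N\delta_{\mathbf{q}\mathbf{q}'}$, $N$ being the number of electrons in the unit volume: this is the
classical form of what is known as the “random phase
approximation.” In this approximation, the only component of current
generated by the field (5.566) is $\mathbf{J}_q$.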
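
The size of the off-diagonal sums can be illustrated numerically. The sketch below assumes, purely for illustration, that the electron positions are independent and uniformly distributed in the unit cube (the argument of Chapter 7 is not reproduced here); the names `phase_sum`, `same`, and `diff` are hypothetical.

```python
import cmath
import math
import random

random.seed(0)                     # fixed seed so the run is reproducible
N = 10_000                         # number of electrons in the unit volume
positions = [(random.random(), random.random(), random.random())
             for _ in range(N)]


def phase_sum(dq):
    """Sum over electrons of exp(i * dq . r_i), with dq = q - q'."""
    return sum(cmath.exp(1j * (dq[0] * x + dq[1] * y + dq[2] * z))
               for x, y, z in positions)


two_pi = 2.0 * math.pi
same = phase_sum((0.0, 0.0, 0.0))                  # q' = q: every term is 1
diff = phase_sum((two_pi * 1, 0.0, two_pi * 2))    # q' != q on the Fourier lattice
ratio = abs(diff) / abs(same)
```

For uncorrelated positions the off-diagonal sum is a random walk of $N$ unit phasors, so its magnitude is of order $\sqrt{N}$ and `ratio` falls off like $1/\sqrt{N}$.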

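Writing the second term on the right as $-(eB_0/m)\,\mathbf{J}_q\times\mathbf{n}$ for the case
$\mathbf{q}' = \mathbf{q}$, $\mathbf{n}$ being the unit vector in the direction of $\mathbf{B}_0$, and neglecting the
last term, we find that

\[ \frac{d\mathbf{J}_q}{dt} = \frac{Ne^2}{m}\mathbf{E}_0\,e^{-i\omega t} - \omega_c\,\mathbf{J}_q\times\mathbf{n} \tag{5.567} \]

where we have put $\omega_c = eB_0/m$ for the Larmor frequency (“cyclotron
frequency”) of the electrons. If we now put


\[ \mathbf{J}_q = \mathbf{J}\,e^{-i\omega t} \]


(5.567) becomes
\[ -i\omega\mathbf{J} = \frac{Ne^2}{m}\mathbf{E}_0 - \omega_c\,\mathbf{J}\times\mathbf{n} \tag{5.568} \]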


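Taking scalar and vector products with $\mathbf{n}$, and eliminating $\mathbf{J}\cdot\mathbf{n}$ and $\mathbf{J}\times\mathbf{n}$
from the three equations, we may verify that


\[ \mathbf{J} = \frac{iNe^2\omega}{m(\omega^2 - \omega_c^2)}\left[\mathbf{E}_0 - i\frac{\omega_c}{\omega}\,\mathbf{E}_0\times\mathbf{n} - \frac{\omega_c^2}{\omega^2}(\mathbf{E}_0\cdot\mathbf{n})\,\mathbf{n}\right] \tag{5.569} \]


From this we may calculate the conductivity tensor:


\[ \sigma_{ij}(\mathbf{q},\omega) = \frac{iNe^2\omega}{m(\omega^2 - \omega_c^2)}\left(\delta_{ij} - i\frac{\omega_c}{\omega}\,\eta_{ijk}n_k - \frac{\omega_c^2}{\omega^2}\,n_in_j\right) \tag{5.570} \]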
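
The elimination leading to (5.569) can be verified by substituting the closed form back into (5.568). The following is a minimal sketch with arbitrary illustrative numbers; `a` stands for $Ne^2/m$, and the function names are hypothetical.

```python
def cross(a, b):
    """Vector product of two 3-component sequences (complex entries allowed)."""
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def current_closed_form(E0, n, omega, omega_c, a):
    """Closed form (5.569) for the current amplitude J; n must be a unit vector."""
    prefactor = 1j * a * omega / (omega**2 - omega_c**2)
    ratio = omega_c / omega
    E0_cross_n = cross(E0, n)
    E0_dot_n = sum(Ei * ni for Ei, ni in zip(E0, n))
    return [prefactor * (E0[i] - 1j * ratio * E0_cross_n[i]
                         - ratio**2 * E0_dot_n * n[i])
            for i in range(3)]


def residual(J, E0, n, omega, omega_c, a):
    """Left side minus right side of (5.568): -i*omega*J - (a*E0 - omega_c * J x n)."""
    J_cross_n = cross(J, n)
    return [-1j * omega * J[i] - a * E0[i] + omega_c * J_cross_n[i]
            for i in range(3)]


n = [1.0 / 3.0, 2.0 / 3.0, 2.0 / 3.0]        # unit vector along B_0
E0 = [1.0 + 0.5j, -0.3j, 0.2 + 0.0j]          # arbitrary field amplitude
omega, omega_c, a = 2.0, 0.7, 1.3             # arbitrary illustrative values

J = current_closed_form(E0, n, omega, omega_c, a)
res = residual(J, E0, n, omega, omega_c, a)
max_residual = max(abs(r) for r in res)
```

The residual vanishes to rounding error for any unit vector `n` and any `omega` different from `omega_c`.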


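From (5.550) it follows that the dielectric tensor is


\[ \epsilon_{ij}(\mathbf{q},\omega) = \left[\epsilon_L - \frac{Ne^2}{m\epsilon_0}\frac{1}{\omega^2 - \omega_c^2}\right]\delta_{ij} - i\frac{Ne^2}{m\epsilon_0}\frac{\omega_c}{\omega}\frac{1}{\omega^2 - \omega_c^2}\,\eta_{ijk}n_k + \frac{Ne^2}{m\epsilon_0}\frac{\omega_c^2}{\omega^2}\frac{1}{\omega^2 - \omega_c^2}\,n_in_j \tag{5.571} \]


Therefore, in the notation of (5.556),


\[ \epsilon_1 = \epsilon_L - \frac{Ne^2}{m\epsilon_0}\frac{1}{\omega^2 - \omega_c^2} \tag{5.572} \]
\[ \epsilon_2 = -\frac{Ne^2\omega_c}{m\epsilon_0}\frac{1}{\omega(\omega^2 - \omega_c^2)} \tag{5.573} \]
and
\[ \epsilon_3 = \epsilon_L - \frac{Ne^2}{m\epsilon_0}\frac{1}{\omega^2} \tag{5.574} \]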


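Before proceeding with the calculation of the normal modes, let us
examine the approximation involved in the neglect of the term
$ie\sum_i(\mathbf{q}\cdot\mathbf{V}_i)\mathbf{V}_i\,e^{-i\mathbf{q}\cdot\mathbf{r}_i}$. In magnitude, the contribution from the $i$th electron bears to
that from the preceding term, $e\omega_c\mathbf{V}_i\times\mathbf{n}\,e^{-i\mathbf{q}\cdot\mathbf{r}_i}$, the ratio $\mathbf{q}\cdot\mathbf{V}_i/\omega_c$. Writing
the left-hand side as


\[ \frac{d\mathbf{J}_q}{dt} = -i\omega\mathbf{J}_q = i\omega e\sum_i \mathbf{V}_i\,e^{-i\mathbf{q}\cdot\mathbf{r}_i} \]


it bears to the $i$th contribution from this term the ratio $\mathbf{q}\cdot\mathbf{V}_i/\omega$. The
approximation is then valid provided $q$ is small enough that $\mathbf{q}\cdot\mathbf{V}_i \ll \omega$ or
$\omega_c$. The results which we obtain are then valid only for such values of $q$.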
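When there is no magnetic field, it follows from (5.571) that the di-
electric tensor is diagonal:
\[ \epsilon_{ij} = \left(\epsilon_L - \frac{Ne^2}{m\epsilon_0}\frac{1}{\omega^2}\right)\delta_{ij} \tag{5.575} \]
From (5.555), since in the case of longitudinal modes the first term is
zero, the condition that $\mathbf{E}_q \neq 0$ is that $\epsilon_1 = 0$. The frequency is therefore
given by
\[ \omega^2 = \frac{Ne^2}{m\epsilon_0\epsilon_L} \equiv \omega_p^2 \tag{5.576} \]


This is the well-known plasma frequency. Introducing it into (5.572)-
(5.574) they become


\[ \epsilon_1 = \epsilon_L\left(1 - \frac{\omega_p^2}{\omega^2 - \omega_c^2}\right) \tag{5.577} \]
\[ \epsilon_2 = -\epsilon_L\,\frac{\omega_p^2\,\omega_c}{\omega(\omega^2 - \omega_c^2)} \tag{5.578} \]
and
\[ \epsilon_3 = \epsilon_L\left(1 - \frac{\omega_p^2}{\omega^2}\right) \tag{5.579} \]


Also,


\[ \epsilon_\pm = \epsilon_1 \pm \epsilon_2 = \epsilon_L\left(1 - \frac{\omega_p^2}{\omega(\omega \mp \omega_c)}\right) \tag{5.580} \]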


Note that, by virtue of (5.544), longitudinal waves are associated 
with a density oscillation, while transverse waves are not. 

Let us now look at the normal modes in the case in which the mag-
netic field $\mathbf{B}_0$ is present. We consider two cases: (i) that in which $\mathbf{q}$ is
parallel to $\mathbf{B}_0$, (ii) that in which $\mathbf{q}$ is perpendicular to $\mathbf{B}_0$.


(I) PROPAGATION IN THE DIRECTION OF THE EXTERNAL
FIELD $\mathbf{B}_0$.

We start from (5.555). Since $\mathbf{q} = q\mathbf{n}$, the $+$, $-$, and 3-components of
that equation are


\[ \frac{c^2q^2}{\omega^2}E_\pm = \epsilon_\pm E_\pm \tag{5.581} \]
and
\[ \epsilon_3E_3 = 0 \tag{5.582} \]


From the latter, we see that the longitudinal mode is exactly as in the
case in which the field was absent, that is,


\[ \omega = \omega_p. \]
In the transverse case, the dispersion relation is
\[ c^2q^2 = \omega^2\epsilon_\pm \tag{5.583} \]


for right and left circularly polarized waves, respectively. These are,
then, normal modes for which the right-hand side of (5.583) is positive.


Problem 5-33: Draw a sketch to illustrate the spec-
trum of normal modes for the case in which $\omega_c > \omega_p$; show that in the
case $\omega_c \gg \omega_p$ the normal frequencies are approximately as follows:

(i) for right polarization, below $\omega_c$ and above $\omega_c(1 + \omega_p^2/\omega_c^2)$
(ii) for left polarization, a band beyond $\omega \approx \omega_p^2/\omega_c$.


The situation is illustrated in Fig. 5.5. For the case of right circular
polarization, there are normal modes with arbitrarily small frequency.
Although the diagram indicates that the frequency spectrum of these
modes extends up to $\omega_c$, we must remember that our approximations have
limited us to sufficiently small $q$; as $\omega \to \omega_c$, however, $q \to \infty$. Therefore,
these modes cannot be assumed to exist up to the frequency $\omega_c$.


Figure 5.5


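For small $q$ (and consequently small $\omega$),
\[ \epsilon_+ \approx \epsilon_L\frac{\omega_p^2}{\omega\omega_c} = \frac{c^2q^2}{\omega^2} \]
so that the frequency is given by


\[ \omega = \frac{c^2q^2\omega_c}{\epsilon_L\omega_p^2} \tag{5.584} \]
Substituting the forms of $\omega_c$ and $\omega_p^2$, it is found that
\[ \omega = \frac{\epsilon_0B_0}{Ne}\,c^2q^2 \tag{5.585} \]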

These transverse, right circularly polarized modes with quadratic dis- 
persion relation are known as helicons. 
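
To get a feeling for the orders of magnitude, the helicon frequency can be evaluated in two equivalent ways: directly from the form $\omega = \epsilon_0B_0c^2q^2/Ne$, and from $\omega = c^2q^2\omega_c/\epsilon_L\omega_p^2$ with $\omega_c = eB_0/m$ and $\omega_p^2 = Ne^2/m\epsilon_0\epsilon_L$. The sketch below is illustrative only; the carrier density, field, and wavelength are assumed values of roughly metallic size, and the function names are hypothetical.

```python
import math

EPS0 = 8.8541878128e-12      # permittivity of free space, F/m
C = 299792458.0              # speed of light, m/s
E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron mass, kg


def helicon_omega(q, B0, N):
    """Helicon frequency from omega = eps0 * B0 * c^2 * q^2 / (N * e)."""
    return EPS0 * B0 * C**2 * q**2 / (N * E_CHARGE)


def helicon_omega_from_frequencies(q, B0, N, eps_L=1.0):
    """Same frequency written as c^2 q^2 omega_c / (eps_L * omega_p^2)."""
    omega_c = E_CHARGE * B0 / M_E
    omega_p_sq = N * E_CHARGE**2 / (M_E * EPS0 * eps_L)
    return C**2 * q**2 * omega_c / (eps_L * omega_p_sq)


N = 2.5e28                   # assumed carrier density, m^-3
B0 = 1.0                     # assumed field, tesla
q = 2.0 * math.pi / 1.0e-3   # assumed wavelength of 1 mm

omega_direct = helicon_omega(q, B0, N)
omega_check = helicon_omega_from_frequencies(q, B0, N, eps_L=3.0)
```

The lattice constant $\epsilon_L$ cancels between the two factors, and doubling $q$ multiplies the frequency by four, which is the quadratic dispersion referred to in the text.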

For the case of left circular polarization (l.c.p.), there are no low-
frequency modes. In the case of both right and left circular polarization,
however, there are higher-frequency modes; for the l.c.p. case their
frequency spectrum starts just below $\omega_p$, and for the r.c.p. case, just
above.

Let us note the relation between the magnetic and electric fields in
these modes. For transverse circularly polarized waves (5.546) gives


\[ \pm qE_{q\pm} = -i\omega\mu_0H_{q\pm} \]


or


\[ H_{q\pm} = \pm\frac{iq}{\omega\mu_0}E_{q\pm} \tag{5.586} \]


But for an electromagnetic wave in empty space


\[ H_{q\pm} = \pm\frac{i}{c\mu_0}E_{q\pm} \tag{5.587} \]


Therefore, the ratio of magnetic to electric intensities in the medium is
$cq/\omega$ times its value in free space. For the helicons this ratio is
$\epsilon_L(\omega_p^2/cq\omega_c)$. If $\omega_p \gg \omega_c$, as is the case in metals, this ratio may be very large;
that is to say, the oscillating transverse magnetic field associated with a
helicon is very strong. For this reason helicons can be expected to inter-
act strongly with the electron spins in ferromagnetic materials, or even
with nuclear magnetic moments.


(II) PROPAGATION PERPENDICULAR TO THE EXTERNAL
MAGNETIC FIELD $\mathbf{B}_0$.

It is useful in this case to speak of components of fields in the direc-
tion of $\mathbf{B}_0$ as parallel, and of the other components as perpendicular.
These are now distinct from longitudinal and transverse components,
contrary to the situation in the previous case. It is convenient, however, to
take $\mathbf{B}_0$ still in the 3-direction, and to speak of $+$, $-$, and 3-components.
The propagation vector $\mathbf{q}$ may be taken to be in the 1-direction. Then, in
(5.555), $\mathbf{q}\times(\mathbf{q}\times\mathbf{E}_q)$ will have components $(0, -q^2E_2, -q^2E_3)$ where $E_2$
and $E_3$ are the second and third components of $\mathbf{E}_q$. Thus (5.555) gives


\[ D_1 = 0 \tag{5.588} \]
\[ D_2 = \frac{c^2q^2}{\omega^2}E_2 \tag{5.589} \]
\[ D_3 = \frac{c^2q^2}{\omega^2}E_3 \tag{5.590} \]


Using the form (5.579) for $\epsilon_3$, we may show that


\[ \omega^2 = \omega_p^2 + c^2q^2 \tag{5.591} \]


This mode, which is related to the plasma mode in the case of parallel
propagation, is now transverse rather than longitudinal, and is no longer
associated with a density fluctuation. It is linearly polarized in the direc-
tion of the magnetic field.

Equations (5.588) and (5.589) may be written


\[ \epsilon_1E_1 + i\epsilon_2E_2 = 0, \]


\[ \epsilon_1E_2 - i\epsilon_2E_1 = \frac{c^2q^2}{\omega^2}E_2. \]


There are nonzero solutions if the determinant of the coefficients is zero;
this leads to the equation


\[ c^2q^2 = \omega^2\,\frac{\epsilon_+\epsilon_-}{\epsilon_1} \tag{5.592} \]


There are normal modes at those frequencies for which one or all of $\epsilon_+$, $\epsilon_-$,
and $\epsilon_1$ are positive. For small frequencies, only $\epsilon_-$ is negative; beyond $\omega_c$
(supposed $< \omega_p$) all three are. The first to become positive is $\epsilon_-$, so the
spectrum begins at the point $\epsilon_- = 0$, that is,


\[ \omega = \sqrt{\omega_p^2 + \tfrac{1}{4}\omega_c^2} - \tfrac{1}{2}\omega_c \approx \omega_p - \tfrac{1}{2}\omega_c \tag{5.593} \]


Since at this point $q = 0$, $D_\pm = 0$. Consequently, only $E_-$ can be $\neq 0$.
This band of allowed frequencies terminates when $\epsilon_1 = 0$, that is, when


\[ \omega^2 = \omega_p^2 + \omega_c^2 \tag{5.594} \]


At this value $E_2 = 0$, and we have a longitudinal mode, and corresponding
density fluctuations. In fact, throughout the whole band such fluctuations
will be present, since there is always a longitudinal component.

There is a further band of normal modes beginning at the frequency for which
$\epsilon_+ = 0$:


\[ \omega = \sqrt{\omega_p^2 + \tfrac{1}{4}\omega_c^2} + \tfrac{1}{2}\omega_c \approx \omega_p + \tfrac{1}{2}\omega_c \tag{5.595} \]


At this frequency the field is of the form $E_+$. Going toward larger fre-
quencies, $\epsilon_+ \to \epsilon_-$ and $\epsilon_2 \to 0$. Thus, the wave becomes primarily trans-
versely polarized in the direction perpendicular to the magnetic field.
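
The three characteristic frequencies of this case, the two zeros of the circular dielectric constants and the zero of $\epsilon_1$, are easily tabulated. The following is a minimal sketch in arbitrary frequency units; the function name `band_edges` and the sample values are illustrative assumptions.

```python
import math


def band_edges(omega_p, omega_c):
    """Return (lower edge, zero of eps_1, upper edge) for perpendicular propagation.

    lower  = sqrt(omega_p^2 + omega_c^2/4) - omega_c/2   start of the first band
    hybrid = sqrt(omega_p^2 + omega_c^2)                 end of the first band
    upper  = sqrt(omega_p^2 + omega_c^2/4) + omega_c/2   start of the second band
    """
    root = math.sqrt(omega_p**2 + 0.25 * omega_c**2)
    lower = root - 0.5 * omega_c
    hybrid = math.sqrt(omega_p**2 + omega_c**2)
    upper = root + 0.5 * omega_c
    return lower, hybrid, upper


omega_p, omega_c = 10.0, 1.0          # illustrative values with omega_p >> omega_c
lower, hybrid, upper = band_edges(omega_p, omega_c)
```

The two outer frequencies are the roots of $\omega^2 \pm \omega\omega_c - \omega_p^2 = 0$, so their product is $\omega_p^2$ and their difference is $\omega_c$; for $\omega_p \gg \omega_c$ they lie close to $\omega_p \mp \tfrac{1}{2}\omega_c$.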


33. Excitation of Helicons by a
Plane Wave Normally Incident on
an Interface


Consider the problem of a plane wave in empty space, impinging
normally on a material which may be treated as a free electron gas. If
the direction of propagation is the 3-direction, right and left circularly
polarized waves, respectively, of frequency $\omega$ are characterized by the
fields

\[ E_\pm = E'_\pm\,e^{i\omega((z/c) - t)} + E''_\pm\,e^{-i\omega((z/c) + t)} \tag{5.596} \]


and, from (5.587),
\[ H_\pm = \pm\frac{i}{\mu_0c}E'_\pm\,e^{i\omega((z/c) - t)} \mp \frac{i}{\mu_0c}E''_\pm\,e^{-i\omega((z/c) + t)} \tag{5.597} \]


The first terms correspond in each case to the incident wave; the second,
formed by changing the sign of $q$, correspond to the reflected wave.
In the medium, we have


\[ E_\pm = E'''_\pm\,e^{i(qz - \omega t)} \tag{5.598} \]
\[ H_\pm = \pm\frac{iq}{\mu_0\omega}E'''_\pm\,e^{i(qz - \omega t)} \tag{5.599} \]


where $q$ is given in terms of $\omega$ by (5.581). Letting the interface be at $z = 0$,
and applying the boundary conditions of continuity of $E$ and $H$,


\[ E'_\pm + E''_\pm = E'''_\pm \tag{5.600} \]
and


\[ E'_\pm - E''_\pm = \frac{cq}{\omega}E'''_\pm \tag{5.601} \]


Solving for the transmitted wave in terms of the incident,


\[ E'''_\pm = \frac{2}{1 + (cq/\omega)}E'_\pm \tag{5.602} \]
For the corresponding magnetic fields
\[ H'''_\pm = \frac{2cq/\omega}{1 + (cq/\omega)}H'_\pm \tag{5.603} \]


Let us consider a frequency within the allowed band of normal modes.
The electric and magnetic fields are then $\pi/2$ out of phase; the Poynting
vector is, therefore, simply their product. Thus the ratio


\[ \frac{\text{Transmitted intensity}}{\text{Incident intensity}} = \frac{4cq/\omega}{[1 + (cq/\omega)]^2} = \frac{4\omega cq}{(\omega + cq)^2} \tag{5.604} \]


This has its maximum value for $cq = \omega$, i.e.,


\[ \omega^2 - \omega\omega_c - \frac{\epsilon_L}{\epsilon_L - 1}\,\omega_p^2 = 0 \tag{5.605} \]

For low frequencies, such transmission is only possible for right
circularly polarized waves.
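
The dependence of the transmitted fraction on $cq/\omega$ can be made concrete by multiplying the amplitude ratios (5.602) and (5.603). The following is a minimal sketch, valid only inside a band of normal modes where $q$ is real; the function name is hypothetical.

```python
def transmitted_fraction(x):
    """Transmitted/incident intensity for real x = c*q/omega > 0.

    Product of the electric ratio 2/(1+x) from (5.602) and the
    magnetic ratio 2x/(1+x) from (5.603), i.e. 4x/(1+x)^2.
    """
    electric_ratio = 2.0 / (1.0 + x)
    magnetic_ratio = 2.0 * x / (1.0 + x)
    return electric_ratio * magnetic_ratio


matched = transmitted_fraction(1.0)      # c*q = omega
slow = transmitted_fraction(0.5)
fast = transmitted_fraction(2.0)
```

The expression is symmetric under $x \to 1/x$ and reaches its largest value, complete transmission, at $cq = \omega$, where the wave in the medium has the same ratio of $H$ to $E$ as in free space.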

When we are not within the bands of normal modes, $q$ determined by
(5.581) is imaginary, and the wave is exponentially attenuated within
the material. The electric and magnetic fields within the material are in
phase, and there is no energy propagation.

We note that when the magnetic field goes to zero $\epsilon_\pm$ are negative
from zero frequency up to $\omega = \omega_p$, and no propagation is possible. We see,
then, that the propagation of the low-frequency helicons is possible only 
due to the presence of the magnetic field. The reason for this is physically 
clear enough: In a magnetic field, the electrons tend to move at right 
angles to the electric field of the wave. Thus, this field does no work on 
the electrons and there is no dissipation. 

In fact, in all real materials the situation is complicated by the fact
that the electrons are scattered and thus acquire a component in the
direction of the field, with the result that there is some attenuation. This
is not important if the characteristic period of the electrons in the mag-
netic field is much shorter than the mean time between scattering, that is,
if $\omega_c\tau \gg 1$. When $\omega_c\tau < 1$, however, there will be substantial attenuation
of the helicon waves.


Problem 5-34: Show that the transmission of a
right circularly polarized wave (helicon) of frequency $\omega < \omega_c$ through a
slab of material of thickness $d$ is
\[ T = \frac{4\omega^2c^2q^2}{4\omega^2c^2q^2\cos^2qd + (q^2c^2 + \omega^2)^2\sin^2qd} \]


Sketch the form of the transmission as a function of the constant external
field.


Problem 5-35: Calculate the transmission of a wave 
similar to that in the previous problem, but with the magnetic field parallel 
to the surface. Show that there is an oscillating surface charge induced on 
the surface. 


Problem 5-36: Recalculate the dielectric constants
when there are $N_1$ carriers having charge $-e$ and mass $m_1$, and $N_2$ having
charge $+e$ and mass $m_2$. Show that, if $N_1 = N_2$, the dispersion relation
for the waves in the material is linear ($\omega = \text{constant} \times q$) rather than
quadratic. (These are known as “Alfven waves.”)


Problem 5-37: Calculate, for the case in which the
magnetic field $\mathbf{B}_0$ is parallel to the plane surface of a very thick slab of
material, the reflectivity of a wave linearly polarized perpendicular to the
direction of $\mathbf{B}_0$, and having a circular frequency between


\[ \omega = \sqrt{\omega_p^2 + \tfrac{1}{4}\omega_c^2} - \tfrac{1}{2}\omega_c \]


and


\[ \omega = \sqrt{\omega_p^2 + \tfrac{1}{4}\omega_c^2} + \tfrac{1}{2}\omega_c \]


Sketch your result.


APPENDIX 5A: Approximate Formulas for Spherical Bessel Functions 
of Large Order 


(i). Method of 
Steepest Descents 


We shall first derive formulas for the spherical Hankel functions, and 
deduce ones for the Bessel and Neumann functions. 

The method used to derive these formulas will be the “method of 
steepest descents.” We first sketch briefly the main features of this 
method. 

We consider integrals of the form 


\[ I = \int_C \exp f(z)\,dz \tag{A.1} \]


in the complex plane, “C” designating a contour. We note first that the
integrand is largest where Re f(z) is a maximum. 

Because of the exponential dependence, it might be hoped that the 
greater part of the integral would come from the neighborhood of this 
maximum. However, this may not be a useful observation for two reasons: 


(1) Re f(z) may not fall off rapidly enough from the point at which it 
is maximum, or 

(2) The imaginary part of f(z) may be rapidly varying, causing rapid 
oscillations in the integrand which may diminish its value near the point 
in question. 


We may try to improve the situation by replacing the given contour 
by an alternative one for which these two difficulties do not arise. We 
may, in fact, choose any alternative contour C’ such that there are no 
poles of the integrand between the original contour and the new one (and, 
of course, the new contour may not cross a branch cut). How, then, 
should we select this new contour? Clearly, we should choose it, if
possible, so that near the maximum of Re $f(z)$ that quantity decreases as
rapidly as possible, while at the same time Im $f(z)$ varies as little as
possible. Fortunately, it is possible to satisfy these conditions simul- 
taneously, by virtue of the following theorems in the theory of functions 
of a complex variable: 

(a) If f(z) =u(x, y) +iv(x, y), the curves u=constant and v= 
constant are orthogonal at any point, u varying most rapidly along 
v = constant and vice versa. 

(b) If $f(z)$ is analytic on and within a circle of radius $R$ about a point
$z_0$, the average value of the function around the circle is equal to its value
at $z_0$, that is,


\[ f(z_0) = \frac{1}{2\pi}\int_0^{2\pi} f(z_0 + Re^{i\theta})\,d\theta \tag{A.2} \]
(Ref. 4)


The second theorem clearly holds for the real and imaginary parts of 
f(z) separately, and states that neither can have a maximum or a mini- 
mum at any point at which f(z) is analytic. 

Suppose now we look for points at which the derivative f’(z) is 
zero. Then, both u and v will have extrema at that point. However, by the 
second theorem above, this extremum can be neither a maximum nor a 
minimum relative to all neighboring points. It must, in fact, then, be a 
‘saddle point,”’ such that the function increases for some displacements 
from the extremum, and decreases for others. 

Suppose, now, that we choose our contour through such a point. Let 
us choose the contour to pass through it along that path on which u= 
Re f(z) rises most steeply to a maximum. Along this curve v = Im f(z) is 
constant. 

If, now, we expand $f(z)$ about the saddle point $z_0$ along this path,


\[ f(z) = f(z_0) + \tfrac{1}{2}(z - z_0)^2f''(z_0) + \cdots \tag{A.3} \]


[where, because Re $f(z)$ is a maximum at $z = z_0$, Re $f''(z_0) < 0$], we may
write the contribution to the integral from the neighborhood of this point
approximately as


\[ I \approx e^{f(z_0)}\int e^{-\frac{1}{2}(z - z_0)^2[-f''(z_0)]}\,dz \tag{A.4} \]
Provided the expansion to the quadratic term in (A.4) provides a good
approximation to $f(z)$ over a distance along the curve a few times greater
than $1/\sqrt{-f''(z_0)}$, (A.4) will provide a good approximation to the integral $I$.


Ref. 4: For a discussion of these theorems, see, for example, Maurice Heins, Topics in Comp-
lex Function Theory. New York: Holt, Rinehart and Winston, 1962, chapter 4, pp. 55-57.


The above has assumed that the deformed contour may only go 
through one saddle point. If there are several, the contour may perhaps 
be chosen in alternative ways; one must remember always, however, that 
the contribution from the part of the contour away from the saddle point 
must be negligible. A contour passing through more than one saddle 
point is also possible, the contributions being added. 

If the subsequent terms in the expansion, the first of which is
$\tfrac{1}{6}(z - z_0)^3f'''(z_0)$, are negligible within the range $1/\sqrt{-f''(z_0)}$, we can with
negligible error replace the limits on the integral (A.4) by $-\infty$ and $\infty$ along
the line in the complex plane corresponding to the direction of the contour
at $z_0$. Then, the integral may be evaluated to give, approximately,


\[ I \approx e^{f(z_0)}\sqrt{\frac{2\pi}{-f''(z_0)}} \tag{A.5} \]


The condition that this be a good approximation may, for practical
purposes, be formulated as


\[ \frac{|f'''(z_0)|}{[-f''(z_0)]^{3/2}} \ll 1 \tag{A.6} \]


[It should be noted, incidentally, that since on the contour Im $f(z) = 0$,
$-f''(z_0) = -\text{Re}\,f''(z_0) = -u''(z_0)$.]


(ii). Example: Asymptotic Expansion
of the Γ Function


The Γ function is defined by
\[ \Gamma(n) = \int_0^\infty x^{n-1}e^{-x}\,dx \tag{A.7} \]


Replacing $x$ by the complex variable $z$,


\[ f(z) = (n - 1)\ln z - z \tag{A.8} \]
Then
\[ f'(z) = \frac{n - 1}{z} - 1 \tag{A.9} \]
which is equal to zero at
\[ z = z_0 = n - 1 \tag{A.10} \]


Also
\[ f''(z_0) = -\frac{1}{n - 1} \tag{A.11} \]
\[ f'''(z_0) = \frac{2}{(n - 1)^2} \tag{A.12} \]
\[ f''''(z_0) = -\frac{6}{(n - 1)^3} \tag{A.13} \]
Thus, from (A.5)


\[ \Gamma(n) \approx \exp[(n - 1)\ln(n - 1) - (n - 1)]\sqrt{2\pi(n - 1)} = \sqrt{2\pi(n - 1)}\,(n - 1)^{n-1}e^{-(n-1)} \tag{A.14} \]
or
\[ \ln\Gamma(n + 1) \approx (n + \tfrac{1}{2})\ln n - n + \tfrac{1}{2}\ln 2\pi \tag{A.15} \]


which is the well-known “Stirling’s formula.” We have, in this case, not
had to deform the contour at all, since $z_0$, being real, already lies on the
contour of integration.
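
The quality of the saddle-point estimate is easily checked against a library value of the Γ function. The following is a minimal sketch using only the Python standard library; the function name `gamma_saddle` is hypothetical.

```python
import math


def gamma_saddle(n):
    """Saddle-point estimate of Gamma(n) for n > 1.

    Uses f(z) = (n-1) ln z - z, the saddle point z0 = n-1,
    f''(z0) = -1/(n-1), and I ~ exp(f(z0)) * sqrt(2*pi / (-f''(z0))).
    """
    z0 = n - 1.0
    f0 = z0 * math.log(z0) - z0
    f2 = -1.0 / z0
    return math.exp(f0) * math.sqrt(2.0 * math.pi / (-f2))


ratio_10 = gamma_saddle(10.0) / math.gamma(10.0)
ratio_50 = gamma_saddle(50.0) / math.gamma(50.0)
```

The ratio approaches unity from below as $n$ grows, the relative error falling off roughly like $1/12(n - 1)$.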


(iii). The Spherical Hankel 
Functions 


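It is most convenient to work with the Hankel functions, because by
taking real and imaginary parts we can extract both the spherical Bessel
and Neumann functions. From (4.367) and (4.376) we see that $h_l^{(1)}(x)$
may be expressed as


\[ h_l^{(1)}(x) = -i\left(\frac{x}{2}\right)^l\frac{1}{l!}\,e^{ix}\int_0^\infty e^{-\sigma x}\sigma^l(\sigma - 2i)^l\,d\sigma \tag{A.16} \]


Let us concern ourselves for the present only with the integral. It may be
written


\[ I = \int_0^\infty e^{f(\sigma)}\,d\sigma \]
where
\[ f(\sigma) = -\sigma x + l\ln\sigma + l\ln(\sigma - 2i) \tag{A.17} \]
Thus
\[ f'(\sigma) = -x + \frac{l}{\sigma} + \frac{l}{\sigma - 2i} \tag{A.18} \]
and
\[ f''(\sigma) = -\frac{l}{\sigma^2} - \frac{l}{(\sigma - 2i)^2} \tag{A.19} \]


The “saddle points” are given by $f'(\sigma) = 0$ or


\[ x\sigma^2 - 2(l + ix)\sigma + 2li = 0 \tag{A.20} \]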


The solutions are


\[ \sigma = \frac{l + ix \pm \sqrt{l^2 - x^2}}{x} \tag{A.21} \]


Let us consider first the case $l > x$. It is convenient to write


\[ l = x\cosh u \tag{A.22} \]


the “saddle points” are then given by


\[ \sigma_\pm = i + e^{\pm u} \tag{A.23} \]
It is easily verified that
\[ f(\sigma_\pm) = -ix - xe^{\pm u} + l\ln(e^{\pm 2u} + 1) \tag{A.24} \]
and
\[ f''(\sigma_\pm) = \mp\frac{l\,e^{\mp u}\sinh u}{\cosh^2u} \tag{A.25} \]


How do we choose a deformed contour so as to get the best possible
approximation to the function from the integrals near the “saddle points”?


Consider first $\sigma_-$. At that point $f''$ is positive and real; the function is
given approximately by


\[ f(\sigma) \approx f(\sigma_-) + \tfrac{1}{2}(\sigma - \sigma_-)^2f''(\sigma_-) \tag{A.26} \]


If $\sigma$ goes through $\sigma_-$ following a path parallel to the real axis, it passes
through a minimum at $\sigma_-$. On the other hand, if the path is parallel to the
imaginary axis, it goes through a maximum at $\sigma_-$, and the imaginary part
of $f$ is constant along that path. Thus, we want to pass through $\sigma_-$ along a
course parallel to the imaginary axis.

Next consider $\sigma_+$. Since $f''(\sigma_+)$ is negative, the proper path through
this point is parallel to the real axis. Figure A.1 gives a sketch of a possible
contour which passes through both saddle points.

The contribution to the integral from $\sigma_-$ is


\[ \tfrac{1}{2}\,i\,e^{-ix}\,e^{-xe^{-u}}\,e^{-lu}\,(2\cosh u)^{l+1}\sqrt{\frac{2\pi}{l\,e^{u}\sinh u}} \]


The contribution to $h_l^{(1)}$ is then


\[ \frac{l^{l+1}}{l!\,x}\,e^{-xe^{-u} - lu}\sqrt{\frac{2\pi}{l\,e^{u}\sinh u}} \tag{A.27} \]

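Figure A.1


For small $x$ this should, of course, go like $x^l$. But if $x \ll l$,


\[ \cosh u \approx \sinh u \approx \tfrac{1}{2}e^u = \frac{l}{x} \tag{A.28} \]
so that (A.27) becomes


\[ \sqrt{\frac{\pi}{l}}\left(\frac{x}{2}\right)^l\frac{1}{l!} \tag{A.29} \]


Let us turn now to the contribution to the integral from $\sigma_+$. It is


\[ \tfrac{1}{2}\,e^{-ix}\,e^{-xe^{u}}\,e^{lu}\,(2\cosh u)^{l+1}\sqrt{\frac{2\pi}{l\,e^{-u}\sinh u}} \]


This therefore contributes to $h_l^{(1)}$ the term


\[ -i\,\frac{l^{l+1}}{l!\,x}\,e^{-xe^{u} + lu}\sqrt{\frac{2\pi}{l\,e^{-u}\sinh u}} \tag{A.30} \]


Let us see how this behaves for $x \ll l$. Using (A.28) we have


\[ -i\left(\frac{2l}{x}\right)^{l+1}\frac{l^l}{l!}\,e^{-2l}\sqrt{\frac{\pi}{l}} \tag{A.31} \]


Combining (A.27) and (A.30) or, for $x \ll l$, (A.29) and (A.31), we have
approximately, for $x < l$,


\[ h_l^{(1)}(x) \approx \frac{l^{l+1}}{l!\,x}\,e^{-xe^{-u} - lu}\sqrt{\frac{2\pi}{l\,e^{u}\sinh u}} - i\,\frac{l^{l+1}}{l!\,x}\,e^{-xe^{u} + lu}\sqrt{\frac{2\pi}{l\,e^{-u}\sinh u}} \tag{A.32} \]


\[ \approx \sqrt{\frac{\pi}{l}}\left(\frac{x}{2}\right)^l\frac{1}{l!} - i\left(\frac{2l}{x}\right)^{l+1}\frac{l^l}{l!}\,e^{-2l}\sqrt{\frac{\pi}{l}} \tag{A.33} \]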

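But since the real and imaginary parts are the spherical Bessel and
Neumann functions, respectively, we have


\[ j_l(x) \approx \frac{l^{l+1}}{l!\,x}\,e^{-xe^{-u} - lu}\sqrt{\frac{2\pi}{l\,e^{u}\sinh u}} \qquad (x < l) \tag{A.34} \]


\[ \approx \sqrt{\frac{\pi}{l}}\left(\frac{x}{2}\right)^l\frac{1}{l!} \qquad (x \ll l) \tag{A.35} \]
and
\[ n_l(x) \approx -\frac{l^{l+1}}{l!\,x}\,e^{-xe^{u} + lu}\sqrt{\frac{2\pi}{l\,e^{-u}\sinh u}} \qquad (x < l) \tag{A.36} \]
\[ \approx -\left(\frac{2l}{x}\right)^{l+1}\frac{l^l}{l!}\,e^{-2l}\sqrt{\frac{\pi}{l}} \qquad (x \ll l) \tag{A.37} \]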


We see, then, that the major contribution to the real part of $h_l^{(1)}$ comes
from the neighborhood of $\sigma_-$, and to the imaginary part from $\sigma_+$. Provided
$x$ is not too close to $l$, these two points remain sufficiently well separated
that the chosen contour can be used. As $x \to l$, the two saddle points
approach each other, and the method runs into rather serious difficulties.
At $x = l$, not only is $f'(\sigma)$ zero at $\sigma = 1 + i$, but $f''(\sigma)$ is also, and the
leading term in the expansion of $f(\sigma)$ is a cubic one. Thus, for $x$ near $l$,
it is essential to take account of the cubic terms. The situation is then
more difficult, but formulas for large $l$ may still be obtained. They are,
however, not of sufficient interest in the problems we are considering to
be included here.

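It is perhaps logical to write the formulas for $j_l$ and $n_l$ only in terms of
$u$ and $l$. Using (A.22) to eliminate $x$, we find that


\[ j_l(x) \approx \sqrt{\frac{2\pi}{l\,e^{u}\sinh u}}\;e^{-l}\,\frac{l^l}{l!}\,\cosh u\;e^{-l(u - \tanh u)} \tag{A.38} \]
and
\[ n_l(x) \approx -\sqrt{\frac{2\pi}{l\,e^{-u}\sinh u}}\;e^{-l}\,\frac{l^l}{l!}\,\cosh u\;e^{l(u - \tanh u)} \tag{A.39} \]


Using (A.15) for the factorials of large integers, these become


\[ j_l(x) \approx \frac{\cosh u}{l\sqrt{e^{u}\sinh u}}\,e^{-l(u - \tanh u)} \tag{A.40} \]


and


\[ n_l(x) \approx -\frac{\cosh u}{l\sqrt{e^{-u}\sinh u}}\,e^{l(u - \tanh u)} \tag{A.41} \]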
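
The large-order formula for the Neumann function can be compared with values generated by the standard upward recurrence, which is numerically stable for $n_l$. The sketch below assumes the form $n_l(x) \approx -[\cosh u/(l\sqrt{e^{-u}\sinh u})]\,e^{l(u - \tanh u)}$ with $l = x\cosh u$; the function names and the test point $l = 20$, $x = 10$ are illustrative choices, not taken from the text.

```python
import math


def n_exact(l, x):
    """Spherical Neumann function n_l(x) by upward recurrence from n_0 and n_1."""
    n_prev = -math.cos(x) / x                            # n_0
    if l == 0:
        return n_prev
    n_curr = -math.cos(x) / x**2 - math.sin(x) / x       # n_1
    for k in range(1, l):
        # n_{k+1} = (2k+1)/x * n_k - n_{k-1}
        n_prev, n_curr = n_curr, (2 * k + 1) / x * n_curr - n_prev
    return n_curr


def n_saddle(l, x):
    """Saddle-point estimate of n_l(x) for x < l, written in terms of u."""
    u = math.acosh(l / x)                                # l = x cosh u
    prefactor = math.cosh(u) / (l * math.sqrt(math.exp(-u) * math.sinh(u)))
    return -prefactor * math.exp(l * (u - math.tanh(u)))


l, x = 20, 10.0
exact = n_exact(l, x)
approx = n_saddle(l, x)
relative_error = abs(approx / exact - 1.0)
```

At this test point the two values agree closely, which is consistent with the remark that the estimate of $n_l$ reproduces the series behavior even where its formal justification is weak.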


To see how rapidly $j_l$ decreases as $x$ decreases below $x = l$, put
$\tanh u = y$. Then
\[ l(u - \tanh u) = l(\tanh^{-1}y - y) \approx \tfrac{1}{3}ly^3 \]


for $y$ not too large.
Putting $x = l - x'$, where $x' \ll l$,


\[ y = \tanh u \approx \left(\frac{2x'}{l}\right)^{1/2} \]


Therefore, approximately


\[ l(u - \tanh u) \approx \tfrac{1}{3}\,l\left(\frac{2x'}{l}\right)^{3/2} \]


Thus, for the exponent to be equal to $\gamma$, we must have


\[ x' = \tfrac{1}{2}(3\gamma)^{2/3}\,l^{1/3} \]


What are the conditions that these formulas yield reasonably good
approximation? Basically, it is that $f''$ be large near the saddle points,
since then the integrand drops off rapidly on each side of the pass. We
note that


\[ f''(\sigma_\pm) = \mp\sqrt{l^2 - x^2}\left[1 \mp \sqrt{1 - (x^2/l^2)}\right] \]


Therefore, the approximation to $j_l$ depends on $\sqrt{l^2 - x^2}\,[1 + \sqrt{1 - (x^2/l^2)}]$
being large, while that to $n_l$ depends on $\sqrt{l^2 - x^2}\,[1 - \sqrt{1 - (x^2/l^2)}]$ being
large. Since these quantities are, in fact, exponents in Gaussian functions
in the integrand, the values do not in fact have to be very large. The
approximation to $n_l$ appears to have a considerably narrower range of
validity than has that to $j_l$. Although it appears from the above considera-
tions that (A.37) may not be reliable, it does in fact give the correct
behavior for small $x$. In fact, using Stirling’s formula for large $l$, it becomes


\[ n_l(x) \approx -\frac{1}{\sqrt{2}\,l}\left(\frac{2l}{x}\right)^{l+1}e^{-l} \tag{A.42} \]


and this is precisely what is obtained from the series expansion [see
(4.377)].

We have pointed out that when $x$ is in the neighborhood of $l$, the two
“saddle points” are close to each other, and the situation becomes
complicated. However, once $x$ becomes substantially larger than $l$, the
method of steepest descents again gives reasonable results. We start
again from (A.21) but now put

\[ l = x\cos v \tag{A.43} \]


In terms of $v$ the saddle points are then


\[ \sigma_\pm = i + e^{\pm iv} \tag{A.44} \]
It is also easily verified that


\[ f(\sigma_\pm) = -ix - xe^{\pm iv} \pm ilv + l\ln(2\cos v) \tag{A.45} \]
and
\[ f''(\sigma_\pm) = \mp\,il\,e^{\mp iv}\,\frac{\sin v}{\cos^2v} \tag{A.46} \]


How, then, in this case, do we define the deformed contours? They
should pass through the points (A.44) in the directions along which the
imaginary part of $f(\sigma)$ is constant. Now the saddle points are located as indicated
in Fig. A.2.

To see in what directions the contours should pass through these
points, we put $z = \sigma - \sigma_\pm = |z|\,e^{i\delta_\pm}$. The dependence of $f$ on $z$ near this
point is then given by $\tfrac{1}{2}|z|^2e^{2i\delta_\pm}f''(\sigma_\pm)$. In order that the imaginary part
should not depend on $z$ and for the coefficient of $|z|^2$ to be negative, we
must take the total phase to be $\pi$ or $-\pi$. Therefore let us take


\[ \mp\frac{\pi}{2} \mp v + 2\delta_\pm = \mp\pi \]


so that $\delta_\pm = \pm(v/2) \mp (\pi/4)$. Since we may take (for positive $x$) $v \leq (\pi/2)$,
the directions which must be traversed (or their opposites) are as shown in
the figure; the angles which the curves make with the horizontal are
$(\pi/4) - (v/2)$.


Figure A.2


The simplest way in which to get an asymptotic evaluation is to choose
the contour $C$ through $\sigma_-$. The value of the integral is, then,


\[ I \approx \tfrac{1}{2}(2\cos v)^{l+1}\sqrt{\frac{2\pi}{l\sin v}}\,\exp\left\{-ix - xe^{-iv} - i\left[\left(l + \tfrac{1}{2}\right)v - \frac{\pi}{4}\right]\right\} \]


It follows from (A.16) that


\[ h_l^{(1)}(x) \approx -i\left(\frac{x}{2}\right)^l\frac{1}{l!}\,e^{ix}\sqrt{\frac{2\pi}{l\sin v}}\;2^l\cos^{l+1}v\,\exp\left\{-ix - xe^{-iv} - i\left[\left(l + \tfrac{1}{2}\right)v - \frac{\pi}{4}\right]\right\} \tag{A.48} \]


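Using Stirling’s formula for $l!$, substituting $x = l\sec v$, and simplifying
we obtain


\[ h_l^{(1)}(x) \approx \frac{\cos v}{l\sqrt{\sin v}}\,\exp i\left[l\tan v - \left(l + \tfrac{1}{2}\right)v - \frac{\pi}{4}\right] \tag{A.49} \]


Real and imaginary parts may now be identified to give $j_l(x)$ and $n_l(x)$. If


\[ x \gg l, \qquad v \approx \frac{\pi}{2}, \qquad \sin v \approx 1 \]


we get the familiar asymptotic form for large $x$:
\[ h_l^{(1)}(x) \approx \frac{1}{x}\,\exp i\left[x - \left(l + \tfrac{1}{2}\right)\frac{\pi}{2} - \frac{\pi}{4}\right] \tag{A.50} \]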
APPENDIX 5B: Units in Electromagnetic Theory 


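The two most commonly used systems of units are the mks and the
Gaussian. Maxwell’s equations in the two systems are


mks                                                        Gaussian
$\nabla\cdot\mathbf{D} = \rho$                             $\nabla\cdot\mathbf{D} = 4\pi\rho$
$\nabla\cdot\mathbf{B} = 0$                                $\nabla\cdot\mathbf{B} = 0$
$\nabla\times\mathbf{E} = -\partial\mathbf{B}/\partial t$          $\nabla\times\mathbf{E} = -(1/c)\,\partial\mathbf{B}/\partial t$
$\nabla\times\mathbf{H} = \mathbf{J} + \partial\mathbf{D}/\partial t$      $\nabla\times\mathbf{H} = (4\pi/c)\,\mathbf{J} + (1/c)\,\partial\mathbf{D}/\partial t$
$\mathbf{D} = \epsilon\epsilon_0\mathbf{E}$                $\mathbf{D} = \epsilon\mathbf{E}$
$\mathbf{B} = \mu\mu_0\mathbf{H}$                          $\mathbf{B} = \mu\mathbf{H}$


Lorentz’s equations of motion for a charged particle in an electromagnetic
field are


mks:       $m\,d\mathbf{V}/dt = e(\mathbf{E} + \mathbf{V}\times\mathbf{B})$
Gaussian:  $m\,d\mathbf{V}/dt = e(\mathbf{E} + (1/c)\,\mathbf{V}\times\mathbf{B})$


Physical quantity     mks unit             Gaussian unit       mks unit ÷ Gaussian unit
length                meter                centimetre          $10^2$
mass                  kilogram             gram                $10^3$
time                  second               second              1
force                 newton               dyne                $10^5$
energy                joule                erg                 $10^7$
charge                coulomb              statcoulomb         $3\times10^9$
charge density        coulomb/meter$^3$    statcoulomb/cm$^3$  $3\times10^3$
current               ampere               statampere          $3\times10^9$
current density       ampere/meter$^2$     statampere/cm$^2$   $3\times10^5$
electric field        volt/meter           statvolt/cm         $\tfrac{1}{3}\times10^{-4}$
potential             volt                 statvolt            $\tfrac{1}{3}\times10^{-2}$
conductivity          mho/meter            sec$^{-1}$          $9\times10^9$
magnetic induction    weber/meter$^2$      gauss               $10^4$
magnetic field        ampere-turn/meter    oersted             $4\pi\times10^{-3}$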


PRELUDE TO CHAPTER 6 


This chapter is concerned with two different, but often 
related, types of problem. One concerns the diffusion 
equation, which is linear in the time derivative and so 
describes irreversible processes. The heat conduction 
equation has the same form. The second is the equa- 
tion which appears in the theory of diffusion of thermal 
neutrons in reactors, of dissipative media and skin 
effect in electromagnetic theory, and of supercon- 
ducting electrodynamics. These latter phenomena are 
all characterized by exponential decay of the relevant 
physical quantity with some coordinate. 

The chapter opens with a discussion of time- 
irreversible phenomena, illustrated with the linear 
differential equation of second order. The diffusion 
equation is then derived from the continuity equation 
and Fick’s law, which states that the diffusion current 
is proportional to the density gradient. The heat con- 
duction equation is derived in a similar way, and leads 
to the same kind of differential equation. Boundary 
conditions are discussed for each problem. 

The Green’s function for an infinite medium is 
derived by transform methods, as was done in the 
case of the wave equation. The Green’s functions for 
semi-infinite media with various boundary conditions 
are derived by the method of images. The same 
method is used for a slab (finite one-dimensional 
medium). 

The solution of the latter problem is also obtained 
by Fourier series. This solution can be obtained from 
the image one, or conversely, by the use of Poisson’s 
summation formula (Chapter 4). 

The Green’s function is calculated for spherical 
geometry, and the relation to the general formulas 
relating point, plane, and spherical shell Green’s 
functions, introduced in Chapter 3, is discussed, with 
particular reference to the role of boundary conditions. 

The next problem treated is analogous to one dis-
cussed in the preceding chapter for waves: that of the
solution determined by given initial and boundary
conditions. Again, it is shown that each condition 
effectively provides a distribution of sources, de- 
scribed by integrals over Green’s functions; once 
again, the Green’s functions are seen to be the same 
as those for real sources. 

It is then noted that the time-dependent Schrödinger
wave equation differs from the heat conduction
equation only in that its time derivative term has an
imaginary rather than a real coefficient. Thus, the
Green’s function for this problem may be derived in a
manner quite parallel to that developed for the heat
conduction equation.

We now turn to the problem of neutron diffusion 
in chain reactors. A simple standard model is used— 
high-energy neutrons are assumed to slow down to 
thermal equilibrium with their surroundings, to diffuse 
in this state, and then to be captured; they may or may 
not then give rise to fission which produces new high- 
energy neutrons. (But energetic and thermal neutrons 
may of course be lost by escape.) The model is dis-
cussed by treating the coupled equations of “fast” and
“thermal” neutrons. The criticality conditions are
derived in general, and the influence of fission neutrons 
emitted after a delay on the rate of change of neutron 
intensity is considered. 

Fourier transform theory is next used to introduce 
the notions of diffusion length of thermal neutrons 
(root mean square distance travelled during diffusion), 
slowing-down length (root mean square distance 
travelled during the moderation or slowing-down
process), and “migration length” (rms distance traveled
from birth to death). The influence of fission on these 
concepts is also determined. 

We then turn to the propagation of electromagnetic
waves in a dissipative medium. The equation
here is both wavelike and diffusionlike, containing
both first- and second-order derivatives in time. The
appropriate Green’s function is derived. The problem
of propagation into a conducting medium is also
discussed; it is shown that there is exponential decay of
the fields into the medium, characterized by the
so-called “skin depth,” which depends on the frequency
of the wave. The skin effect phenomenon is discussed
in general by means of an integral equation derived by
considering the field outside the absorbing medium as
a source.

The electrodynamics of superconducting media
are shown to be described, under suitable assumptions,
by an equation which is wavelike in time and
neutron-diffusion-like in space. A generalization to
take account of the “coherence effects” first noted by
Pippard is also introduced; it is found that it gives
rise to no fundamental difficulties (but practical
complications) when the Green’s function is determined by
Fourier transform methods. If the “coherence length”
goes to zero (London limit) the Green’s functions are
found to take on relatively simple forms involving
Bessel functions.

Finally, it is noted that the meson field of Yukawa 
has the same mathematical form as that of super- 
conducting electrodynamics in the London limit, so 
that the preceding results can be carried over to 
classical meson theory. 


REFERENCES 


Carslaw, H. S. and J. C. Jaeger, Conduction of Heat in Solids. New York: 
Oxford University Press, 1959, 2nd ed. 


London, F., Superfluids. New York: Dover Publications, 1961, 2nd ed., Vol. 1. 


Morse, P. M. and H. Feshbach, Methods of Theoretical Physics. New York: 
McGraw-Hill, 1953. 


Sommerfeld, A., Partial Differential Equations in Physics. New York: Academic, 
1949, Chapter 3. 


Wallace, P. R., Nucleonics 4, Nos. 2, 3 (1949). 


Wigner, E. P. and A. M. Weinberg, The Physical Theory of Neutron Chain 
Reactors. Chicago: University of Chicago Press, 1958. 


PROBLEMS OF 
DIFFUSION AND 
ATTENUATION 


“You can never plan the future by the past”
Edmund Burke


1. Introduction 


The problems of wave propagation with which the last chapter was 
concerned had the property that the basic equations were time-reversible, 
that is, were unaltered when the time variable t was replaced by —t. 
Nevertheless, the direction of time had to be built into a theory which 
purported to describe nature. This was accomplished by assumptions 
extraneous to the field equations themselves; for instance, in problems of 
radiation a selection was made of retarded potentials, which represent 
a limitation to solutions such that a source at t’ produces a field only at 
time t > t’. Mathematically, advanced potentials, in which t is replaced 
by —t, are equally possible, as are linear combinations of the two types. 

A very simple example will illustrate the point in question.
Consider Green’s function for the harmonic oscillator equation; it is the
solution of

$$\frac{d^2 g}{dt^2} + \lambda^2 g = -\delta(t - t') \tag{6.1}$$


If we take the Fourier transform of this equation, which is of course
time-reversible, we obtain

$$(\omega^2 - \lambda^2)\,\hat{g}(\omega) = \frac{1}{2\pi}\, e^{-i\omega t'} \tag{6.2}$$

where $\hat{g}(\omega)$ is the Fourier transform of g(t). If now we try to invert the
transform to obtain g(t − t′),

$$g(t - t') = \frac{1}{2\pi} \int_{-\infty}^{\infty} \frac{e^{i\omega(t - t')}}{\omega^2 - \lambda^2}\, d\omega \tag{6.3}$$

the problem arises that the transform has poles on the real axis. A formal
way of overcoming the difficulty is to displace the contour slightly either
above or below the real axis, treating ω as a complex variable. If the
contour is displaced upward by ε, the poles lie below the contour. If
t > t′, the integral can be evaluated by deforming the contour around the
upper half-plane, and the value of the integral is found to be zero. For
t < t′, it must be deformed around the lower half-plane, and g(t − t′) is
found to be

$$g = \frac{1}{\lambda} \sin \lambda (t - t') \tag{6.4}$$


This represents an anticausal situation; the source makes itself felt 
only at earlier times. 

If, on the other hand, the contour in (6.3) is displaced downward by
ε, g = 0 for t < t′, and becomes, for t > t′,

$$g = \frac{1}{\lambda} \sin \lambda (t' - t) \tag{6.5}$$


This solution is then causal. 

The arbitrariness does not arise in real systems since there must
always be some dissipative mechanism present. This can be represented
by a frictional resistance term κ(dg/dt). Such a term is of course not
time-reversible. With κ > 0, the term κ(dg/dt) added to the left-hand side
of (6.1) gives rise to dissipation of energy from the oscillator with
increasing t. A reversal of the sign of this term would correspond to an
increase in energy with time. Thus, the dissipative process serves to
define a direction of time for the problem. But it also removes all ambiguity
from the mathematical problem. Taking the Fourier transform of the
modified problem, we find that

$$(\omega^2 - i\kappa\omega - \lambda^2)\,\hat{g} = \frac{1}{2\pi}\, e^{-i\omega t'}$$

or

$$\hat{g} = \frac{1}{2\pi}\, \frac{e^{-i\omega t'}}{(\omega - (i\kappa/2))^2 - (\lambda^2 - (\kappa^2/4))} \tag{6.6}$$

We assume that λ² − (κ²/4) > 0. We see that the poles of the integrand
are in the upper half-plane. But for t > t′ the contour must be taken
around that half-plane, while for t < t′ it must be taken around the lower.
It then follows automatically that g = 0 for t′ > t and is different from
zero for t′ < t, that is, it is causal.
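The causal result can be checked numerically. The sketch below is not taken from the text: it assumes the closed form obtained by closing the contour of (6.6) in the upper half-plane, g = −exp(−κτ/2) sin(Ωτ)/Ω for τ = t − t′ > 0 with Ω² = λ² − κ²/4, and the function names are illustrative. It confirms by finite differences that this g satisfies the damped equation away from the source and has the slope jump of −1 required by the right-hand side of (6.1).

```python
import math

def causal_green(tau, lam, kappa):
    """Assumed closed form of g for g'' + kappa*g' + lam^2*g = -delta(tau), tau = t - t'."""
    if tau <= 0.0:
        return 0.0                                   # no response before the source acts
    omega = math.sqrt(lam**2 - kappa**2 / 4.0)       # requires lam^2 > kappa^2/4
    return -math.exp(-0.5 * kappa * tau) * math.sin(omega * tau) / omega

def ode_residual(tau, lam, kappa, h=1e-3):
    """Central-difference estimate of g'' + kappa*g' + lam^2*g at tau > h."""
    g0 = causal_green(tau, lam, kappa)
    gp = causal_green(tau + h, lam, kappa)
    gm = causal_green(tau - h, lam, kappa)
    return (gp - 2.0 * g0 + gm) / h**2 + kappa * (gp - gm) / (2.0 * h) + lam**2 * g0

lam, kappa = 2.0, 0.3
# Largest residual on a grid of times after the source (should be close to zero).
residual = max(abs(ode_residual(0.05 * k, lam, kappa)) for k in range(2, 200))
# Slope just after the source; the slope just before it is zero.
slope_jump = causal_green(1e-6, lam, kappa) / 1e-6
print(residual, slope_jump)
```

Setting kappa to a very small value in this sketch recovers the causal solution (6.5), which is how the dissipative term singles out one of the two contour choices.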

It is primarily with this sort of problem, involving terms of the first 
order in time and therefore not time-reversible, that we deal in the present 
chapter. We are concerned primarily with problems of thermal conduc- 
tion; of diffusion, including neutron diffusion; and of electromagnetic 
wave propagation with losses. However, at the end of the chapter, we 
also treat problems of spatial attenuation, which are mathematically 
related in some of their features to the above problems. A problem of 
particular interest is that of superconducting electrodynamics. 


2. Diffusion in a Gas or a Solid 


We consider the diffusion of a substance in a uniform background, 
which may be either gaseous or solid. The basic equations are two in 
number: (i) an equation of continuity, which relates changes in density 
of the diffusing substance to its flow, and (ii) a ‘“‘constitutive”’ relation 
which specifies how the flow is determined by the density distribution. 
The first relation is trivial and contains no reference to physical mechan- 
isms; it simply asserts the indestructibility of matter. The second is an 
expression of the physical mechanism of the diffusion process. 

Let the density (number of particles per unit volume of the substance)
be designated ρ(r, t) and the current j(r, t), where j(r, t) · n dS is the
number of particles flowing per unit time across an element of surface dS
to which n is the unit normal vector. The continuity equation is, then,

$$\frac{\partial \rho}{\partial t} + \nabla \cdot \mathbf{j} = 0 \tag{6.7}$$

So far as the constitutive equation is concerned, we assume “Fick’s law”:

$$\mathbf{j} = -D \nabla \rho \tag{6.8}$$


We do not enter here into the question of the proof of this equation, 
though with special assumptions it will be proven in the next chapter. In 
the case of diffusion in a solid, various particular assumptions have to be 
made, in particular isotropy (without which the D would become a tensor 
quantity). 

Substituting (6.8) in (6.7) leads to the “diffusion equation”

$$\frac{\partial \rho}{\partial t} = D \nabla^2 \rho \tag{6.9}$$

This is, of course, of first order in time, and not time-reversible. The way
in which the direction of time is specified here is made clear by imagining
the physical consequences of reversing the sign of t. This would imply a
flow from regions of low density to regions of high density. The direction
of time is then specified by the tendency toward increasing entropy in
the process of time evolution. It is the statistical character of the
phenomenon, rather than a mechanism of dissipation, which is crucial.

A word about boundary conditions would be in order. If diffusion 
takes place within an enclosure, the appropriate condition is that the 
normal component of current, —Dn- Vp, is zero on the boundary. If 
escape of particles is possible over the boundary (for example, for diffus- 
ing particles of gas in a solid) the situation becomes more complicated. It 
may be shown in fact that an appropriate boundary condition is to put the 
density zero at or near the boundary. The difficulty is that Fick’s law can 
in general be proven only if the concentration gradient is not too large. 
Since at a surface all particles are flowing outward (i.e., the angular
distribution is such that the current is zero over 2π steradians) the
concentration gradient becomes large, and Fick’s law is no longer valid. The
same is true in the neighborhood of a source. Although, strictly speaking, 
the problem should be treated in these cases in a more accurate way 
(by means of a transport equation), for many purposes it is sufficient to 
use the approximation of elementary diffusion theory, especially if one is 
not specifically interested in the details of the solution near sources or 
boundaries. 

At an interface between media, the conditions to be applied to the 
solution are 

(a) that the normal component of current — Dn - Vp is continuous 

(b) that the density itself is continuous at all points of the surface.
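The roles of the continuity equation, Fick’s law, and the zero-current boundary condition can be illustrated with a small one-dimensional calculation. This is a minimal sketch, not a method from the text: the explicit finite-difference scheme, the grid, and the name `diffuse` are assumptions made for illustration. Each step computes the current between neighbouring cells from Fick’s law and then updates the density from the continuity equation, with no current through the end walls.

```python
def diffuse(rho, D, dx, dt, steps):
    """Explicit update of d(rho)/dt = D d2(rho)/dx2 in an enclosure with zero-flux walls."""
    r = D * dt / dx**2                 # the explicit scheme is stable for r <= 0.5
    rho = list(rho)
    n = len(rho)
    for _ in range(steps):
        # Amount carried from cell i to cell i+1 in one step (Fick's law, j = -D d(rho)/dx).
        flux = [-r * (rho[i + 1] - rho[i]) for i in range(n - 1)]
        flux = [0.0] + flux + [0.0]    # normal current vanishes on both walls
        # Continuity: change in a cell = inflow through left face - outflow through right face.
        rho = [rho[i] - (flux[i + 1] - flux[i]) for i in range(n)]
    return rho

initial = [0.0] * 50
initial[10] = 1.0                      # all the diffusing substance starts in one cell
final = diffuse(initial, D=1.0, dx=1.0, dt=0.25, steps=400)
print(sum(final), max(final))
```

The total amount of substance is unchanged while the peak spreads out and flattens, which is the irreversible behaviour described above.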


3. Conduction of Heat in a Solid


Consider a solid containing a heat source distribution S(r, t), that is,
S(r, t) is the amount of heat energy produced per unit volume, per
unit time at the point r and the time t. Let U(r, t) be the heat energy
per unit volume and j(r, t) · n dS the heat energy flowing per unit time over
area dS with unit normal vector n. The equation of continuity of energy
then has the form

$$\frac{\partial U}{\partial t} + \nabla \cdot \mathbf{j} = S \tag{6.10}$$

an equation identical in form to that for diffusion.

Let us write both terms on the left-hand side of (6.10) in terms of
the temperature. If the temperature gradients are not too large, it may
be shown by the use of the standard solid-state theory¹ that the heat flux
is proportional to the negative gradient of temperature

$$\mathbf{j} = -\kappa \nabla T \tag{6.11}$$

κ being the thermal conductivity and T the temperature.

¹See, for example, Mott, N. F. and H. Jones, Properties of Metals and Alloys. New
York: Oxford University Press, 1936, pp. 305-307.

As for the first term, we know that changes in temperature produce
proportionate changes in heat content, the proportionality constant being
the specific heat C. Thus

$$\frac{\partial U}{\partial t} = C \frac{\partial T}{\partial t} \tag{6.12}$$

Consequently we obtain an equation for the temperature,

$$\frac{\partial T}{\partial t} - \frac{\kappa}{C} \nabla^2 T = \frac{S}{C} \tag{6.13}$$

which again has the form of a diffusion equation.

Possible boundary conditions may be specified as follows:

(a) At an insulating surface, there will be no heat flow perpendicular
to the surface, so that

$$\kappa \frac{\partial T}{\partial n} = 0 \tag{6.14}$$

(b) If a surface is held at a constant temperature T_0, then over the
surface T = T_0. Since (6.13) contains only derivatives, the zero of
temperature may be arbitrarily chosen, and it is convenient to choose it so
that T_0 = 0.

(c) If there is radiation from the surface into an effective vacuum or
a gas, a more complicated condition applies. The rate of radiation from a
medium at temperature T is known to be given by Stefan’s fourth-power
law, that is, it is of the form σT⁴. If the surface of the medium is at
temperature T and its surroundings at temperature T_0, the net rate of
radiation is

$$-\kappa \frac{\partial T}{\partial n} = \sigma (T^4 - T_0^4) \tag{6.15}$$

We must now measure the temperature T on the Kelvin scale. Equation
(6.15) leads to very difficult mathematical problems in general. However,
if we are concerned only with a modest range of temperature,

$$T^4 - T_0^4 \approx 4 T_0^3 (T - T_0) \tag{6.16}$$

This is a good approximation provided

$$T - T_0 \ll T_0$$

Using this approximation, the boundary condition becomes

$$-\kappa \frac{\partial}{\partial n} (T - T_0) = 4 \sigma T_0^3 (T - T_0) \tag{6.17}$$

or

$$\frac{\partial}{\partial n} (T - T_0) = -\alpha (T - T_0) \tag{6.18}$$

where

$$\alpha = \frac{4 \sigma T_0^3}{\kappa} \tag{6.19}$$

It is now possible to return to a scale in which T_0 = 0, and the boundary
condition is of a familiar linear type.

(d) At an interface, T will be continuous and the normal heat flux
κ(∂T/∂n) will be continuous.
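The quality of the linearization used in boundary condition (c) is easy to quantify. The following sketch is an illustration only; the function names and the sample temperatures are assumptions. It compares the exact fourth-power difference with its linear approximation. Writing x = (T − T_0)/T_0, the ratio of the two is 1 + 3x/2 + x² + x³/4, so the relative error is about 1.5 times the fractional temperature excess.

```python
def net_radiation_exact(T, T0, sigma=1.0):
    """Net radiation rate sigma * (T^4 - T0^4), temperatures on the Kelvin scale."""
    return sigma * (T**4 - T0**4)

def net_radiation_linear(T, T0, sigma=1.0):
    """Linearized rate 4 * sigma * T0^3 * (T - T0), valid for T - T0 << T0."""
    return 4.0 * sigma * T0**3 * (T - T0)

def relative_excess(T, T0):
    """Fraction by which the exact rate exceeds the linearized one."""
    return net_radiation_exact(T, T0) / net_radiation_linear(T, T0) - 1.0

for excess in (3.0, 30.0, 150.0):      # illustrative surface temperatures above T0 = 300 K
    print(excess, relative_excess(300.0 + excess, 300.0))
```

For a surface a few kelvin above surroundings at room temperature the error is of order one percent, while for an excess comparable to T_0 the linear form is no longer adequate.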


4. Green’s Functions for Diffusion 
and Heat Conduction Problems 


Let us first consider a point source at time t_0 in an infinite medium.
With no loss of generality, we may take t_0 = 0. The solution is then the
infinite-medium Green’s function

$$\frac{\partial G}{\partial t} - \beta \nabla^2 G = \delta(\mathbf{r})\,\delta(t) \tag{6.20}$$

If we take a Fourier transform in spatial coordinates and a Laplace
transform in time, and call the transformed function G_1(k, s), this function
satisfies

$$(s + \beta k^2)\, G_1(\mathbf{k}, s) = \frac{1}{(2\pi)^3}$$

or

$$G_1(\mathbf{k}, s) = \frac{1}{(2\pi)^3}\, \frac{1}{s + \beta k^2} \tag{6.21}$$

If we now take the inverse Laplace transform to obtain G_1(k, t), we obtain

$$G_1(\mathbf{k}, t) = \frac{1}{(2\pi)^3}\, e^{-\beta k^2 t} \tag{6.22}$$
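The Green’s function G(r, t) is now the inverse Fourier transform

$$G(\mathbf{r}, t) = \frac{1}{(2\pi)^3} \int e^{-\beta k^2 t}\, e^{i \mathbf{k} \cdot \mathbf{r}}\, d^3k \tag{6.23}$$

Integrating in the familiar way over the angles of k gives

$$G(r, t) = \frac{1}{(2\pi)^3} \int_0^{\infty} e^{-\beta k^2 t}\, 4\pi\, \frac{\sin kr}{kr}\, k^2\, dk$$

$$= \frac{1}{4\pi^2 r}\, \mathrm{Im} \int_{-\infty}^{\infty} e^{-\beta k^2 t}\, e^{ikr}\, k\, dk$$

$$= -\frac{1}{4\pi^2 r}\, \frac{\partial}{\partial r} \int_{-\infty}^{\infty} e^{-\beta k^2 t}\, e^{ikr}\, dk \tag{6.24}$$

This may be written

$$-\frac{1}{4\pi^2 r}\, \frac{\partial}{\partial r} \left\{ e^{-r^2/4\beta t} \int_{-\infty}^{\infty} \exp\left[ -\left( k\sqrt{\beta t} - \frac{ir}{2\sqrt{\beta t}} \right)^2 \right] dk \right\}$$

The contour between −∞ and ∞ may be replaced by an alternative one
displaced parallel to the real axis by ir/2βt, since there is no
singularity of the integrand between them. The integral is then a familiar
Gaussian integral, the value of which is $\sqrt{\pi/\beta t}$. Therefore,

$$G(r, t) = -\frac{1}{4\pi^2 r} \sqrt{\frac{\pi}{\beta t}}\, \frac{\partial}{\partial r}\, e^{-r^2/4\beta t} = \left( \frac{1}{4\pi\beta t} \right)^{3/2} e^{-r^2/4\beta t} \tag{6.25}$$

The above derivation is of course valid for t > 0; for t < 0, G(r, t) = 0.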
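A numerical cross-check of (6.25) may be helpful. The sketch below is illustrative and uses assumed names and step counts: it evaluates the radial form of the inverse transform, (1/2π²r) ∫ exp(−βk²t) k sin(kr) dk over k from 0 to ∞, by the trapezoid rule and compares it with the closed form. It also checks that the closed form integrates to unity over all space, as it should for a unit source.

```python
import math

def green_closed_form(r, t, beta):
    """Closed form (6.25), valid for t > 0."""
    return (4.0 * math.pi * beta * t) ** -1.5 * math.exp(-r * r / (4.0 * beta * t))

def green_from_k_integral(r, t, beta, n=20000):
    """(1 / (2 pi^2 r)) * integral_0^inf exp(-beta k^2 t) k sin(k r) dk, trapezoid rule."""
    k_max = 10.0 / math.sqrt(beta * t)     # the Gaussian factor is below exp(-100) beyond this
    dk = k_max / n
    total = 0.0
    for i in range(1, n):                  # the integrand vanishes at k = 0 and is negligible at k_max
        k = i * dk
        total += math.exp(-beta * k * k * t) * k * math.sin(k * r)
    return total * dk / (2.0 * math.pi**2 * r)

def total_heat(t, beta, n=20000):
    """4 pi * integral_0^inf G(r, t) r^2 dr by the trapezoid rule."""
    r_max = 20.0 * math.sqrt(beta * t)
    dr = r_max / n
    return sum(4.0 * math.pi * (i * dr) ** 2 * green_closed_form(i * dr, t, beta)
               for i in range(1, n)) * dr

print(green_from_k_integral(1.0, 0.5, 1.0), green_closed_form(1.0, 0.5, 1.0))
print(total_heat(0.5, 1.0))
```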


Problem 6-1: Show that the one-dimensional
Green’s function, representing the solution due to a source of unit
strength per unit area on the plane z = 0, is

$$G_{pl}(z, t) = \frac{1}{(4\pi\beta t)^{1/2}}\, e^{-z^2/4\beta t}$$


Problem 6-2: Show that the solution for a spherical
shell source at r′ is

$$G_{sh}(r, r', t) = \frac{1}{(4\pi\beta t)^{1/2}}\, \frac{1}{2\pi r r'}\, e^{-(r^2 + r'^2)/4\beta t}\, \sinh \frac{r r'}{2\beta t}$$

5. Green’s Function in One 
Dimension — Semi-infinite and 
Finite Media 


Starting from the result of Problem 6-1, that the one-dimensional
infinite-medium Green’s function is

$$G(z, t) = \frac{1}{(4\pi\beta t)^{1/2}}\, e^{-z^2/4\beta t} \tag{6.26}$$

it is possible to derive Green’s functions for semi-infinite and finite media
with plane boundaries, for each type of boundary condition discussed
above. In the next section, similar problems are solved for spherical
media.




A. SEMI-INFINITE MEDIUM, SOURCE AT z = z_0


(a) Boundary z = 0 at constant temperature T = 0.
The method here is to introduce an image source of negative strength
at z = −z_0, to obtain the solution

$$G = G(z - z_0, t) - G(z + z_0, t) \tag{6.27}$$


(b) Insulated boundary at z = 0.
If one introduces at z = −z_0 a source equal to that at z = z_0,

$$G = G(z - z_0, t) + G(z + z_0, t) \tag{6.28}$$

the flux over the boundary from the image source is equal and opposite
to that of the original source, giving a net flux equal to zero.
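The two image constructions (6.27) and (6.28) can be verified directly. In this minimal sketch the function names and sample values are illustrative assumptions: the negative image makes the solution vanish on the plane z = 0, while the equal positive image makes its z-derivative vanish there.

```python
import math

def g0(z, t, beta):
    """One-dimensional infinite-medium Green's function (6.26)."""
    return math.exp(-z * z / (4.0 * beta * t)) / math.sqrt(4.0 * math.pi * beta * t)

def g_fixed_temperature(z, z0, t, beta):
    """(6.27): source at z0 plus a negative image at -z0, so that G = 0 at z = 0."""
    return g0(z - z0, t, beta) - g0(z + z0, t, beta)

def g_insulated(z, z0, t, beta):
    """(6.28): source at z0 plus an equal image at -z0, so that dG/dz = 0 at z = 0."""
    return g0(z - z0, t, beta) + g0(z + z0, t, beta)

z0, t, beta, h = 1.0, 0.5, 1.0, 1e-4
boundary_value = g_fixed_temperature(0.0, z0, t, beta)
boundary_slope = (g_insulated(h, z0, t, beta) - g_insulated(-h, z0, t, beta)) / (2.0 * h)
print(boundary_value, boundary_slope)
```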


(c) Radiative boundary condition (∂T/∂z) − αT = 0
at z = 0, the medium being in the region z > 0.

In this case the choice of an image system is not obvious. Let us then
take a distribution of images f(z) for z < 0, and try to choose f so as to
satisfy boundary conditions. We wish therefore to apply the boundary
condition to

$$G = G(z - z_0) + \int_{-\infty}^{0} G(z - z')\, f(z')\, dz'$$

Imposing the boundary condition, the equation to be satisfied is

$$G'(-z_0) - \int_{-\infty}^{0} G'(z')\, f(z')\, dz' - \alpha \left[ G(z_0) + \int_{-\infty}^{0} G(z')\, f(z')\, dz' \right] = 0$$

Let us integrate the second term by parts, taking f(0) = 0 and using the
fact that G(−∞) = 0. This leads to the equation

$$\int_{-\infty}^{0} G(z')\, [f'(z') - \alpha f(z')]\, dz' = -G'(-z_0) + \alpha G(-z_0)$$

Defining h(z′) by

$$f(z') = \delta(z' + z_0) + h(z') \tag{6.29}$$

we find that h(z′) obeys the equation

$$\int_{-\infty}^{0} G(z')\, [h'(z') - \alpha h(z')]\, dz' = 2\alpha G(-z_0)$$

This may be satisfied provided

$$h'(z') - \alpha h(z') = 2\alpha\, \delta(z' + z_0) \tag{6.30}$$

The solution of this equation for the boundary condition h(0) = 0 is

$$h(z') = \begin{cases} 0, & z' > -z_0 \\ -2\alpha\, e^{\alpha(z' + z_0)}, & z' < -z_0 \end{cases} \tag{6.31}$$

The Green’s function for this problem is, therefore,

$$G = G(z - z_0) + G(z + z_0) - 2\alpha \int_{-\infty}^{-z_0} G(z - z')\, e^{\alpha(z' + z_0)}\, dz' \tag{6.32}$$

The last integral may be evaluated in terms of error functions: it is

$$\frac{1}{\sqrt{4\pi\beta t}} \int_{-\infty}^{-z_0} e^{-(z - z')^2/4\beta t}\, e^{\alpha(z' + z_0)}\, dz'$$

After a little manipulation, this may be put in the form

$$\frac{1}{2}\, e^{\alpha(z + z_0)}\, e^{\alpha^2 \beta t} \left[ 1 - \mathrm{erf}\, \frac{z + z_0 + 2\alpha\beta t}{2\sqrt{\beta t}} \right]$$

where

$$\mathrm{erf}\, x = \frac{2}{\sqrt{\pi}} \int_0^x e^{-y^2}\, dy \tag{6.33}$$

Therefore,

$$G = G(z - z_0) + G(z + z_0) - \alpha\, e^{\alpha(z + z_0)}\, e^{\alpha^2 \beta t} \left[ 1 - \mathrm{erf}\, \frac{z + z_0 + 2\alpha\beta t}{2\sqrt{\beta t}} \right] \tag{6.34}$$
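Since the derivation of (6.34) involves several steps, a direct numerical check of the boundary condition is reassuring. The sketch below is an illustration with assumed names and sample values: it evaluates (6.34) with the standard library error function and estimates ∂G/∂z − αG at z = 0 by a central difference. It also confirms that α = 0 reproduces the insulated-boundary result (6.28).

```python
import math

def g0(z, t, beta):
    """One-dimensional infinite-medium Green's function (6.26)."""
    return math.exp(-z * z / (4.0 * beta * t)) / math.sqrt(4.0 * math.pi * beta * t)

def g_radiative(z, z0, t, beta, alpha):
    """Green's function (6.34) for the half-space z > 0 with dT/dz - alpha*T = 0 at z = 0."""
    arg = (z + z0 + 2.0 * alpha * beta * t) / (2.0 * math.sqrt(beta * t))
    # math.erfc(x) equals 1 - erf(x)
    tail = alpha * math.exp(alpha * (z + z0) + alpha**2 * beta * t) * math.erfc(arg)
    return g0(z - z0, t, beta) + g0(z + z0, t, beta) - tail

def boundary_residual(z0, t, beta, alpha, h=1e-5):
    """Central-difference estimate of dG/dz - alpha*G at the surface z = 0."""
    slope = (g_radiative(h, z0, t, beta, alpha) - g_radiative(-h, z0, t, beta, alpha)) / (2.0 * h)
    return slope - alpha * g_radiative(0.0, z0, t, beta, alpha)

print(boundary_residual(z0=1.0, t=0.5, beta=1.0, alpha=2.0))
```

As α increases the surface value of G decreases toward zero, so the result interpolates between the insulated case (6.28) and the fixed-temperature case (6.27).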


B. FINITE MEDIA BETWEEN z = 0 AND z = a, SOURCE AT
z = z_0 < a

For the simple boundary conditions (insulating boundaries, both
boundaries held at a constant temperature) the sort of successive image
approach used in electrostatics leads directly to finite-medium Green’s
functions in terms of an infinite series of infinite-medium ones. For
example, with T = 0 at the boundaries z = 0 and z = a, we require
negative images at (2na − z_0) and positive ones at (2na + z_0). The Green’s
function is, therefore,

$$G = \sum_{n=-\infty}^{\infty} G_0(z - z_0 - 2na) - \sum_{n=-\infty}^{\infty} G_0(z + z_0 - 2na) \tag{6.35}$$

G_0 being the one-dimensional infinite-medium Green’s function

$$G_0 = \frac{1}{(4\pi\beta t)^{1/2}}\, e^{-z^2/4\beta t} \tag{6.36}$$

The function (6.35) may be transformed by Poisson’s formula [see
Chapter 4, formula (4.82)] into an alternative series. In the present case
the alternative form is

$$G = \frac{\pi}{a} \sum_{m=-\infty}^{\infty} g\left( \frac{\pi m}{a} \right) \left[ e^{2\pi i m (z - z_0)/2a} - e^{2\pi i m (z + z_0)/2a} \right]$$

$$= \frac{4\pi}{a} \sum_{m=1}^{\infty} g\left( \frac{\pi m}{a} \right) \sin \frac{m\pi z}{a}\, \sin \frac{m\pi z_0}{a} \tag{6.37}$$

where g(k) is the Fourier transform of G_0,

$$g(k) = \frac{1}{2\pi}\, e^{-\beta k^2 t} \tag{6.38}$$

Consequently we find that

$$G(z, z_0, t) = \frac{2}{a} \sum_{m=1}^{\infty} e^{-(\pi^2 m^2/a^2)\beta t}\, \sin \frac{m\pi z}{a}\, \sin \frac{m\pi z_0}{a} \tag{6.39}$$
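The equivalence of the image series (6.35) and the Fourier series (6.39) can be seen numerically. This sketch uses assumed truncation limits and illustrative names. The image series converges quickly at short times, when only a few images are felt, and the Fourier series converges quickly at long times, when only the lowest modes survive; with generous truncation both give the same value at any time.

```python
import math

def g0(z, t, beta):
    """One-dimensional infinite-medium Green's function (6.36)."""
    return math.exp(-z * z / (4.0 * beta * t)) / math.sqrt(4.0 * math.pi * beta * t)

def g_images(z, z0, t, beta, a, n_max=50):
    """Image series (6.35), truncated at |n| <= n_max."""
    return sum(g0(z - z0 - 2 * n * a, t, beta) - g0(z + z0 - 2 * n * a, t, beta)
               for n in range(-n_max, n_max + 1))

def g_fourier(z, z0, t, beta, a, m_max=200):
    """Fourier series (6.39), truncated at m <= m_max."""
    return (2.0 / a) * sum(
        math.exp(-(math.pi * m / a) ** 2 * beta * t)
        * math.sin(m * math.pi * z / a) * math.sin(m * math.pi * z0 / a)
        for m in range(1, m_max + 1))

for t in (0.01, 0.5):                      # a short and a long time for a slab of width a = 1
    print(t, g_images(0.3, 0.6, t, 1.0, 1.0), g_fourier(0.3, 0.6, t, 1.0, 1.0))
```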


Problem 6-3: Obtain (6.39) by solving the problem 
directly by expansion in Fourier series. 


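Problem 6-4: With the same geometry but with the
two boundaries insulated, show that the Green’s function may be written
as

$$\text{(i)} \quad G = \sum_{n=-\infty}^{\infty} G_0(z - z_0 - 2na) + \sum_{n=-\infty}^{\infty} G_0(z + z_0 - 2na)$$

or

$$\text{(ii)} \quad G = \frac{1}{a} \sum_{m=-\infty}^{\infty} e^{-(\pi^2 m^2/a^2)\beta t}\, \cos \frac{m\pi z_0}{a}\, \cos \frac{m\pi z}{a}$$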

Other boundary conditions may be handled by different devices. Let
us consider examples.


(a) T = 0 at z = 0; T = T_0 at z = a.
In this case we note that T_0(z/a) is a solution of the diffusion
equation. (T − T_0(z/a)) satisfies the boundary condition that it is zero at z = 0
and z = a and is, therefore, given by (6.39).


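(b) T = 0 at z = 0; z = a insulated.
The solution in this case has the Fourier series expansion

$$T = \sum_n y_n(t)\, \sin \frac{(n + \frac{1}{2})\pi z}{a} \tag{6.40}$$

Substituting in the equation, and making use of the fact that

$$\delta(z - z_0) = \sum_n c_n\, \sin \frac{(n + \frac{1}{2})\pi z}{a} \tag{6.41}$$

where

$$c_n = \frac{2}{a}\, \sin \frac{(n + \frac{1}{2})\pi z_0}{a} \tag{6.42}$$

we find for y_n(t) the equation

$$\frac{dy_n}{dt} + \frac{(n + \frac{1}{2})^2 \pi^2}{a^2}\, \beta\, y_n = \delta(t)\, \frac{2}{a}\, \sin \frac{(n + \frac{1}{2})\pi z_0}{a}$$

The solution is

$$y_n(t) = \frac{2}{a}\, \sin \frac{(n + \frac{1}{2})\pi z_0}{a}\; e^{-[(n + (1/2))^2 \pi^2/a^2]\beta t} \tag{6.43}$$

Substituting this in (6.40) gives the required solution.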


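Problem 6-5: Using Poisson’s formula, show that an
alternative form for the solution is

$$T = \frac{1}{\sqrt{4\pi\beta t}} \sum_{n=-\infty}^{\infty} \left[ e^{-(z - z_0 + 4na)^2/4\beta t} - e^{-(z + z_0 + 4na)^2/4\beta t} + e^{-[z + z_0 + (4n+2)a]^2/4\beta t} - e^{-[z - z_0 + (4n+2)a]^2/4\beta t} \right]$$

$$= \sum_{n=-\infty}^{\infty} \left[ G_0(z - z_0 + 4na) + G_0(z + z_0 + (4n+2)a) - G_0(z + z_0 + 4na) - G_0(z - z_0 + (4n+2)a) \right]$$

Show how this result can be derived directly.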


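(c) Radiative boundary conditions at z = 0 and z = a

A direct approach to this problem by the method of images is very
cumbersome. Instead, let us use the method of separation of variables.
Putting

$$T = f(t)\, g(z) \tag{6.44}$$

substituting in

$$\frac{\partial T}{\partial t} - \beta \frac{\partial^2 T}{\partial z^2} = 0$$

and dividing by T, we obtain the equation

$$\frac{1}{f} \frac{df}{dt} = \beta\, \frac{1}{g} \frac{d^2 g}{dz^2} \tag{6.45}$$

Each side of the equation must be a constant. Put

$$\frac{1}{f} \frac{df}{dt} = -\beta\lambda^2$$

so that

$$g = A \sin \lambda z + B \cos \lambda z$$

Application of the radiative boundary condition

$$\frac{dg}{dz} \pm \alpha g = 0 \tag{6.46}$$

at z = a and z = 0, respectively, yields a transcendental equation for λ:

$$\tan \lambda a = \frac{2\alpha\lambda}{\lambda^2 - \alpha^2} \tag{6.47}$$

If we put

$$\tan \frac{\delta}{2} = \frac{\alpha}{\lambda} \tag{6.48}$$

Equation (6.47) gives

$$\lambda a = \delta \quad \text{or} \quad \lambda a = \delta + \pi \tag{6.49}$$

The transcendental equation for λ may then be put in the form

$$\tan \frac{\lambda a}{2} = \frac{\alpha}{\lambda} \tag{6.50}$$

or

$$-\cot \frac{\lambda a}{2} = \frac{\alpha}{\lambda} \tag{6.51}$$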
The nature of the roots is demonstrated in Fig. 6.1. This illustrates the
curves corresponding to the right- and left-hand sides of (6.50) and (6.51)
intersecting at points C_1, C_2, C_3, C_4, ...; D_1, D_2, D_3, ... for which λa is
given by O_1, O_2, O_3, O_4, ... and P_1, P_2, P_3, .... The location of the roots
depends on the magnitude of αa. If the rate of radiation at the surface is
rapid, αa is large and the lower roots are close to λa = nπ (n = 1, 2, ...).
If it is slow, αa is small and the roots are close to λa = nπ (n = 0, 1, 2, ...).
The first case approaches that of constant temperature at the boundaries,
the second, that of insulated boundaries.

[Figure 6.1: the curves tan(λa/2) and −cot(λa/2) plotted against λa, together with
the curve for α/λ; the intersections are labeled C_n and D_n, with abscissae O_n and P_n.]
Since, from the boundary condition at z = 0,

$$\frac{B}{A} = \frac{\lambda}{\alpha} = \cot \frac{\delta}{2} \tag{6.52}$$

the case λa = δ corresponds to the solutions which are even with respect
to the midpoint of the slab:

$$g = \phi_n = \sqrt{\frac{2}{a}}\, \cos \lambda \left( z - \frac{a}{2} \right)$$

while the case λa = δ + π corresponds to the odd solutions

$$g = \phi_n = \sqrt{\frac{2}{a}}\, \sin \lambda \left( z - \frac{a}{2} \right)$$

The solution of the spatial equation can be written

$$g = C \cos \left( \lambda z - \frac{\delta}{2} \right) \tag{6.53}$$

When α → ∞, δ → π and the solution takes the form

$$g = C \sin \frac{n\pi z}{a}$$

when α → 0, δ → 0 and it is

$$g = C \cos \frac{n\pi z}{a}$$

both of which clearly satisfy the corresponding boundary conditions.
Labeling the roots of the transcendental equation λ_n in the general
case, and calling the corresponding δ’s determined by (6.48) δ_n, the
functions

$$\phi_n = \sqrt{\frac{2}{a}}\, \cos \left( \lambda_n z - \tfrac{1}{2}\delta_n \right) \tag{6.54}$$

form an orthonormal set.
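To find the Green’s function for this problem, that is, the solution of

$$\frac{\partial G}{\partial t} - \beta \frac{\partial^2 G}{\partial z^2} = \delta(z - z_0)\, \delta(t) \tag{6.55}$$

we first expand δ(z − z_0) in terms of the φ_n’s,

$$\delta(z - z_0) = \sum_n \phi_n(z)\, \phi_n(z_0) \tag{6.56}$$

If we also expand

$$G(z, t) = \sum_n c_n(t)\, \phi_n(z)$$

we may substitute in (6.55) and equate separate coefficients of φ_n:

$$\dot{c}_n + \beta\lambda_n^2 c_n = \phi_n(z_0)\, \delta(t) \tag{6.57}$$

The solution is

$$c_n(t) = \theta(t)\, \phi_n(z_0)\, e^{-\beta\lambda_n^2 t}$$

where, as usual, θ(t) = 1 for t > 0 and 0 for t < 0. The Green’s function
is, therefore,

$$G(z, t) = \theta(t) \sum_n \phi_n(z_0)\, \phi_n(z)\, e^{-\beta\lambda_n^2 t}$$

$$= \theta(t)\, \frac{2}{a} \sum_n \cos \left( \lambda_n z_0 - \frac{\delta_n}{2} \right) \cos \left( \lambda_n z - \frac{\delta_n}{2} \right) e^{-\beta\lambda_n^2 t} \tag{6.58}$$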
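The roots λ_n and the mode functions cos(λ_n z − δ_n/2) are easily obtained numerically. The sketch below is an illustration under stated assumptions: bisection and Simpson’s rule are choices made here, not methods from the text, and all names are hypothetical. It brackets one root of (6.50) or (6.51) in each interval (kπ/a, (k+1)π/a), checks the boundary condition at z = a, and evaluates the overlap integrals of different modes. Mode norms are computed numerically here rather than taken from a closed-form factor.

```python
import math

def bisect(f, lo, hi, iterations=200):
    """Root of f in (lo, hi), assuming f changes sign exactly once there."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def slab_roots(alpha, a, count):
    """First `count` roots: even k from tan(lam a/2) = alpha/lam, odd k from -cot(lam a/2) = alpha/lam."""
    roots = []
    for k in range(count):
        lo, hi = k * math.pi / a + 1e-9, (k + 1) * math.pi / a - 1e-9
        if k % 2 == 0:   # (6.50) multiplied through by lam * cos(lam a / 2)
            f = lambda lam: lam * math.sin(lam * a / 2) - alpha * math.cos(lam * a / 2)
        else:            # (6.51) multiplied through by lam * sin(lam a / 2)
            f = lambda lam: lam * math.cos(lam * a / 2) + alpha * math.sin(lam * a / 2)
        roots.append(bisect(f, lo, hi))
    return roots

def mode(lam, alpha):
    """Unnormalized mode cos(lam z - delta/2), with tan(delta/2) = alpha/lam as in (6.48)."""
    half_delta = math.atan(alpha / lam)
    return lambda z: math.cos(lam * z - half_delta)

def simpson(f, lo, hi, n=2000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

alpha, a = 1.5, 1.0
lams = slab_roots(alpha, a, 5)
modes = [mode(lam, alpha) for lam in lams]
overlap = [[simpson(lambda z: modes[m](z) * modes[n](z), 0.0, a) for n in range(5)]
           for m in range(5)]
print(lams)
print(overlap[0][1], overlap[2][3])
```

Calling `slab_roots` with a very large or very small α shows the two limits described above: the roots approach multiples of π/a, starting from n = 1 in the first case and from n = 0 in the second.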


Problem 6-6: A long cylinder of radius a is heated to
a temperature T_0 and then cooled by having its surface kept at
temperature T = 0 from time t = 0. Show that the subsequent temperature
distribution in the cylinder is

$$T(r, t) = 2T_0 \sum_{n=1}^{\infty} \frac{J_0(\gamma_n r/a)}{\gamma_n J_1(\gamma_n)}\, e^{-\gamma_n^2 \beta t/a^2}$$

where J_0(γ_n) = 0.


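Problem 6-7: If, in the preceding problem, the
cylinder is allowed to cool by radiation, show that the temperature is

$$T(r, t) = 2T_0 \sum_n \frac{J_1(\lambda_n)\, J_0(\lambda_n (r/a))}{\lambda_n [J_0^2(\lambda_n) + J_1^2(\lambda_n)]}\, e^{-\lambda_n^2 \beta t/a^2}$$

where the λ_n’s are solutions of

$$\lambda J_1(\lambda) = \alpha a\, J_0(\lambda)$$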


Problem 6-8: If a substance diffuses down a thin
tube of length l, there being no diffusion over the sides; and if the rate of
influx at the end x = 0 is f(t) while the end x = l is held at a fixed
concentration C_0, show that the concentration is given by


c(x,t) = {Co ji == >) var — 1)" ‘Caaduedal 


= 


ail A2n+1)2r2Dt-U)/4lZe (4! ' (2n+ 1) ax 
ak f(t!) dt} cos C2 Uae 




6. Green’s Functions— 
Spherical Geometry 


We have already noted [Eq. (3.170), Chapter 3] that the solution of a
linear equation for a spherical shell source may be written in terms of that
for a plane source. This connection between plane and spherical problems
may be seen in another way. The Laplacian of a spherically symmetrical
function φ may be written

$$\nabla^2 \phi = \frac{\partial^2 \phi}{\partial r^2} + \frac{2}{r} \frac{\partial \phi}{\partial r} = \frac{1}{r} \frac{\partial^2}{\partial r^2} (r\phi) \tag{6.59}$$

Therefore, in the case of spherical symmetry, the heat conduction
(diffusion) equation is

$$\frac{\partial T}{\partial t} - \beta\, \frac{1}{r} \frac{\partial^2}{\partial r^2} (rT) = \delta(t)\, \frac{\delta(r - r')}{4\pi r'^2} \tag{6.60}$$

when there is a unit instantaneous source at time t = 0 on the sphere of
radius r′. The normalization on the right is such as to make the total
strength of the shell source unity.

If we introduce

$$\chi = rT \tag{6.61}$$

Equation (6.60) becomes

$$\frac{\partial \chi}{\partial t} - \beta \frac{\partial^2 \chi}{\partial r^2} = \delta(t)\, \frac{1}{4\pi r'}\, \delta(r - r') \tag{6.62}$$

which is just the one-dimensional equation. However, since T must be
finite at r = 0, there is the additional “boundary condition” that χ = 0 at
r = 0. Thus, the problem of a finite sphere is mathematically identical
with that of a “one-dimensional” medium confined between two planes,
at r = 0 and r = a, respectively, the former being kept at the fixed
temperature T = 0. We may, then, immediately write down the solution of a
number of problems of spherical geometry in terms of ones already
obtained.

It should be noted, however, that it is not simply a matter of the
relation between plane and shell source solutions in an infinite medium,

$$G_{sh} = \frac{1}{4\pi r r'} \left[ G_{pl}(|r - r'|) - G_{pl}(r + r') \right] \tag{6.63}$$


Although there is a correspondence between plane and shell source
solutions in finite media, that correspondence must take account of boundary
conditions. We must recognize that it is rT in the spherical case which
replaces T in the plane one. If we are interested in a sphere of radius a
whose surface is held at a fixed temperature, that temperature may be
taken to be zero; in this case the problem corresponds to that of a plane
source in a finite slab bounded by z = 0 and z = a, with both surfaces at
T = 0. Consider, however, the case in which the outer surface of the
sphere is insulated. Then

$$\frac{\partial T}{\partial r} = 0 \quad \text{at } r = a \tag{6.64}$$

or

$$\chi' - \frac{\chi}{r} = 0 \quad \text{at } r = a \tag{6.65}$$

This has the form of a radiative boundary condition for a plane problem,
so the analogous problem to be solved is that of a slab with T = 0 at
z = 0 and a radiative boundary condition at z = a.

If the boundary condition at r = a in the spherical problem is truly
radiative, the same analogy applies, though the constants in the radiative
condition are altered. If

$$\frac{\partial T}{\partial r} + \alpha T = 0 \tag{6.66}$$

at r = a, in terms of χ this becomes

$$\chi' - \chi \left( \frac{1}{a} - \alpha \right) = 0 \tag{6.67}$$

Since we have not previously solved this problem, let us sketch its
solution briefly here.

For t ≠ 0 and r ≠ r′, and separating variables as in (6.44) by putting
χ = g(r) f(t), we find that

$$g = \sin \lambda r \tag{6.68}$$

$$f = e^{-\beta\lambda^2 t} \tag{6.69}$$

where, by virtue of the boundary condition (6.67) at r = a

$$\lambda a \cot \lambda a = 1 - \alpha a \tag{6.70}$$

Figure 6.2 shows the roots of the transcendental equation (6.70).

[Figure 6.2: graph of λa cot λa against λa.]

The solutions (6.68) are normalized as follows: letting T_n = A_n(1/r) sin λ_n r,
we must have

$$4\pi \int_0^a T_n^2\, r^2\, dr = 4\pi A_n^2 \int_0^a \sin^2 \lambda_n r\, dr$$

$$= 2\pi A_n^2 \int_0^a (1 - \cos 2\lambda_n r)\, dr$$

$$= 2\pi A_n^2 \left[ a - (1/2\lambda_n) \sin 2\lambda_n a \right]$$

$$= 1$$

Therefore,

$$A_n = \frac{1}{\sqrt{(\pi/\lambda_n)(2\lambda_n a - \sin 2\lambda_n a)}} \tag{6.71}$$

The solution of (6.62) may now be obtained by writing

$$\chi = \sum_n C_n(t)\, A_n \sin \lambda_n r \tag{6.72}$$

and making use of the fact that

$$\frac{1}{4\pi}\, \delta(r - r') = \sum_n A_n^2\, \sin \lambda_n r\, \sin \lambda_n r' \tag{6.73}$$

Substitution in (6.62) gives the equation for C_n(t),

$$\dot{C}_n + \beta\lambda_n^2 C_n = A_n\, \frac{\sin \lambda_n r'}{r'}\, \delta(t)$$

Thus

$$C_n = A_n\, \frac{\sin \lambda_n r'}{r'}\, e^{-\beta\lambda_n^2 t} \tag{6.74}$$

and the solution becomes

$$T = \frac{1}{r}\, \chi = \frac{1}{r r'} \sum_n A_n^2\, e^{-\beta\lambda_n^2 t}\, \sin \lambda_n r'\, \sin \lambda_n r \tag{6.75}$$

where A_n² is given by (6.71).
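The ingredients of (6.75) can be computed and checked numerically. This is a minimal sketch with assumed names, using bisection and Simpson’s rule as illustrative choices; it assumes α > 0, so that one root of (6.70) lies in each interval kπ < λa < (k+1)π. It finds the first few roots, forms A_n from (6.71), and verifies that the functions A_n sin(λ_n r)/r are orthonormal with respect to the volume element 4πr² dr.

```python
import math

def bisect(f, lo, hi, iterations=200):
    """Root of f in (lo, hi), assuming f changes sign exactly once there."""
    f_lo = f(lo)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_mid > 0.0) == (f_lo > 0.0):
            lo, f_lo = mid, f_mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def sphere_roots(alpha, a, count):
    """Roots lam_n of lam a cot(lam a) = 1 - alpha a, assuming alpha > 0."""
    c = 1.0 - alpha * a
    f = lambda x: x * math.cos(x) - c * math.sin(x)   # (6.70) times sin(x), with x = lam * a
    return [bisect(f, k * math.pi + 1e-6, (k + 1) * math.pi - 1e-6) / a
            for k in range(count)]

def amplitude(lam, a):
    """Normalization constant A_n of (6.71)."""
    return 1.0 / math.sqrt((math.pi / lam) * (2.0 * lam * a - math.sin(2.0 * lam * a)))

def simpson(f, lo, hi, n=2000):
    """Composite Simpson rule with an even number n of intervals."""
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(lo + i * h)
    return total * h / 3.0

alpha, a = 2.0, 1.0
lams = sphere_roots(alpha, a, 4)
amps = [amplitude(lam, a) for lam in lams]
# 4 pi * integral of T_m T_n r^2 dr reduces to 4 pi A_m A_n * integral of sin(lam_m r) sin(lam_n r) dr.
gram = [[4.0 * math.pi * amps[m] * amps[n]
         * simpson(lambda r: math.sin(lams[m] * r) * math.sin(lams[n] * r), 0.0, a)
         for n in range(4)] for m in range(4)]
print(lams)
print(gram[0][0], gram[0][1])
```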


7. General Problem with Initial 
and Boundary Conditions 


We have shown in the previous sections how to determine Green’s
functions for heat conduction and diffusion problems with different
geometries. We now see how to use the Green’s function to solve such
problems subject to given initial conditions:

$$T(\mathbf{r}, t) = T_0(\mathbf{r}) \quad \text{at } t = 0 \tag{6.76}$$

and boundary conditions which specify

$$\frac{\partial T}{\partial n} - \alpha T \tag{6.77}$$

over given surfaces S. ∂/∂n represents the normal derivative over the
surface. Particular cases of (6.77) are those in which the temperature or
flux of heat are specified over S.

Before proceeding, however, it is necessary to prove the “reciprocity
theorem”: that the solution at r due to a unit source at r′ is the same
as that at r′ due to a unit source at r, the time interval being the same in
each case. In other words, we wish to show that

$$G(\mathbf{r}, \mathbf{r}', t - t') = G(\mathbf{r}', \mathbf{r}, t - t') \tag{6.78}$$


Problem 6-9: Harold Urey, in his book The Planets,²
poses the problem of the change in temperature due to surface cooling of
a sphere which is heated uniformly by radioactive heating. The equation is

$$\frac{\partial T}{\partial t} = \kappa \left( \frac{\partial^2 T}{\partial r^2} + \frac{2}{r} \frac{\partial T}{\partial r} \right) + \sum_i Q_i\, e^{-\alpha_i t}$$

Q_i is the heat produced per unit volume by the ith radioactive species,
divided by the specific heat C; α_i is the decay constant of that species.
Assuming that the temperature in the sphere is uniformly T_0 at t = 0, and
that the surface is at temperature T = 0 at all times, show that

$$T = -\frac{2a}{\pi r} \sum_i \sum_{n=1}^{\infty} \frac{(-1)^n}{n}\, \frac{Q_i}{(\kappa n^2 \pi^2/a^2) - \alpha_i}\, \sin \frac{n\pi r}{a} \left[ e^{-\alpha_i t} - e^{-(\kappa n^2 \pi^2 t/a^2)} \right] - \frac{2a T_0}{\pi r} \sum_{n=1}^{\infty} \frac{(-1)^n}{n}\, \sin \frac{n\pi r}{a}\, e^{-(\kappa n^2 \pi^2 t/a^2)}$$

a being the radius of the sphere.

²New Haven: Yale University Press, 1952, pp. 52-53.


That the solution is a function only of t − t′ is a consequence of time
translation invariance. Suppose it is a function of “source time” t′ and
“observation time” t. It may equally well be written in terms of t_1 = (t − t′)
and t_2 = ½(t + t′). But if the origin of time is displaced by amount t_0, t_1 is
unchanged and t_2 becomes ½(t + t′) + t_0. Since, due to time translation
invariance, the solution cannot depend on t_0, it must depend only on t_1.

In an infinite medium one may similarly show from invariance under
spatial translation that G must depend on coordinates only through r − r′.
Since spatial translation invariance breaks down in finite media, the result
does not follow in that case; this is verified in the problems considered.
In each case, however, the relation (6.78) holds.

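The proof of the reciprocity theorem is as follows: From the
definitions of the Green’s function

$$\frac{\partial}{\partial t'} G(\mathbf{r}, \mathbf{r}', t' - t) - \beta \nabla^2 G(\mathbf{r}, \mathbf{r}', t' - t) = \delta(\mathbf{r} - \mathbf{r}')\, \delta(t' - t)$$

for t′ > t, or

$$-\frac{\partial}{\partial t} G(\mathbf{r}, \mathbf{r}', t' - t) - \beta \nabla^2 G(\mathbf{r}, \mathbf{r}', t' - t) = \delta(\mathbf{r} - \mathbf{r}')\, \delta(t' - t) \tag{6.79}$$

But for t > t″

$$\frac{\partial}{\partial t} G(\mathbf{r}, \mathbf{r}'', t - t'') - \beta \nabla^2 G(\mathbf{r}, \mathbf{r}'', t - t'') = \delta(\mathbf{r} - \mathbf{r}'')\, \delta(t - t'') \tag{6.80}$$

We now multiply (6.79) by G(r, r″, t − t″) and (6.80) by G(r, r′, t′ − t),
subtract the former from the latter, and integrate with respect to r over
the region of definition and over t from below t″ to above t′. The
left-hand side becomes

$$\int d^3r \int_{t''-}^{t'+} \frac{\partial}{\partial t} \left\{ G(\mathbf{r}, \mathbf{r}'', t - t'')\, G(\mathbf{r}, \mathbf{r}', t' - t) \right\} dt$$

$$+ \beta \int dt \int dS \left\{ G(\mathbf{r}, \mathbf{r}'', t - t'')\, \frac{\partial}{\partial n} G(\mathbf{r}, \mathbf{r}', t' - t) - G(\mathbf{r}, \mathbf{r}', t' - t)\, \frac{\partial}{\partial n} G(\mathbf{r}, \mathbf{r}'', t - t'') \right\}$$

The first term is zero because one of the G’s is zero at each limit, and the
second by virtue of the boundary condition of the form

$$\frac{\partial G}{\partial n} - \alpha G = 0$$

on the boundary of the region of definition of the problem. But the
right-hand side is then

$$G(\mathbf{r}'', \mathbf{r}', t' - t'') - G(\mathbf{r}', \mathbf{r}'', t' - t'') = 0$$

so the reciprocity theorem is proven.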
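Reciprocity without translation invariance can be seen in a small numerical experiment. The sketch below is not from the text: the explicit finite-difference scheme, the grid size, and the name `propagate` are assumptions for illustration. It follows a unit source in a slab whose left face is held at T = 0 and whose right face is insulated. Exchanging source and observation cells leaves the result unchanged, while shifting both by the same amount does not.

```python
def propagate(source, n=30, r=0.25, steps=200):
    """Explicit steps of dT/dt = beta d2T/dz2 (r = beta*dt/dz^2) from a unit source in cell `source`."""
    T = [0.0] * n
    T[source] = 1.0
    for _ in range(steps):
        new = [0.0] * n
        for i in range(n):
            left = T[i - 1] if i > 0 else 0.0          # left face held at T = 0
            right = T[i + 1] if i < n - 1 else T[i]    # mirror value: no flux through the right face
            new[i] = T[i] + r * (left - 2.0 * T[i] + right)
        T = new
    return T

g_8_from_5 = propagate(5)[8]      # response at cell 8 to a source at cell 5
g_5_from_8 = propagate(8)[5]      # source and observation points exchanged
g_18_from_15 = propagate(15)[18]  # same separation, shifted away from the cold face
print(g_8_from_5, g_5_from_8, g_18_from_15)
```

The first two numbers agree to rounding error because the discretized operator with these boundary conditions is symmetric, which is the discrete counterpart of the surface term vanishing in the proof above.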

Let us return now to the problem of solving the equation, in a speci- 
fied geometry, subject to the conditions (6.76) and (6.77). For generality, 
let us also suppose there is also a source distribution Co(r, £), i.e., that 
the heat energy produced per unit time in the volume element d?r is 
Co(r) d*r, where C is the specific heat. The heat conduction equation 
then becomes 


of —pv"T = o(r’, t’) (6.81) 


On the other hand, the Green’s function equation is 

Z G(r',r,t—1')—BV"G(r',r,t—1') = a(r—r')&(t—1') (6.82) 
or, in aform more convenient for our purposes 

-< G(r’, r,t—1')—BV°G(',r, t—1') = &r—r')(t—2’) (6.83) 


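We now multiply (6.81) by $G(\mathbf{r}',\mathbf{r},t-t')$ and (6.83) by $T(\mathbf{r}',t')$, subtract, and integrate over the spatial coordinates $\mathbf{r}'$ and over $t'$ from 0 to $t+\epsilon$ to get

$$T(\mathbf{r},t) = -\int d^3r' \int_0^{t+\epsilon} dt' \left\{T(\mathbf{r}',t')\,\frac{\partial}{\partial t'} G(\mathbf{r}',\mathbf{r},t-t') + G(\mathbf{r}',\mathbf{r},t-t')\,\frac{\partial}{\partial t'} T(\mathbf{r}',t')\right\}$$
$$+\ \beta \int \left\{G(\mathbf{r}',\mathbf{r},t-t')\,\nabla^2 T - T\,\nabla^2 G(\mathbf{r}',\mathbf{r},t-t')\right\} d^3r'\,dt'$$
$$+ \int \sigma(\mathbf{r}',t')\,G(\mathbf{r}',\mathbf{r},t-t')\,d^3r'\,dt' \tag{6.84}$$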


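We can do the $t'$ integral in the first term on the right, and convert the second to a surface integral over the boundaries. In this way we obtain

$$T(\mathbf{r},t) = \int d^3r'\,T_0(\mathbf{r}')\,G(\mathbf{r},\mathbf{r}',t)$$
$$+\ \beta \int \left\{G(\mathbf{r}',\mathbf{r},t-t')\,\frac{\partial T}{\partial n'} - T\,\frac{\partial}{\partial n'} G(\mathbf{r}',\mathbf{r},t-t')\right\} dS'\,dt'$$
$$+ \int \sigma(\mathbf{r}',t')\,G(\mathbf{r}',\mathbf{r},t-t')\,d^3r'\,dt' \tag{6.85}$$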




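If $(\partial T/\partial n) - \alpha T$ is given on the boundaries, the problem is easily solved by using the Green’s function satisfying

$$\frac{\partial G}{\partial n} - \alpha G = 0$$

on them. For then the second term on the right is

$$\beta \int G(\mathbf{r}',\mathbf{r},t-t')\left[\frac{\partial T}{\partial n'} - \alpha T\right] dS'\,dt' = \beta \int G(\mathbf{r},\mathbf{r}',t-t')\left[\frac{\partial T}{\partial n'} - \alpha T\right] dS'\,dt'$$

the integral being taken over the bounding surfaces. Thus (6.84) finally becomes

$$T(\mathbf{r},t) = \int d^3r'\,T_0(\mathbf{r}')\,G(\mathbf{r},\mathbf{r}',t)$$
$$+\ \beta \int_0^t dt' \int G(\mathbf{r},\mathbf{r}',t-t')\left[\frac{\partial}{\partial n'} T(\mathbf{r}',t') - \alpha T(\mathbf{r}',t')\right] dS'$$
$$+ \int_0^t dt' \int \sigma(\mathbf{r}',t')\,G(\mathbf{r},\mathbf{r}',t-t')\,d^3r' \tag{6.86}$$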


We see, therefore, that the contributions from initial conditions, from 
boundaries, and from sources are all expressible in terms of the same 
Green’s function. 
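The initial-condition term of (6.86) can be illustrated numerically. The following is only a sketch under stated assumptions: it uses the one-dimensional infinite-medium kernel $(4\pi\beta t)^{-1/2} e^{-x^2/4\beta t}$ (so there are no boundary or source terms), a hypothetical initial profile $T_0(x) = e^{-x^2}$, and function names chosen here for illustration. The convolution is compared with the closed form known for a Gaussian profile.

```python
import math


def heat_kernel_1d(x, t, beta):
    """Infinite-medium Green's function of dT/dt - beta * d2T/dx2 = 0 in one dimension."""
    return math.exp(-x * x / (4.0 * beta * t)) / math.sqrt(4.0 * math.pi * beta * t)


def evolve_initial_profile(x, t, beta, initial, x_min=-20.0, x_max=20.0, steps=4000):
    """Midpoint-rule estimate of T(x, t) = integral of T0(x') G(x - x', t) dx'."""
    h = (x_max - x_min) / steps
    total = 0.0
    for i in range(steps):
        xp = x_min + (i + 0.5) * h
        total += initial(xp) * heat_kernel_1d(x - xp, t, beta)
    return total * h


def gaussian_initial(x):
    """Hypothetical initial temperature profile T0(x) = exp(-x^2)."""
    return math.exp(-x * x)


def gaussian_exact(x, t, beta):
    """Closed-form evolution of exp(-x^2): a Gaussian that widens and flattens."""
    width = 1.0 + 4.0 * beta * t
    return math.exp(-x * x / width) / math.sqrt(width)


if __name__ == "__main__":
    numeric = evolve_initial_profile(0.7, 0.5, 1.0, gaussian_initial)
    print(numeric, gaussian_exact(0.7, 0.5, 1.0))
```

The same kernel would also weight the boundary and source integrals of (6.86); they are omitted here only to keep the sketch short.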

Care must be taken, however, for the case in which T is given on the boundary. If we use the Green’s function for which G = 0 on the boundary, we see from (6.85) that the boundary contribution to the solution is

$$-\beta \int_0^t dt' \int T\,\frac{\partial}{\partial n'} G(\mathbf{r}',\mathbf{r},t-t')\,dS'$$

where the integral is taken over the boundary, and all $t' < t$.


Problem 6-10: A sphere of specific heat C and radius a is heated by a uniform source of strength S in its interior, and radiates freely into its surroundings, which are maintained at a fixed temperature $T_0$. Show that its equilibrium temperature distribution is


Pac er 


where the $\lambda_n$’s are the solutions of the equation $\tan\lambda_n a = \lambda_n a/(1+\gamma a)$ and the $f_n$’s are
____1 4ya@’S___cosd,a 
fa= KA,? 1+ ya 2A,a—sin2),a 


Problem 6-11: The temperature at the surface of a semi-infinite medium is varied with time, $T = T_0(t)$. Assuming no other sources, determine the temperature inside the medium. Consider the special case of a periodic variation


Determine at a depth x the phase lag of the temperature variation com- 
pared with that at the surface, and the decrease in the amplitude of the 
variation. Taking for ordinary soil « = 2 X 10-? cm/sec, apply the results 
obtained to the diurnal and annual variations of the earth’s temperature. 


8. Green’s Functions and the Free-Particle Schrödinger Equation


There is a close formal similarity between the equations of diffusion or heat conduction and the time-dependent Schrödinger equation for a free particle, and this similarity makes it interesting to consider the Green’s function for the latter problem. The Schrödinger equation is, as will be shown in Chapter 8,

$$i\hbar\,\frac{\partial\psi}{\partial t} = -\frac{\hbar^2}{2m}\nabla^2\psi \tag{6.87}$$


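where $\hbar$ is $1/(2\pi)$ times Planck’s constant and m is the mass of the particle. Defining the Green’s function for the equation by

$$i\hbar\,\frac{\partial G}{\partial t} + \frac{\hbar^2}{2m}\nabla^2 G = i\hbar\,\delta(\mathbf{r}-\mathbf{r}')\,\delta(t-t') \tag{6.88}$$

or

$$\frac{\partial G}{\partial t} - \frac{i\hbar}{2m}\nabla^2 G = \delta(\mathbf{r}-\mathbf{r}')\,\delta(t-t') \tag{6.89}$$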


we note the similarity with (6.20). The real constant $\beta$ has been replaced by the imaginary one $i\hbar/2m$. From (6.25) we then write down the Green’s function for the Schrödinger equation in an infinite medium,

$$G(\mathbf{r}-\mathbf{r}',t-t') = \left(\frac{m}{2\pi i\hbar(t-t')}\right)^{3/2} e^{-m|\mathbf{r}-\mathbf{r}'|^2/2i\hbar(t-t')} \tag{6.90}$$
The most interesting application of this result is that it enables us to follow 
the history of the particle, knowing its wave function at an initial time. 
This is given by a simple adaptation of (6.86): 

$$\psi(\mathbf{r},t) = \int d^3r'\,\psi(\mathbf{r}',t')\,G(\mathbf{r}-\mathbf{r}',t-t') \tag{6.91}$$


A particularly simple example is that in which the particle has initially a 
Gaussian distribution: 


$$\psi(\mathbf{r}',0) = \left(\frac{\alpha}{\pi}\right)^{3/4} e^{-(1/2)\alpha r'^2} \tag{6.92}$$




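where the constant has been chosen so that $\int |\psi|^2\,d^3r = 1$. Then

$$\psi(\mathbf{r},t) = \left(\frac{\alpha}{\pi}\right)^{3/4}\left(\frac{m}{2\pi i\hbar t}\right)^{3/2} \int e^{-(1/2)\alpha r'^2}\,e^{-m|\mathbf{r}-\mathbf{r}'|^2/2i\hbar t}\,d^3r' \tag{6.93}$$
$$= \left(\frac{\alpha}{\pi}\right)^{3/4}\left(\frac{m}{2\pi i\hbar t}\right)^{3/2} e^{-mr^2/2i\hbar t} \int e^{-(1/2)\alpha r'^2}\,e^{-mr'^2/2i\hbar t}\,e^{m\mathbf{r}\cdot\mathbf{r}'/i\hbar t}\,d^3r'$$

The integral is

$$4\pi \int_0^\infty e^{-(1/2)[\alpha-(im/\hbar t)]r'^2}\,\frac{\sin(mrr'/\hbar t)}{(mrr'/\hbar t)}\,r'^2\,dr'$$
$$= \frac{2\pi\hbar t}{imr} \int_0^\infty e^{-(1/2)[\alpha-(im/\hbar t)]r'^2}\left(e^{imrr'/\hbar t} - e^{-imrr'/\hbar t}\right) r'\,dr'$$
$$= \frac{2\pi\hbar t}{imr} \int_{-\infty}^{\infty} e^{-(1/2)[\alpha-(im/\hbar t)]r'^2}\,e^{imrr'/\hbar t}\,r'\,dr'$$
$$= 2\pi\left(\frac{\hbar t}{im}\right)^2 \frac{1}{r}\,\frac{\partial}{\partial r} \int_{-\infty}^{\infty} e^{-(1/2)[\alpha-(im/\hbar t)]r'^2}\,e^{imrr'/\hbar t}\,dr'$$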


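The integral may now be converted into a Gaussian integral by completing the square. When this is done, and the result is substituted into the expression for $\psi(\mathbf{r},t)$, it is found that

$$\psi(\mathbf{r},t) = \left(\frac{\alpha}{\pi}\right)^{3/4}\left(\frac{1}{1+(i\alpha\hbar t/m)}\right)^{3/2} \exp\left[\frac{-\tfrac{1}{2}\alpha r^2}{1+(\alpha^2\hbar^2t^2/m^2)}\left(1-\frac{i\alpha\hbar t}{m}\right)\right] \tag{6.94}$$

The magnitude squared of this quantity, which gives the probability distribution, is

$$|\psi(\mathbf{r},t)|^2 = \left[\frac{\alpha}{\pi\left(1+(\alpha^2\hbar^2t^2/m^2)\right)}\right]^{3/2} \exp\left[\frac{-\alpha r^2}{1+(\alpha^2\hbar^2t^2/m^2)}\right] \tag{6.95}$$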


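Thus, the distribution remains Gaussian but broadens with time. The peak also decreases, the area under the curve remaining constant. Since the square of the width is proportional to $1+(\alpha^2\hbar^2t^2/m^2)$, the time required for the width of the distribution to double is given by

$$\frac{\alpha^2\hbar^2t^2}{m^2} = 3$$

so that

$$t = \frac{\sqrt{3}\,m}{\alpha\hbar} \tag{6.96}$$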
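The spreading described by (6.95) and the doubling time (6.96) can be checked with a short numerical sketch. It assumes units in which $\hbar = m = 1$ unless other values are passed, integrates the radial density with a simple midpoint rule, and uses function names that are illustrative only.

```python
import math


def probability_density(r, t, alpha, hbar=1.0, m=1.0):
    """|psi(r, t)|^2 for the spreading Gaussian packet, Eq. (6.95)."""
    spread = 1.0 + (alpha * hbar * t / m) ** 2
    return (alpha / (math.pi * spread)) ** 1.5 * math.exp(-alpha * r * r / spread)


def radial_moment(power, t, alpha, r_max=40.0, steps=20000):
    """Midpoint-rule estimate of the integral of r^power |psi|^2 over all space."""
    h = r_max / steps
    total = 0.0
    for i in range(steps):
        r = (i + 0.5) * h
        total += 4.0 * math.pi * r ** (2 + power) * probability_density(r, t, alpha)
    return total * h


def rms_radius(t, alpha):
    """Root-mean-square radius, used here as the measure of the width."""
    return math.sqrt(radial_moment(2, t, alpha) / radial_moment(0, t, alpha))


def doubling_time(alpha, hbar=1.0, m=1.0):
    """Time at which the width has doubled, Eq. (6.96)."""
    return math.sqrt(3.0) * m / (alpha * hbar)


if __name__ == "__main__":
    alpha = 1.0
    t2 = doubling_time(alpha)
    print("norm at t2:", radial_moment(0, t2, alpha))
    print("width ratio:", rms_radius(t2, alpha) / rms_radius(0.0, alpha))
```

The norm stays at unity while the width ratio comes out as two, in line with the statement that the area under the curve is conserved as the packet broadens.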


Problem 6-12: Determine $\psi(\mathbf{r},t)$ when $\psi(\mathbf{r},0) = (\alpha/\pi)^{3/4}\,e^{-(1/2)\alpha r^2}\,e^{i\mathbf{k}\cdot\mathbf{r}}$ and show that in this case the peak of the distribution moves with velocity $\hbar\mathbf{k}/m$ but otherwise behaves as in the case treated in the text.




Problem 6-13: By using the Fourier transform method, derive the formula (6.90) for the Green’s function, and Eq. (6.94) for the spread of a Gaussian wave packet.


The problem may also be treated by means of Fourier transforms in both the space and time variables. Letting $g(\mathbf{k},\omega)$ be the transform of $iG(\mathbf{r},t)$,

$$g(\mathbf{k},\omega) = \frac{1}{(2\pi)^4}\,\frac{1}{(\hbar k^2/2m)-\omega}$$

Since the wave function at time t is the spatial convolution of the wave function at t = 0 and i times the Green’s function, if $\phi(\mathbf{k},\omega)$ is the transform of $\psi(\mathbf{r},t)$ and $\phi_0(\mathbf{k})$ is the transform of $\psi(\mathbf{r},0) = \psi_0(\mathbf{r})$, by the convolution theorem for transforms

$$\phi(\mathbf{k},\omega) = \frac{1}{2\pi}\,\frac{\phi_0(\mathbf{k})}{(\hbar k^2/2m)-\omega}$$

Ambiguities arising from inversion in the time variable may be resolved by imposing the condition that the causality condition must hold [see Chapter 4, pp. 203-204].


9. Diffusion of Neutrons 


The elementary theory of the diffusion of neutrons in condensed 
media has been extensively studied in connection with the development 
of nuclear chain reactors. The problem is more complicated than the dif- 
fusion and heat conduction problems already discussed, due to several 
factors: 

(a) The neutrons are produced at high energies and are slowed down or moderated by collisions with the atoms of the crystal. One must, then, consider not only their total density but their density at each energy; that is, one must treat the moderation process.

(b) Neutrons may be “captured” in the medium, that is, there are “sinks” (negative sources) proportional to the density.

(c) If fissile materials are present, there will also be sources, again proportional to density.

The treatment we give is a simplified and somewhat idealized one. In 
the first place, we deal only with homogeneous systems. Following usual 
practice, we divide the neutrons into two categories: 

(i) “fast” neutrons which have not yet come into thermal equilibrium with their surroundings, and

(ii) “thermal” neutrons, whose energy should have an equilibrium distribution (a Maxwell distribution) with mean energy $\tfrac{3}{2}k_B T$ at temperature T, but which we in fact consider together, ignoring their energy variation.

[Footnote 3: $k_B$ is the Boltzmann constant, whose value is $1.38 \times 10^{-16}$ erg/deg.]

The treatment of these “thermal” neutrons is the easier part of the problem. We treat them first, formulating the problem for arbitrary source distributions. When we subsequently turn to the problem of the “fast” neutrons, we will be able to consider those which become fully moderated as constituting the “source” of thermal neutrons.

We also confine our attention primarily to “steady-state” problems, in which the densities do not vary with time. The others are of course mathematically considerably more complicated. The steady-state problems are, however, as we will see, already comparable with the other problems to be discussed in this chapter.


10. Diffusion of “Thermal” Neutrons


We start, as usual, with the “equation of continuity”: (rate of change of number of neutrons in a given volume) = (rate of production in the volume) − (rate of loss) − (efflux over the boundaries). Letting the number of thermal neutrons per unit volume be $\rho$, the rate of change of the number in a fixed volume $V_0$ is $(\partial/\partial t)\int_{(V_0)} \rho\,d^3r$. Defining the “source density” $S_0(\mathbf{r})$, the rate of production in the volume is $\int_{(V_0)} S_0(\mathbf{r})\,d^3r$. If the probability per unit time of “capture” of each neutron is $1/\tau$, the rate of loss is $(1/\tau)\int_{(V_0)} \rho\,d^3r$. Finally, the rate of efflux is $\int_{(\Sigma_0)} \mathbf{j}\cdot\mathbf{n}\,dS$, a normal surface integral over the surface $(\Sigma_0)$ bounding the volume $V_0$. $\mathbf{j}$ is the thermal neutron current, that is, $\mathbf{j}\cdot\mathbf{n}$ is the number flowing over unit area perpendicular to the unit vector $\mathbf{n}$ per unit time. The surface integral may be converted by Gauss’ theorem into a volume integral over $V_0$, becoming $\int_{(V_0)} \nabla\cdot\mathbf{j}\,d^3r$. If we then assume Fick’s law,

$$\mathbf{j} = -D\nabla\rho \tag{6.97}$$

the continuity equation takes the form

$$\int_{(V_0)} \left\{\frac{\partial\rho}{\partial t} - S_0 + \frac{\rho}{\tau} - D\nabla^2\rho\right\} d^3r = 0 \tag{6.98}$$

Since this is true for an arbitrary volume, the integrand must be zero everywhere, which yields the diffusion equation

$$\frac{\partial\rho}{\partial t} + \frac{\rho}{\tau} - D\nabla^2\rho = S_0 \tag{6.99}$$




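Problem 6-14: Show that the infinite-medium Green’s function (point source solution) for Eq. (6.99) is

$$G(\mathbf{r}-\mathbf{r}',t) = \left(\frac{\tau}{4\pi L^2 t}\right)^{3/2} e^{-t/\tau}\,e^{-|\mathbf{r}-\mathbf{r}'|^2\tau/4L^2t}$$

where the “diffusion length” L is given by $L^2 = D\tau$.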


It may be shown by more detailed and rigorous analysis that Fick’s 
law is valid only so long as the angular distribution of neutron directions 
at a given point is not too anisotropic. It will thus not be valid near local- 
ized sources (where almost all neutrons are flowing away from the source), 
or near boundaries, where almost all are flowing outward. The first re- 
striction is not too important in the problems we consider. As for the 
second, it may be shown once again that the error introduced by the 
assumption of Fick’s law is appreciable only very close to the surface 
(within a distance comparable with the “scattering length”, the mean
distance travelled by neutrons between scatterings); the density further 
in the interior of the medium may be correctly determined by making a 
small change in boundary conditions, corresponding to the introduction 
of an artificial boundary slightly outside the actual one. We, therefore, 
do not concern ourselves further with this point. 

The “steady-state” equation is obtained by putting the time-variable term equal to zero. Furthermore, let us divide through by D, and put

$$D\tau = L^2 \tag{6.100}$$

and

$$S = S_0/D \tag{6.101}$$

so that the equation takes the form

$$\nabla^2\rho - \frac{\rho}{L^2} = -S \tag{6.102}$$


This is now a steady-state diffusion equation with the added feature of the dissipation term $-\rho/L^2$. The dissipation aspect is one which will characterize all subsequent problems to be discussed in this chapter.

The quantity L defined by (6.100) has the dimensions of a length, and characterizes the scale of variation of the density. It is often referred to as the “diffusion length.” The quantity $\nabla^2\rho$ characterizes the “curvature” or “buckling” of the distribution, which is then determined by the source distribution, which tends to increase the current, and the dissipation, which tends to decrease it.

It is a trivial matter to verify that the Green’s function for the one-dimensional problem in an infinite medium

$$\frac{d^2}{dz^2} G_0(z) - \frac{1}{L^2}\,G_0(z) = -\delta(z) \tag{6.103}$$

is

$$G_0(z) = \frac{L}{2}\,e^{-|z|/L} \tag{6.104}$$

By using Eqs. (3.168) and (3.170), we may derive the “point source” and “shell source” Green’s functions in an infinite medium:

$$G_p(r) = \frac{e^{-r/L}}{4\pi r} \tag{6.105}$$

and

$$G_s(r,r') = \frac{L}{8\pi rr'}\left[e^{-|r-r'|/L} - e^{-|r+r'|/L}\right] \tag{6.106}$$


For finite media with a linear geometry or a spherical geometry, the 
method of images may be used to obtain the appropriate Green’s func- 
tions. In neutron diffusion, the most interesting problems are those in 
which there is free efflux over the boundary. The boundary condition in 
this case is that the density goes to zero at the “artificial boundary” 
which is only a fraction of a scattering length outside the real one. This 
scattering length is in general only a minute fraction of the dimensions of 
the media being considered. 

In the case of a plane source at $z = z'$ in a medium bounded at $z = 0$ and $z = a$, the Green’s function can be written in terms of an infinite series of images

$$G(z,z') = \sum_{n=-\infty}^{\infty}\left[G_0(z-z'-2na) - G_0(z+z'-2na)\right] \tag{6.107}$$

By Poisson’s formula (4.82) this may be put in an alternative form, in terms of the Fourier transform $g_0(k)$ of $G_0(z-z')$:

$$g_0(k) = \frac{1}{2\pi}\,\frac{L^2}{1+k^2L^2}\,e^{-ikz'} \tag{6.108}$$

The Green’s function (6.107) may then be written in the form

$$G(z,z') = \frac{2}{a}\sum_{n=1}^{\infty}\frac{L^2}{1+(n^2\pi^2L^2/a^2)}\,\sin\frac{n\pi z}{a}\,\sin\frac{n\pi z'}{a} \tag{6.109}$$
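The equivalence of the image series and the Fourier series for the slab can be checked numerically. The sketch below assumes the infinite-medium function $G_0(z) = (L/2)e^{-|z|/L}$, a slab with zero density at $z = 0$ and $z = a$, and illustrative values of a, L, z and z'; the function names and truncation limits are choices made for this example only.

```python
import math


def g0(z, L):
    """One-dimensional infinite-medium Green's function (L/2) exp(-|z|/L)."""
    return 0.5 * L * math.exp(-abs(z) / L)


def green_images(z, zp, a, L, n_max=50):
    """Slab Green's function as a truncated series of image sources."""
    total = 0.0
    for n in range(-n_max, n_max + 1):
        total += g0(z - zp - 2 * n * a, L) - g0(z + zp - 2 * n * a, L)
    return total


def green_fourier(z, zp, a, L, n_max=20000):
    """Slab Green's function as a truncated Fourier sine series."""
    total = 0.0
    for n in range(1, n_max + 1):
        k = n * math.pi / a
        total += L * L / (1.0 + k * k * L * L) * math.sin(k * z) * math.sin(k * zp)
    return 2.0 * total / a


if __name__ == "__main__":
    a, L = 1.0, 0.5
    print(green_images(0.3, 0.6, a, L), green_fourier(0.3, 0.6, a, L))
    print("value on the boundary z = 0:", green_images(0.0, 0.6, a, L))
```

With these values only a handful of images contribute appreciably, whereas the sine series needs many terms because its coefficients fall off only as $1/n^2$.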


Problem 6-15: Find the Green’s function for a source at point $x_1, y_1, z_1$ in the rectangular parallelepiped bounded by x = 0, a; y = 0, b; z = 0, c; (i) as a series of images and (ii) in a Fourier series form.




Problem 6-16: If, for each neutron captured, k new neutrons are produced, show that the steady-state equation for a reactor is

$$\nabla^2\rho + \frac{k-1}{L^2}\,\rho = 0$$

Assuming that $\rho = 0$ on the surface of a spherical reactor of radius a, find a such that the density is constant for a given multiplication constant k.

If we consider a reactor like the above surrounded by a nonmultiplying medium with diffusion length $L'$, where

$$\frac{k-1}{L^2} < \frac{1}{L'^2}$$

calculate approximately the reduction in a for a given k if the reactor remains steady. [Assume $\rho$, $(\partial\rho/\partial n)$ continuous at the boundary between reactor and sheath.]


Similarly, the Green’s function for a shell source at $r'$ in a spherical medium of radius a is, in terms of images,

$$G_s(r,r') = \frac{L}{8\pi rr'}\sum_{n=-\infty}^{\infty}\left[e^{-|r-r'+2na|/L} - e^{-|r+r'+2na|/L}\right] \tag{6.110}$$

or in terms of Fourier series,

$$G_s(r,r') = \frac{1}{2\pi rr'a}\sum_{n=1}^{\infty}\frac{L^2}{1+(n^2\pi^2L^2/a^2)}\,\sin\frac{n\pi r'}{a}\,\sin\frac{n\pi r}{a} \tag{6.111}$$

These solutions complement each other in that the first converges rapidly for a > L and the second for a < L.

A case of some interest in practice is that of a cylindrical medium of radius a and height h, that is, extending between z = 0 and h. Since we do not know how to do this by the method of images, let us do it directly by the method of expansion in series. We assume a series solution of the form

$$G = \sum_{l}\sum_{m}\sum_{n} A_{lmn}\,J_m\!\left(\lambda_{lm}\frac{r}{a}\right) e^{im\phi}\,\sin\frac{n\pi z}{h} \tag{6.112}$$

$J_m$ is the Bessel function of integer order m, and the $\lambda_{lm}$’s are its zeros. If the unit source has coordinates $(r', \phi', z')$ it may be represented by a similar expansion.

$$\frac{1}{r}\,\delta(r-r')\,\delta(\phi-\phi')\,\delta(z-z') = \sum_{l}\sum_{m}\sum_{n} a_{lmn}\,J_m\!\left(\lambda_{lm}\frac{r}{a}\right) e^{im\phi}\,\sin\frac{n\pi z}{h} \tag{6.113}$$



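Using the orthogonality of the expansion functions, we find that

$$-a_{lmn}\,\frac{\pi a^2 h}{2}\,J_{m-1}(\lambda_{lm})\,J_{m+1}(\lambda_{lm}) = J_m\!\left(\lambda_{lm}\frac{r'}{a}\right) e^{-im\phi'}\,\sin\frac{n\pi z'}{h}$$

or

$$a_{lmn} = \frac{-2}{\pi a^2 h\,J_{m-1}(\lambda_{lm})\,J_{m+1}(\lambda_{lm})}\,J_m\!\left(\lambda_{lm}\frac{r'}{a}\right) e^{-im\phi'}\,\sin\frac{n\pi z'}{h} \tag{6.114}$$

Substituting both (6.112) and (6.113) in the Green’s function equation

$$\nabla^2 G - \frac{G}{L^2} = -\delta(\mathbf{r}-\mathbf{r}') = -\frac{1}{r}\,\delta(r-r')\,\delta(\phi-\phi')\,\delta(z-z') \tag{6.115}$$

we obtain the equation for $A_{lmn}$,

$$A_{lmn} = \frac{a_{lmn}}{(\lambda_{lm}^2/a^2) + (n^2\pi^2/h^2) + (1/L^2)} \tag{6.116}$$

so that the Green’s function is given by the expansion

$$G = -\frac{2}{\pi a^2 h}\sum_{l,m,n}\frac{1}{J_{m-1}(\lambda_{lm})\,J_{m+1}(\lambda_{lm})}\;\frac{J_m\!\left(\lambda_{lm}\frac{r'}{a}\right) e^{-im\phi'}\sin\frac{n\pi z'}{h}\;J_m\!\left(\lambda_{lm}\frac{r}{a}\right) e^{im\phi}\sin\frac{n\pi z}{h}}{(\lambda_{lm}^2/a^2) + (n^2\pi^2/h^2) + (1/L^2)} \tag{6.117}$$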


It is clear that, because of the Bessel functions, this is not in a form which 
will permit the application of the Poisson formula to obtain a solution in 
terms of image sources. 


11. Moderation of Fast Neutrons 


The process of slowing down of neutrons by collision with atoms of 
the moderating material is a stochastic one, and hence the moderation 
process should, strictly speaking, be treated by the theory of probability. 
We should, for instance, to be precise, deal with the probability that a 
neutron loses a certain amount of energy in a given time. We simplify 
matters, however, by dealing only with averages, ignoring the fluctuations 
for individual neutrons. That this is justified will be seen, from the theory 
to be developed in the next chapter for stochastic processes, to be 
assured by the great number of particles being considered. 

Elementary dynamics tells us that, in a pure medium, the average fractional energy loss per collision is independent of the energy (or velocity) of the particle. In the time interval $(t, t+dt)$ the fractional energy loss $-(dE/E)$ = probability of a collision $P_s(E)\,dt$, times the fractional energy loss per collision f. [$P_s(E)$ is the probability per unit time of a scattering collision at energy E.] Thus

$$-\frac{dE}{E} = f\,P_s(E)\,dt \tag{6.118}$$


This equation establishes a unique relation between energy and time, which is completely determined if the initial energy is known. Thus, instead of specifying the neutrons by their energy, they can be specified by their “age”, the time interval t required for them to reach that energy. We designate this age as $\xi$.

Let us now specify that $\chi(\mathbf{r},t,\xi)$ is the density at time t of neutrons in the age interval $(\xi, \xi+d\xi)$. Since the age $\xi$ of each individual neutron increases in proportion to time elapsed, $\chi$ is also the number of neutrons reaching the age $\xi$ per unit time.
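As a worked illustration of how (6.118) fixes the age once the collision rate is known, the relation can be integrated directly. The first line below follows from (6.118) alone, with the age written as $\xi$ and $E_0$ the initial energy. The second part is only a sketch under an added assumption that is not made in the text: a collision rate $P_s = v/\ell$ with a mean free path $\ell$ independent of energy, $v$ being the neutron speed, $v_0$ its initial value and $E = \tfrac{1}{2}Mv^2$.

```latex
% General relation obtained by integrating (6.118) from the initial energy E_0:
\xi(E) = \int_{E}^{E_0} \frac{dE'}{f\,P_s(E')\,E'}

% Assumed special case: P_s = v/\ell with \ell constant, so that dE/E = 2\,dv/v
-\frac{2\,dv}{v} = f\,\frac{v}{\ell}\,dt
\quad\Longrightarrow\quad
\xi(v) = \frac{2\ell}{f}\left(\frac{1}{v} - \frac{1}{v_0}\right)
```

In this assumed case most of the age is accumulated at the lowest speeds, since the time between collisions grows as the neutron slows down.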

$\chi$ must of course be determined by an equation of continuity. We derive it from the statement that the number of neutrons of age $(\xi, \xi+d\xi)$ at time t in a volume V is the number at age $\xi - dt$ at time $t - dt$, minus the number of that age which have flowed out over the boundary, minus the number unproductively captured in the medium, plus the number at that age produced in the volume per unit time.

Introducing the notations:

$\mathbf{J}(\mathbf{r},t,\xi)$ = current density of neutrons of age $(\xi, \xi+d\xi)$ at time t

$P_c(\xi)$ = probability per unit time of capture at age $\xi$

$\sigma_0(\mathbf{r},t,\xi)$ = source density of neutrons of age $\xi$ at time t

we may express the equation of continuity in the mathematical form

$$\int \chi(\mathbf{r},t,\xi)\,d^3r\,d\xi = \int \chi(\mathbf{r},t-dt,\xi-dt)\,d^3r\,d\xi$$
$$-\ dt \int \mathbf{J}(\mathbf{r},t,\xi)\cdot\mathbf{n}\,dS\,d\xi$$
$$-\ dt\,P_c(\xi) \int \chi(\mathbf{r},t,\xi)\,d^3r\,d\xi$$
$$+\ dt \int \sigma_0(\mathbf{r},t,\xi)\,d^3r\,d\xi \tag{6.119}$$


The volume integrals are over the volume V, the surface integral over the surface surrounding it. The fission processes produced by fast neutrons have been neglected.

Introducing again Fick’s law for fast neutrons

$$\mathbf{J}(\mathbf{r},t,\xi) = -D_f(\xi)\,\nabla\chi(\mathbf{r},t,\xi) \tag{6.120}$$

and converting the surface integral into a volume integral, we get the differential equation

$$\frac{\partial\chi}{\partial t} + \frac{\partial\chi}{\partial\xi} - D_f\nabla^2\chi + P_c\chi = \sigma_0 \tag{6.121}$$


A convenient simplification may be made by introducing the “symbolic age”

$$\theta = \int_0^{\xi} D_f(\xi')\,d\xi' \tag{6.122}$$

which has the dimensions of the square of a length. $D_f$ may then, in principle, be written as a function of $\theta$. Equation (6.121) becomes

$$\frac{1}{D_f}\frac{\partial\chi}{\partial t} + \frac{\partial\chi}{\partial\theta} - \nabla^2\chi + \frac{P_c}{D_f}\,\chi = \sigma \tag{6.123}$$

where

$$\sigma = \frac{\sigma_0}{D_f} \tag{6.124}$$

is the source density per unit range of symbolic age.


12. Model of a Neutron Chain Reactor


We may now develop a simplified model of a homogeneous chain 
reactor, based on the following assumptions: 

(1) The “fast” neutrons produced in the process of fission are described by (6.123), and reach “thermal” energy at the symbolic age $\theta = \theta_0$. (The assumption of a unique symbolic age for thermal energy implies that all fission neutrons are produced at the same energy, a convenient mathematical idealization. If it is not made, the analysis becomes considerably more cumbersome.)

(2) The neutrons reaching this age are regarded as sources of thermal neutron density. As remarked above, the density $\chi$ per unit age interval is the number per unit volume reaching this age per unit time; therefore, $\chi(\mathbf{r},t,\theta_0)$ is the source $S_0$ in the thermal neutron equation. This equation then becomes

$$\frac{1}{D}\frac{\partial\rho}{\partial t} - \nabla^2\rho + \frac{\rho}{L^2} = \frac{1}{D}\,\chi(\mathbf{r},t,\theta_0) \tag{6.125}$$

(3) Since the “thermal” neutrons which are captured may give rise to fission, and therefore act as sources of fast neutrons (of symbolic age zero), the source term in the fast neutron equation in a reactor isolated from external sources may be expressed in terms of the thermal neutron density.

In time-dependent problems, however, it is important to take account of the lapse of time between slow neutron capture and the creation of fission neutrons. The majority of these neutrons are emitted at the instant of splitting of the capturing nucleus. The time interval between capture and fission is very short on the macroscopic time scale; we therefore neglect it and assume that their contribution to the fast neutron source is

$$\sigma_{01} = \frac{n_0\,\rho(\mathbf{r},t)}{\tau} \tag{6.126}$$

$n_0$ is the average number of “prompt” neutrons arising from capture. (It takes account, of course, of the fact that some of the capture processes are “parasitic” and do not give rise to fission.)

There are, however, some fast neutrons which arise from the radioactive decay of fission products. Since these decays may be relatively long-lived, their contribution to the source does not depend on $\rho(t)$ but on the thermal neutron density for earlier times.

There will in fact be several such “delayed neutron” lifetimes: we, 
however, idealize the problem by considering only one. (Again, the 
problem is quite tractable without this assumption, but becomes formally 
more complicated. Our model is quite adequate to show the physical 
effect of the delayed neutrons.) 

Consider, then, fission products with neutron emission lifetime T. Such fission products decay with time according to the law

$$e^{-t/T} \tag{6.127}$$

Suppose that the number of such neutron-active fission products produced per capture is $n - n_0$ (so that n is the total number of fission neutrons resulting from a capture). The number produced per unit time at time $t'$ is then

$$\frac{(n-n_0)\,\rho(\mathbf{r},t')}{\tau}$$

The probability per unit time that any one of them will decay at time $t\ (> t')$ is $(1/T)\,e^{-(t-t')/T}$. Consequently, they contribute to the source at time t a term

$$\sigma_{02} = \frac{n-n_0}{\tau}\int_0^t \rho(\mathbf{r},t')\,\frac{1}{T}\,e^{-(t-t')/T}\,dt' \tag{6.128}$$

Here $t' = 0$ is a time in the past at which the system was put into operation. Adding (6.126) and (6.128), multiplying by $\delta(\xi) = D_f\,\delta(\theta)$, and dividing by $D_f$ as required by (6.124), we get the total source to be used in the fast neutron equation (6.123). This equation then takes the form

$$\frac{1}{D_f}\frac{\partial\chi}{\partial t} + \frac{\partial\chi}{\partial\theta} - \nabla^2\chi = \frac{\delta(\theta)}{\tau}\left[n_0\,\rho(\mathbf{r},t) + (n-n_0)\int_0^t \rho(\mathbf{r},t')\,e^{-(t-t')/T}\,\frac{dt'}{T}\right] \tag{6.129}$$

The equations (6.125) and (6.129) therefore constitute the basic equations for our model chain reactor.

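In the case of steady-state problems, these equations take the simpler form

$$\nabla^2\rho - \frac{\rho}{L^2} = -\frac{1}{D}\,\chi(\mathbf{r},\theta_0) \tag{6.130}$$

and

$$\frac{\partial\chi}{\partial\theta} - \nabla^2\chi = \frac{n}{\tau}\,\rho(\mathbf{r})\,\delta(\theta) \tag{6.131}$$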




13. The Problem of Criticality 


We first use the time-dependent equations (6.125) and (6.129) to 
investigate when the reactor is subcritical and when it is supercritical; 
that is, when an initial distribution decays and when it “explodes.” At the 
same time, we determine the time scale for these contingencies. 

For finite systems, it is not really necessary to solve a particular 
equation for the spatial variation of the densities. Whatever the geometry, 
we can solve the eigenvalue problem 


$$\nabla^2\binom{\rho}{\chi} + \beta^2\binom{\rho}{\chi} = 0 \tag{6.132}$$
subject to the given boundary conditions. 


Problem 6-17: (a) Show that, for a system in the form of a rectangular parallelepiped of dimensions a × b × c, the possible values of $\beta^2$ are

$$\beta^2 = \pi^2\left(\frac{l^2}{a^2} + \frac{m^2}{b^2} + \frac{n^2}{c^2}\right)$$

where l, m, and n are arbitrary integers.
(b) Show that, for a cylindrical reactor of height / and radius R 


Pa an 
Oe TR 
where 
Jin Qmn) = 0 


land m being integers. 


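Each spatial mode determined by (6.132) has its separate evolution in time, determined by diffusion equations. If normalized solutions of (6.132) are designated by $f_\beta$, solutions for the densities may be found in the form

$$\rho = \rho_0(t)\,f_\beta \tag{6.133}$$

$$\chi = \chi_0(t,\theta)\,f_\beta \tag{6.134}$$

$\rho_0$ and $\chi_0$ then satisfy the coupled equations

$$\frac{1}{D}\frac{\partial\rho_0}{\partial t} + \left(\beta^2 + \frac{1}{L^2}\right)\rho_0 = \frac{1}{D}\,\chi_0(t,\theta_0) \tag{6.135}$$

and

$$\frac{1}{D_f}\frac{\partial\chi_0}{\partial t} + \frac{\partial\chi_0}{\partial\theta} + \beta^2\chi_0 = \frac{\delta(\theta)}{\tau}\left[n_0\,\rho_0(t) + (n-n_0)\int_0^t \rho_0(t')\,e^{-(t-t')/T}\,\frac{dt'}{T}\right] \tag{6.136}$$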


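Let us now take Laplace transforms in time, designating the transforms, respectively, as $\bar\rho_0(s)$ and $\bar\chi_0(s,\theta)$. The initial values of $\rho_0$ and $\chi_0$ are taken to be

$$\rho_0(0) = c \tag{6.137}$$

$$\chi_0(0,\theta) = \gamma(\theta) \tag{6.138}$$

respectively. The transformed equations are then

$$-\frac{c}{D} + \frac{s}{D}\,\bar\rho_0 + \left(\beta^2 + \frac{1}{L^2}\right)\bar\rho_0 = \frac{1}{D}\,\bar\chi_0(s,\theta_0) \tag{6.139}$$

and

$$-\frac{\gamma(\theta)}{D_f} + \frac{s}{D_f}\,\bar\chi_0 + \frac{\partial\bar\chi_0}{\partial\theta} + \beta^2\bar\chi_0 = \frac{\delta(\theta)}{\tau}\left[\frac{n+n_0sT}{1+sT}\right]\bar\rho_0 \tag{6.140}$$

Let us now define a Green’s function corresponding to (6.140):

$$\frac{\partial g(\theta,\theta')}{\partial\theta} + \left(\beta^2 + \frac{s}{D_f(\theta)}\right) g(\theta,\theta') = \delta(\theta-\theta') \tag{6.141}$$

It is then easy to show that, for $\theta > \theta'$,

$$g(\theta,\theta') = \exp\left[-\beta^2(\theta-\theta') - s\int_{\theta'}^{\theta}\frac{d\theta''}{D_f(\theta'')}\right] \tag{6.142}$$

The solution of (6.140) may be written in terms of this Green’s function as

$$\bar\chi_0 = \frac{1}{\tau}\,\bar\rho_0\,\frac{n+n_0sT}{1+sT}\,g(\theta,0) + \int_0^{\theta}\frac{\gamma(\theta')}{D_f(\theta')}\,\exp\left[-\beta^2(\theta-\theta') - s\{\xi(\theta)-\xi(\theta')\}\right] d\theta' \tag{6.143}$$

We have used (6.122) to introduce $\xi(\theta)$.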
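Substituting from (6.143) in (6.139) we find for $\bar\rho_0$ the value

$$\bar\rho_0 = \frac{1}{s + D\beta^2 + \dfrac{1}{\tau} - \dfrac{1}{\tau}\,\dfrac{n+n_0sT}{1+sT}\,e^{-\beta^2\theta_0 - s\xi(\theta_0)}}$$
$$\times \left\{c + \int_0^{\theta_0}\frac{\gamma(\theta')}{D_f(\theta')}\,\exp\left[-\beta^2(\theta_0-\theta') - s\{\xi(\theta_0)-\xi(\theta')\}\right] d\theta'\right\} \tag{6.144}$$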

In general the inversion of this formula to obtain $\rho_0(t)$ is a formidable task. However, we know from the Laplace inverse theorem (4.182) that the solution is a combination of exponentials whose exponents are the zeros of the denominator of (6.144). Criticality, for a given mode, then depends on having a zero with a positive real part. The physically evident fact that the system cannot be critical if n < 1, that is, if less than one neutron is generated for each one absorbed, is reflected in the fact that the magnitude, and consequently the real part, of the last term in the denominator is positive for Re s > 0; the denominator therefore cannot be zero in this case.

In any case, the last term in the denominator is the only one which can give rise to a negative real part, and it is largest when s is real. Therefore, the root with the largest real part has imaginary part zero. Thus, to look for the root which dominates the behavior at large t, we must look for a real zero of the denominator. To have a root with positive real part, it is clearly necessary that n be sufficiently large and/or $\beta^2$ sufficiently small. Now there is a root exactly at s = 0 when

$$1 + \beta^2L^2 = n\,e^{-\beta^2\theta_0} \tag{6.145}$$

This condition marks the onset of criticality. It may be written alternatively

$$n = (1 + \beta^2L^2)\,e^{\beta^2\theta_0} \tag{6.146}$$


A given system and mode becomes critical at this value of n, and is subcritical for smaller n. Alternatively, for fixed n, criticality can ultimately be reached by decreasing $\beta$, that is, by increasing the size of the system so as to decrease the loss of neutrons over the surface.

Since the right-hand side is a monotonically increasing function of $\beta$, criticality is first reached for the minimum possible value of $\beta$. Thus, it is the lowest mode that most readily becomes critical. Even at the point at which this mode has become critical, the others are highly transient. Thus the density distribution is that of the lowest mode.
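The criticality condition $n = (1+\beta^2L^2)\,e^{\beta^2\theta_0}$ is easy to explore numerically. The sketch below evaluates the critical multiplication for a given $\beta^2$ and inverts the relation by bisection, which is safe because the right-hand side increases monotonically with $\beta^2$. The conversion to a radius assumes a bare sphere whose lowest mode has $\beta = \pi/a$; the function names and the sample numbers are illustrative only.

```python
import math


def critical_multiplication(beta_sq, L, theta0):
    """Multiplication n at which the mode with eigenvalue beta^2 just becomes critical."""
    return (1.0 + beta_sq * L * L) * math.exp(beta_sq * theta0)


def critical_beta_sq(n, L, theta0, tol=1e-12):
    """Solve n = (1 + b L^2) exp(b theta0) for b = beta^2 by bisection (requires n > 1)."""
    if n <= 1.0:
        raise ValueError("no critical size exists for n <= 1")
    lo, hi = 0.0, 1.0
    while critical_multiplication(hi, L, theta0) < n:
        hi *= 2.0  # expand the bracket until it contains the root
    while hi - lo > tol * max(1.0, hi):
        mid = 0.5 * (lo + hi)
        if critical_multiplication(mid, L, theta0) < n:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def critical_sphere_radius(n, L, theta0):
    """Critical radius of a bare sphere, assuming the lowest mode beta = pi / a."""
    return math.pi / math.sqrt(critical_beta_sq(n, L, theta0))


if __name__ == "__main__":
    # Illustrative values in arbitrary length units: L = 1, theta0 = 1, n = 2.
    print(critical_beta_sq(2.0, 1.0, 1.0), critical_sphere_radius(2.0, 1.0, 1.0))
```

A larger n gives a larger critical $\beta^2$ and hence a smaller critical radius, which is the trade-off between multiplication and leakage described above.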

It is possible to calculate explicitly the decay or growth rate of the density close to critical conditions. One merely expands the denominator for small s and calculates approximately the root. After some straightforward algebra, we obtain

$$s_0 = \frac{n\,e^{-\beta^2\theta_0} - (1+\beta^2L^2)}{\tau + e^{-\beta^2\theta_0}\left[(n-n_0)T + n\,\xi(\theta_0)\right]} \approx \frac{n\,e^{-\beta^2\theta_0} - (1+\beta^2L^2)}{\tau + (1+\beta^2L^2)\left[(1-(n_0/n))T + \xi(\theta_0)\right]} \tag{6.147}$$

Since the density evolves in time as $e^{s_0t}$, we see that the rate is determined primarily by the longest of three times:

(i) the thermal neutron capture lifetime

(ii) the slowing-down time

(iii) the mean delay time for emission of fission neutrons after capture [for a number of delay times it may be shown that the term $(1-(n_0/n))T$ should be replaced by

$$\frac{1}{n}\sum_i \delta n_i\,T_i$$

the $T_i$’s being the delay times and the $\delta n_i$’s being the contributions to n from each group of neutrons]. Since the numerator is small, the “reaction time” $1/s_0$ is in fact large compared with all the aforementioned times.
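The small-s estimate of the growth rate can be compared with a direct numerical root of the denominator of (6.144). The sketch below is written for a slightly supercritical system only, and all numbers in it are hypothetical illustrations rather than data from the text: a mode with $\beta^2L^2 = 0.5$ and $\beta^2\theta_0 = 0.3$, a multiplication 0.01 percent above critical, a delayed fraction of 0.7 percent with $T = 10$, $\tau = 10^{-3}$ and slowing-down time $\xi(\theta_0) = 10^{-5}$ (the argument `xi0`). The relation $D\beta^2 = \beta^2L^2/\tau$ is used to eliminate D.

```python
import math


def denominator(s, n, n0, tau, T, xi0, beta_sq, L, theta0):
    """Denominator of the transformed density, using D * beta^2 = beta^2 L^2 / tau."""
    fission = (n + n0 * s * T) / (1.0 + s * T)
    return (s + (1.0 + beta_sq * L * L) / tau
            - fission * math.exp(-beta_sq * theta0 - s * xi0) / tau)


def growth_rate_approx(n, n0, tau, T, xi0, beta_sq, L, theta0):
    """Root of the denominator after expanding it to first order in s."""
    leak = math.exp(-beta_sq * theta0)
    return (n * leak - (1.0 + beta_sq * L * L)) / (tau + leak * ((n - n0) * T + n * xi0))


def growth_rate_exact(n, n0, tau, T, xi0, beta_sq, L, theta0):
    """Real positive root of the full denominator, found by bisection."""
    args = (n, n0, tau, T, xi0, beta_sq, L, theta0)
    lo = 0.0
    if denominator(lo, *args) >= 0.0:
        raise ValueError("this sketch handles only supercritical parameters")
    hi = 2.0 * growth_rate_approx(*args)
    while denominator(hi, *args) < 0.0:
        hi *= 2.0  # expand until the sign changes
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if denominator(mid, *args) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def sample_parameters():
    """Hypothetical parameter set, 0.01 percent above critical."""
    beta_sq, L, theta0 = 0.5, 1.0, 0.6
    n = 1.0001 * (1.0 + beta_sq * L * L) * math.exp(beta_sq * theta0)
    return dict(n=n, n0=0.993 * n, tau=1e-3, T=10.0, xi0=1e-5,
                beta_sq=beta_sq, L=L, theta0=theta0)


if __name__ == "__main__":
    p = sample_parameters()
    print(growth_rate_approx(**p), growth_rate_exact(**p))
```

For these sample values the two rates agree to within a few percent, and the reaction time $1/s_0$ is far longer than $\tau$, $T$ or the slowing-down time, as stated above.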

A model of a chain reactor proceeding through fission produced by capture of fast neutrons may be generated by taking $\tau = 0$ and replacing $\xi(\theta_0)$ by the mean time for capture of a fast neutron, in Eq. (6.147). We do not pursue this problem except to remark that, since the slowing-down time $\xi(\theta_0)$ is in such cases very short, the reaction time of the system is dominated by the delayed neutrons. Were this not the case, such a system would react too quickly for an automatic control system (e.g., through introduction of a capturing impurity) to work. The delayed neutrons, however, introduce enough “sluggishness” into the system to enable it to be kept under control.

It should be noted that the criticality conditions can be deduced from the steady-state equations. For the steady state is that which marks the division between subcritical and supercritical. Therefore the condition of self-consistency between (6.130) and (6.131) in the case of the “lowest” mode (that is, the mode with smallest $\beta$), determines the relation between multiplication constant n and dimensions for the onset of criticality. For larger n or larger dimensions, the system becomes supercritical.


14. Solution of the Steady-State Equations


We have already discussed the calculation of Green’s functions for 
the thermal neutron equation (6.130). Since the fast neutron equation 
(6.131) is identical with the heat conduction equation, Green’s function 
calculations for (6.131) may be written down directly by analogy with 
those calculated for that problem. 

Consider, as an example, a spherical (and spherically symmetric) system of radius a. Using (6.110), $\rho$ may be written in terms of $\chi$ from (6.130):

$$\rho(r) = \frac{L}{8\pi Dr}\int_0^a \chi(r',\theta_0)\sum_{n=-\infty}^{\infty}\left[e^{-|r-r'+2na|/L} - e^{-|r+r'+2na|/L}\right] 4\pi r'\,dr' \tag{6.148}$$


This could then be substituted in (6.131) to yield an integro-differential equation for $\chi$. Similarly, the corresponding Green’s function for (6.131) could be calculated, and the result substituted in (6.130) to give an integro-differential equation for $\rho$. Neither procedure, however, leads to easily manageable equations. But if the Fourier series forms of the Green’s functions are used, the problem becomes quite tractable. It becomes equivalent, however, to a calculation in which both $\rho$ and $\chi$ are expanded in appropriate Fourier series and the coupled equations (6.130) and (6.131) are used to obtain coupled equations in the coefficients. Since they are homogeneous equations, nonzero solutions are only possible when there is a quite specific relation between n and the dimensional parameters. Although a “steady” solution is possible for densities with the distribution of each Fourier component, those for the higher Fourier components will, as seen in the previous section, correspond to such a combination of high n and large dimensions that all lower components would be extremely unstable.
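The problem is in fact then one of obtaining the lowest $\beta$ such that a
solution of

$$\nabla^2 f + \beta^2 f = 0 \tag{6.149}$$

satisfies the boundary conditions of the problem, viz., $f$ finite at $r = 0$ and
$f = 0$ at $r = a$. The solution is clearly

$$f = \frac{1}{r} \sin\frac{\pi r}{a} \tag{6.150}$$

with

$$\beta^2 = \left(\frac{\pi}{a}\right)^2 \tag{6.151}$$

Putting

$$\rho = \rho_0 f \tag{6.152}$$

and

$$\chi = \chi_0 f \tag{6.153}$$

we obtain for $\rho_0$ and $\chi_0$ the equations

$$\left(\beta^2 + \frac{1}{L^2}\right)\rho_0 = \frac{\tau}{L^2}\,\chi_0(\theta_0) \tag{6.154}$$

and

$$\frac{\partial \chi_0}{\partial \theta} + \beta^2 \chi_0 = \frac{\eta}{\tau}\,\rho_0\,\delta(\theta) \tag{6.155}$$

The solution of the second is

$$\chi_0 = \frac{\eta}{\tau}\,\rho_0\, e^{-\beta^2\theta} \tag{6.156}$$

Hence from (6.154) the condition that $\rho_0$ be nonzero (that is, the condition
for a stable steady-state solution) is

$$\beta^2 + \frac{1}{L^2} = \frac{\tau}{L^2}\,\frac{\eta}{\tau}\, e^{-\beta^2\theta_0} = \frac{\eta}{L^2}\, e^{-\beta^2\theta_0}$$

or

$$\eta = \left(1 + \beta^2 L^2\right) e^{\beta^2\theta_0} = \left(1 + \frac{\pi^2 L^2}{a^2}\right) e^{\pi^2\theta_0/a^2} \tag{6.157}$$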
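As a rough numerical illustration of the criticality condition, the sketch below assumes it has the form $\eta = (1 + \pi^2L^2/a^2)\,e^{\pi^2\theta_0/a^2}$ and solves it for the radius $a$ by bisection. The function names, bracketing interval, and the sample values are illustrative assumptions, not part of the text.

```python
import math

def critical_eta(a, L, theta0):
    """Multiplication factor needed for a bare sphere of radius a to be critical,
    assuming eta = (1 + beta^2 L^2) exp(beta^2 theta0) with beta = pi / a."""
    b2 = (math.pi / a) ** 2
    exponent = b2 * theta0
    if exponent > 700.0:          # avoid overflow in exp for very small radii
        return math.inf
    return (1.0 + b2 * L * L) * math.exp(exponent)

def critical_radius(eta, L, theta0, a_lo=1e-3, a_hi=1e6, tol=1e-10):
    """Solve critical_eta(a) = eta for a by bisection; requires eta > 1."""
    if eta <= 1.0:
        raise ValueError("no finite critical radius unless eta > 1")
    lo, hi = a_lo, a_hi
    # critical_eta decreases monotonically toward 1 as the radius grows
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if critical_eta(mid, L, theta0) > eta:
            lo = mid              # sphere too small to be critical at this eta
        else:
            hi = mid
        if hi - lo < tol * hi:
            break
    return 0.5 * (lo + hi)

if __name__ == "__main__":
    # illustrative values in arbitrary length units
    print(critical_radius(1.5, L=1.0, theta0=1.0))
```

A larger multiplication factor gives a smaller critical radius, as the monotonic dependence on $a$ suggests.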




15. Diffusion Length, Slowing-Down Length, and Migration Length


Some physical insight into the process of neutron diffusion may be 
obtained by considering point sources of neutrons and observing the 
distributions which result. This is, of course, what is given by the Green’s 
functions! We pay particular attention to the extent of the distribution set 
up by a source, which gives us information about the range of the neutrons
in the material. 

For this purpose, we consider the nonmultiplicative case ($\eta = 0$).
The multiplicative term in (6.131), being itself a source term, is suppressed 
to enable us to see the effect of a single source. 

We also consider, in the first instance, infinite media, so that the loss 
of neutrons over boundaries may be ignored. 

We note that, from the Fourier transform of the density, we may
derive the spatial moments of the distribution, and in particular the
second moment, which gives the mean square distance of neutrons from
the source. If we take a density $f(\mathbf{r})$ due to a source at the origin, $(2\pi)^3$
times the Fourier transform is

$$F_0(\mathbf{k}) = \int f(\mathbf{r})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r \tag{6.158}$$

Then

$$F_0(0) = \int f(\mathbf{r})\, d^3r = M_0 \tag{6.159}$$

is the integrated density and

$$\left[\, i\nabla_{\mathbf{k}} F_0 \,\right]_{\mathbf{k}=0} = \int \mathbf{r}\, f(\mathbf{r})\, d^3r = \mathbf{M}_1 \tag{6.160}$$

$(1/M_0)\mathbf{M}_1$ then gives the mean displacement from the source. Also

$$-\left[\, \nabla_{\mathbf{k}}^2 F_0 \,\right]_{\mathbf{k}=0} = \int r^2 f(\mathbf{r})\, d^3r = M_2 \tag{6.161}$$

where $(M_2/M_0)$ is the mean square displacement.

If $f(\mathbf{r})$ is spherically symmetrical, it is easily seen by doing the
angular integration in (6.158) with $\mathbf{k}$ as axis that $F_0(\mathbf{k})$ is a function of
$k = |\mathbf{k}|$ only. In this case $\mathbf{M}_1 = 0$. Also, since $F_0 = M_0 - \tfrac{1}{6} M_2 k^2 + \cdots$,
$M_2$ is $-6$ times the coefficient of $k^2$ in the Taylor's series expansion in $k$
of $F_0 = \int f(\mathbf{r})\, e^{-i\mathbf{k}\cdot\mathbf{r}}\, d^3r$.
$\mu^2 = M_2/M_0$ is the mean square radius of the density distribution.

Consider now a point source of thermal neutrons at the origin, for
which the density $\rho$ is determined by (6.99)

$$\rho - L^2\nabla^2\rho = \tau\,\delta(\mathbf{r}) \tag{6.162}$$

Taking Fourier transforms, and letting the transform of $\rho$ be $\bar\rho(k)$, this
yields

$$\bar\rho(k) = \frac{\tau}{1 + k^2L^2} \tag{6.163}$$


Then

$$M_0 = \tau \tag{6.164}$$

$$M_2 = 6L^2\tau \tag{6.165}$$

so that

$$(\mu^{t})^2 = 6L^2 \tag{6.166}$$

The superscript $t$ on $\mu$ designates thermal neutrons.
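Consider next fast neutrons, with density given by

$$\frac{\partial\chi}{\partial\theta} - \nabla^2\chi = \delta(\theta)\,\delta(\mathbf{r}) \tag{6.167}$$

The transform $\bar\chi$ is given by

$$\frac{\partial\bar\chi}{\partial\theta} + k^2\bar\chi = \delta(\theta)$$

which has the solution

$$\bar\chi = e^{-k^2\theta} \tag{6.168}$$

In this case

$$M_0 = 1, \qquad M_2 = 6\theta \tag{6.169}$$

and

$$(\mu^{f})^2 = 6\theta \tag{6.170}$$

Thus $6\theta$ is the mean square distance traveled by fast neutrons in
reaching the symbolic age $\theta$. In particular

$$(\mu^{f})^2 = 6\theta_0 = 6\lambda_0^2 \tag{6.171}$$

is the mean square distance traveled in becoming thermalized; the length
$\lambda_0$ defined by (6.171) is designated the "slowing-down" length.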

We may finally determine the mean square distance traveled by a
neutron from its birth at symbolic age $\theta = 0$ to its capture as a thermal
neutron. For this purpose we use the coupled equations (6.167) and
(6.130). From the first the transform $\bar\chi$ is given by (6.168). The transform
of (6.130) then determines $\bar\rho$:

$$\left(k^2 + \frac{1}{L^2}\right)\bar\rho = \frac{\tau}{L^2}\,\bar\chi(\theta_0) = \frac{\tau}{L^2}\, e^{-k^2\theta_0}$$

so that

$$\bar\rho = \frac{\tau\, e^{-k^2\theta_0}}{1 + k^2L^2} \tag{6.172}$$

Thus, considering the distribution of thermal neutrons resulting from a
point source of fast neutrons at the origin,

$$M_0 = \tau$$

$$M_2 = 6(\theta_0 + L^2)\tau = 6(\lambda_0^2 + L^2)\tau$$

and

$$\mu^2 = 6(\lambda_0^2 + L^2) \tag{6.173}$$

The quantity $\sqrt{6(\lambda_0^2 + L^2)}$ is known as the "migration length." That it is
the squares of the lengths and not the lengths themselves which are
additive follows naturally from the incoherence of the two processes; on
the average, the two displacements are orthogonal.
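The moment relations above can be checked numerically. The sketch below assumes the transforms $\tau/(1+k^2L^2)$, $e^{-k^2\theta_0}$, and $\tau e^{-k^2\theta_0}/(1+k^2L^2)$ for the thermal, fast, and combined cases, and estimates $M_2/M_0$ from the small-$k$ behavior $F_0 \approx M_0 - \tfrac{1}{6}M_2k^2$ by a finite difference. The helper name and the sample values of $L$, $\theta_0$, $\tau$ are illustrative assumptions.

```python
import math

def mean_square_radius(transform, h=1e-3):
    """Estimate M2/M0 for a spherically symmetric density from its transform F(k),
    using F(k) ~ M0 - (M2/6) k^2 for small k."""
    m0 = transform(0.0)
    m2 = -6.0 * (transform(h) - m0) / (h * h)
    return m2 / m0

# illustrative values only (arbitrary units)
L, theta0, tau = 2.0, 3.0, 1.0

def thermal(k):
    # point source of thermal neutrons
    return tau / (1.0 + (k * L) ** 2)

def fast(k):
    # fast neutrons at symbolic age theta0
    return math.exp(-k * k * theta0)

def migration(k):
    # thermal neutrons produced by a point source of fast neutrons
    return tau * math.exp(-k * k * theta0) / (1.0 + (k * L) ** 2)

if __name__ == "__main__":
    print(mean_square_radius(thermal), 6 * L ** 2)
    print(mean_square_radius(fast), 6 * theta0)
    print(mean_square_radius(migration), 6 * (theta0 + L ** 2))
```

The third line illustrates the additivity of the squared lengths: the combined mean square radius is the sum of the thermal and fast values.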

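Pursuing this line of investigation, one other problem is of some
interest. It is that of the mean square extent of the distribution from a
point source in a multiplying medium. This has to do, not with the distance
travelled by the original neutron, but with that travelled by it and its
descendants. The relevant equations are (6.130) and (6.131), the latter
supplemented by a source term $\delta(\mathbf{r})\,\delta(\theta)$. Taking transforms of the two
equations gives

$$\left(k^2 + \frac{1}{L^2}\right)\bar\rho = \frac{\tau}{L^2}\,\bar\chi(\theta_0) \tag{6.174}$$

and

$$\frac{\partial\bar\chi}{\partial\theta} + k^2\bar\chi = \left(\frac{\eta}{\tau}\,\bar\rho + 1\right)\delta(\theta) \tag{6.175}$$

The solution of the second is

$$\bar\chi = \left(1 + \frac{\eta}{\tau}\,\bar\rho\right) e^{-k^2\theta} \tag{6.176}$$

Substituting this in the first gives

$$\bar\rho = \frac{\tau\, e^{-k^2\theta_0}}{1 + k^2L^2 - \eta\, e^{-k^2\theta_0}} \tag{6.177}$$

It then follows that

$$M_0 = \frac{\tau}{1-\eta} \qquad \text{and} \qquad M_2 = \frac{6\tau(\lambda_0^2 + L^2)}{(1-\eta)^2}$$

so that

$$\mu^2 = \frac{6(\lambda_0^2 + L^2)}{1-\eta} \tag{6.178}$$

We see, then, that the total number of neutrons tends to grow indefinitely
as $\eta \to 1$, and that the modified migration length also becomes indefinitely
great.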
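The step from the transform to the moments can be made explicit. The following worked expansion is a sketch that assumes the transform of the thermal density has the form $\bar\rho = \tau e^{-k^2\theta_0}/(1 + k^2L^2 - \eta e^{-k^2\theta_0})$, with $\theta_0 = \lambda_0^2$, and keeps terms through order $k^2$ only:

```latex
\begin{aligned}
\bar\rho(k)
  &\approx \frac{\tau\,(1 - k^2\theta_0)}{(1-\eta) + k^2\,(L^2 + \eta\,\theta_0)} \\
  &\approx \frac{\tau}{1-\eta}\left[\,1 - k^2\theta_0 - k^2\,\frac{L^2 + \eta\,\theta_0}{1-\eta}\right]
   = \frac{\tau}{1-\eta}\left[\,1 - k^2\,\frac{\theta_0 + L^2}{1-\eta}\right], \\[4pt]
M_0 &= \frac{\tau}{1-\eta}, \qquad
M_2 = -6 \times \left(-\frac{\tau\,(\theta_0 + L^2)}{(1-\eta)^2}\right)
     = \frac{6\,\tau\,(\lambda_0^2 + L^2)}{(1-\eta)^2}, \qquad
\frac{M_2}{M_0} = \frac{6\,(\lambda_0^2 + L^2)}{1-\eta}.
\end{aligned}
```

Both $M_0$ and $M_2/M_0$ diverge as $\eta \to 1$, which is the behavior described in the text.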


Problem 6-18: Show that the distributions of fast
and thermal neutrons from a monoenergetic source of strength $\sigma_0$ at the
center of a sphere of radius $a$ are

$$\chi = \frac{\sigma_0}{2a^2 r} \sum_{l=1}^{\infty} l\, \sin\frac{l\pi r}{a}\; e^{-l^2\pi^2\theta/a^2}$$

and

$$\rho = \frac{\sigma_0 \tau}{2a^2 r} \sum_{l=1}^{\infty} \frac{l}{1 + (l^2\pi^2L^2/a^2)}\; e^{-l^2\pi^2\theta_0/a^2} \sin\frac{l\pi r}{a}$$

Use Poisson's formula to find series solutions which converge rapidly
for $a \gg L$.


Problem 6-19: Show that the critical condition for a
spherical reactor of radius $a$ with a hole of radius $b$ at its center is

$$\eta = \left(1 + \frac{x^2L^2}{a^2}\right) e^{x^2\theta_0/a^2}$$

where $x$ satisfies $\tan x(1 - b/a) = -xb/a$. Hence show that, for a definite
$\eta$, it requires a greater amount of material for criticality if there is a hole
in the material.


Problem 6-20: For a given multiplication factor, 
calculate critical conditions for a sphere, a cylinder, and a rectangular 
parallelepiped. In the latter case, determine the relative dimensions so 
that criticality can be obtained with the minimum volume of material. 
Compare the volumes needed for the various shapes. Show that, for
sphere, cylinder, and cube, respectively, they are in the ratio
$1 : (9\sqrt{3}\,\alpha^2/8\pi^2) : (9\sqrt{3}/4\pi)$, $\alpha$ being the lowest zero of the zero-order Bessel function.


Problem 6-21: A monoenergetic point source is 
placed at the center of a nonmultiplying cylindrical medium of radius R 
and height h. Find the densities of fast and thermal neutrons as func- 
tions of position. 


16. Electromagnetic Waves in a Dissipative Medium


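The problem of electromagnetic fields in strongly conducting media
leads to equations which are rather similar in mathematical form to
those of heat conduction or diffusion. Consider Maxwell's equations in
a conducting medium, and with external charge and current densities
$\rho_0$ and $\mathbf{j}_0$:

$$\nabla\cdot\mathbf{E} = \frac{1}{\epsilon}\,\rho_0 \tag{3.14}$$

$$\nabla\cdot\mathbf{H} = 0 \tag{3.15}$$

$$\nabla\times\mathbf{E} = -\mu\,\frac{\partial\mathbf{H}}{\partial t} \tag{3.16}$$

and

$$\nabla\times\mathbf{H} = \sigma\mathbf{E} + \epsilon\,\frac{\partial\mathbf{E}}{\partial t} + \mathbf{j}_0 \tag{3.17}$$

Using (3.15) and (3.16) to introduce potentials, as in (5.276) and (5.278),
we derive the equations

$$-\nabla^2\phi - \nabla\cdot\frac{\partial\mathbf{A}}{\partial t} = \frac{\rho_0}{\epsilon} \tag{6.179}$$

and

$$\nabla(\nabla\cdot\mathbf{A}) - \nabla^2\mathbf{A} = -\sigma\mu\left(\frac{\partial\mathbf{A}}{\partial t} + \nabla\phi\right) - \epsilon\mu\left(\frac{\partial^2\mathbf{A}}{\partial t^2} + \nabla\frac{\partial\phi}{\partial t}\right) + \mu\,\mathbf{j}_0 \tag{6.180}$$

If we take the Lorentz condition in the modified form

$$\nabla\cdot\mathbf{A} + \sigma\mu\,\phi + \epsilon\mu\,\frac{\partial\phi}{\partial t} = 0 \tag{6.181}$$

we find that the equations for the potentials are

$$-\nabla^2\phi + \sigma\mu\,\frac{\partial\phi}{\partial t} + \epsilon\mu\,\frac{\partial^2\phi}{\partial t^2} = \frac{\rho_0}{\epsilon} \tag{6.182}$$

and

$$-\nabla^2\mathbf{A} + \sigma\mu\,\frac{\partial\mathbf{A}}{\partial t} + \epsilon\mu\,\frac{\partial^2\mathbf{A}}{\partial t^2} = \mu\,\mathbf{j}_0 \tag{6.183}$$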

If the conductivity is large enough that the third term is negligible com- 
pared with the second, we have equations of the diffusion type. If, then, 
for example, a charge is introduced at some point in the medium, it 
dissipates in much the same way as heat does in a thermally conducting 
medium. 

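More precisely, however, the problem stands intermediate between
one of wave propagation and one of diffusion (or dissipation). To get a
better understanding of the situation, let us solve for the Green's function
defined by

$$-\nabla^2 G + \sigma\mu\,\frac{\partial G}{\partial t} + \epsilon\mu\,\frac{\partial^2 G}{\partial t^2} = \delta(\mathbf{r})\,\delta(t) \tag{6.184}$$

Let us take Fourier transforms in both space and time variables, designating
the double transform $G_{k\omega}(\mathbf{k}, \omega)$ as

$$G_{k\omega}(\mathbf{k}, \omega) = \left(\frac{1}{2\pi}\right)^4 \int G(\mathbf{r}, t)\, e^{-i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\, d^3r\, dt \tag{6.185}$$

The inverse transform is then

$$G(\mathbf{r}, t) = \int G_{k\omega}(\mathbf{k}, \omega)\, e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\, d^3k\, d\omega \tag{6.186}$$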
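The equation for the double transform is

$$\left[k^2 - i\sigma\mu\omega - \epsilon\mu\omega^2\right] G_{k\omega} = \left(\frac{1}{2\pi}\right)^4 \tag{6.187}$$

Let us first invert only with respect to the space coordinates, to determine

$$G(\mathbf{r}, \omega) = \int G_{k\omega}(\mathbf{k}, \omega)\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k = \left(\frac{1}{2\pi}\right)^4 \int \frac{1}{k^2 - \sigma\mu i\omega - \epsilon\mu\omega^2}\, e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{6.188}$$

$$= \frac{1}{4\pi^3} \int_0^\infty \frac{1}{k^2 - \sigma\mu i\omega - \epsilon\mu\omega^2}\, \frac{\sin kr}{kr}\, k^2\, dk = \frac{1}{8\pi^3 r} \int_{-\infty}^{\infty} \frac{k \sin kr}{k^2 - k_0^2}\, dk \tag{6.189}$$

where

$$k_0^2 = \epsilon\mu\omega^2 + i\sigma\mu\omega \tag{6.190}$$

Writing $\sin kr$ as the sum of complex exponentials of positive and negative
exponent, the integral may be evaluated as a contour integral to give

$$G(\mathbf{r}, \omega) = \frac{1}{8\pi^2 r}\, e^{ik_0 r} \tag{6.191}$$

Taking the inverse transform in the time variable

$$G(\mathbf{r}, t) = \frac{1}{8\pi^2 r} \int_{-\infty}^{\infty} e^{ir\sqrt{\epsilon\mu\omega^2 + i\sigma\mu\omega}}\; e^{-i\omega t}\, d\omega \tag{6.192}$$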

For the limiting case of zero conductivity, we obtain again the result of
(5.93) in the preceding chapter,

$$G(\mathbf{r}, t) = \frac{1}{4\pi r}\, \delta\!\left(t - \frac{r}{c}\right)$$

In the limiting case $\epsilon = 0$ the equation is seen, from (6.184), to be
identical with that of a diffusion problem. While $\epsilon = 0$ is not physically
realistic, this limit does give an approximation to a very strongly conducting
medium. The corresponding Green's function is

$$G(\mathbf{r}, t) = \frac{1}{8\pi^2 r} \int_{-\infty}^{\infty} e^{ir\sqrt{i\sigma\mu\omega}}\; e^{-i\omega t}\, d\omega$$

If we make the substitution

$$i\omega = \xi^2 \tag{6.193}$$

this becomes

$$G(\mathbf{r}, t) = -\frac{i}{4\pi^2 r} \int e^{ir\sqrt{\sigma\mu}\,\xi}\; e^{-\xi^2 t}\; \xi\, d\xi \tag{6.194}$$



The contour is that shown in Fig. 6.3. Since the integrals over arcs such
as $C_1$ and $C_2$, with radius becoming infinite, are zero, this contour may,
however, be deformed to one along the real axis. The integral is then
readily evaluated [see (6.24) and (6.25)] to give

$$G(\mathbf{r}, t) = \frac{1}{\sigma\mu} \left(\frac{\sigma\mu}{4\pi t}\right)^{3/2} e^{-\sigma\mu r^2/4t} \tag{6.195}$$
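A quick consistency check on the diffusion-limit Green's function is possible: integrating the defining equation with $\epsilon = 0$ over all space shows that $\int G\, d^3r = 1/\sigma\mu$ for $t > 0$. The sketch below assumes the form $G = (1/\sigma\mu)(\sigma\mu/4\pi t)^{3/2} e^{-\sigma\mu r^2/4t}$ and verifies the normalization by a midpoint-rule radial integration. The function names and sample values are illustrative assumptions.

```python
import math

def g_diffusion(r, t, sigma_mu):
    """Diffusion-limit Green's function, with sigma_mu standing for sigma * mu."""
    return (1.0 / sigma_mu) * (sigma_mu / (4.0 * math.pi * t)) ** 1.5 \
        * math.exp(-sigma_mu * r * r / (4.0 * t))

def space_integral(t, sigma_mu, n=20000):
    """Integrate G over all space using the midpoint rule in the radial variable."""
    r_max = 12.0 * math.sqrt(2.0 * t / sigma_mu)   # far into the Gaussian tail
    dr = r_max / n
    total = 0.0
    for i in range(n):
        r = (i + 0.5) * dr
        total += g_diffusion(r, t, sigma_mu) * 4.0 * math.pi * r * r * dr
    return total

if __name__ == "__main__":
    t, sigma_mu = 0.5, 3.0          # illustrative values only
    print(space_integral(t, sigma_mu), 1.0 / sigma_mu)
```

The result is independent of $t$: the disturbance spreads but its integrated strength is conserved in this limit.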


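It is also possible to derive explicitly the general form of $G$, starting
from (6.192). Putting

$$\Omega = \frac{\sigma}{2\epsilon} \tag{6.196}$$

the Green's function may be written

$$G(\mathbf{r}, t) = \frac{1}{8\pi^2 r} \int_{-\infty}^{\infty} e^{i(r/c)\sqrt{\omega^2 + 2i\Omega\omega}}\; e^{-i\omega t}\, d\omega \tag{6.197}$$

Let us first make the substitution

$$\omega + i\Omega = \frac{\Omega}{2}\left(v - \frac{1}{v}\right) \tag{6.198}$$

The path of integration in the $v$ plane follows a curved path above the
real axis from $2i - \infty$ to $2i + \infty$ and (6.197) becomes

$$G(\mathbf{r}, t) = \frac{\Omega\, e^{-\Omega t}}{16\pi^2 r} \int_{2i-\infty}^{2i+\infty} \exp\left\{\frac{i\Omega}{2}\left[\frac{r}{c}\left(v + \frac{1}{v}\right) - t\left(v - \frac{1}{v}\right)\right]\right\} \left(1 + \frac{1}{v^2}\right) dv$$

$$= -\frac{ic\, e^{-\Omega t}}{8\pi^2 r}\, \frac{\partial}{\partial r} \int_{2i-\infty}^{2i+\infty} \exp\left\{\frac{i\Omega}{2}\left[\left(\frac{r}{c} - t\right)v + \left(\frac{r}{c} + t\right)\frac{1}{v}\right]\right\} \frac{dv}{v} \tag{6.199}$$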


At this point we must distinguish the cases $(r/c) > t$ and $(r/c) < t$. In the
former case, the contour may be deformed around the upper half-plane,
being closed in the usual way by a semicircle of infinite radius. The
integral on this semicircle goes exponentially to zero. Since there are no
poles within the contour, the integral is zero. Thus

$$G(\mathbf{r}, t) = 0, \qquad r > ct \tag{6.200}$$

Figure 6.3

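Consider, then, the second case. It is convenient to make the
substitution

$$z = v\,\sqrt{\frac{ct - r}{ct + r}} \tag{6.201}$$

which leads to the equation

$$G(\mathbf{r}, t) = -\frac{ic\, e^{-\Omega t}}{8\pi^2 r}\, \frac{\partial}{\partial r} \left[ S(ct - r) \int \exp\left\{-\frac{i\Omega}{2c}\sqrt{(ct - r)(ct + r)}\left(z - \frac{1}{z}\right)\right\} \frac{dz}{z} \right] \tag{6.202}$$

the integral running from $2i\sqrt{(ct-r)/(ct+r)} - \infty$ to $2i\sqrt{(ct-r)/(ct+r)} + \infty$,
where $S = 1$ for positive argument and zero for negative argument. The
contour may now be deformed around the lower half-plane. From Eq.
(4.390) we then obtain for $G(\mathbf{r}, t)$ the explicit form

$$G(\mathbf{r}, t) = -\frac{c\, e^{-\Omega t}}{4\pi r}\, \frac{\partial}{\partial r} \left[ J_0\!\left(i\Omega\sqrt{t^2 - r^2/c^2}\right) S(ct - r) \right]$$

$$= e^{-\Omega t} \left[ \frac{1}{4\pi r}\, \delta\!\left(t - \frac{r}{c}\right) - \frac{i\Omega}{4\pi c}\, \frac{J_1\!\left(i\Omega\sqrt{t^2 - r^2/c^2}\right)}{\sqrt{t^2 - r^2/c^2}}\, S(ct - r) \right] \tag{6.203}$$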

Problem 6-22: Verify that in the limit $\sigma \to 0$, Eq.
(6.203) reduces to (5.93) and in the limit $\epsilon \to 0$, to (6.195).


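Problem 6-23: Show that the field $\phi$ generated by a
source density $S_0(\mathbf{r}, t)$, whose value and time derivative are given at $t = 0$
and for which $\alpha(\partial\phi/\partial n) + \beta\phi$ is given on specified surfaces, is determined
by the equation

$$\phi(\mathbf{r}, t) = \int_0^t dt' \int d^3r'\, S_0(\mathbf{r}', t')\, G(\mathbf{r}, \mathbf{r}', t - t')$$

$$+\ \sigma\mu \int d^3r'\, G(\mathbf{r}, \mathbf{r}', t)\, \phi(\mathbf{r}', 0)$$

$$+\ \epsilon\mu \int d^3r' \left[ G(\mathbf{r}, \mathbf{r}', t) \left(\frac{\partial\phi}{\partial t'}\right)_{t'=0} + \frac{\partial G}{\partial t}(\mathbf{r}, \mathbf{r}', t)\, \phi(\mathbf{r}', 0) \right]$$

$$+\ \frac{1}{\alpha} \int_0^t dt' \oint dS' \left[ \alpha\, \frac{\partial\phi}{\partial n'}(\mathbf{r}', t') + \beta\, \phi(\mathbf{r}', t') \right] G(\mathbf{r}, \mathbf{r}', t - t')$$

where the Green's function satisfies the boundary condition

$$\alpha\, \frac{\partial G}{\partial n} + \beta G = 0$$

over the bounding surfaces.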




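Problem 6-24: Show, using the result of the previous
problem, that a field $\phi(\mathbf{r}, 0)$ in an infinite source-free medium becomes,
after a sufficiently long time $t$,

$$\phi(\mathbf{r}, t) = \left(\frac{\sigma\mu}{4\pi t}\right)^{3/2} \int \phi(\mathbf{r}', 0)\; e^{-\sigma\mu|\mathbf{r} - \mathbf{r}'|^2/4t}\, d^3r'$$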


17. Propagation into a Conducting 
Medium 


The Green’s function for the infinite medium is useful for an under- 
standing of the physical processes involved in a problem; it may also, 
with the aid of the method of images, serve as a basis for calculating 
Green’s functions for finite media. However, the method of eigenfunction 
expansions is, for problems of finite media, both more direct and more 
versatile. 

In many wave propagation problems, we are not concerned with the 
sources, and are interested only in solutions in the source-free region. We 
may then use the homogeneous versions of (6.182) and (6.183) with 
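right-hand sides equal to zero. It is then easily seen that the fields $\mathbf{E}$ and $\mathbf{B}$
themselves satisfy wave equations of the form

$$-\nabla^2\mathbf{E} + \sigma\mu\, \frac{\partial\mathbf{E}}{\partial t} + \epsilon\mu\, \frac{\partial^2\mathbf{E}}{\partial t^2} = 0 \tag{6.204}$$

or, using the notation of the previous section,

$$-\nabla^2\mathbf{E} + \frac{2\Omega}{c^2}\, \frac{\partial\mathbf{E}}{\partial t} + \frac{1}{c^2}\, \frac{\partial^2\mathbf{E}}{\partial t^2} = 0 \tag{6.205}$$

Furthermore, since $\mathbf{j} = \sigma\mathbf{E}$, the same equation is satisfied by the current.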

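Let us consider first the problem of a plane wave, impinging normally
on the plane surface $z = 0$ of a conducting medium lying in the region of
positive $z$. Let the circular frequency of the wave be $\omega$, so that in the
medium

$$\mathbf{E} = \mathbf{E}_0(z)\, e^{-i\omega t} \tag{6.206}$$

where $\mathbf{E}_0$ is a vector in the $x$-$y$ plane. Then, from (6.205)

$$\frac{d^2\mathbf{E}_0}{dz^2} + \frac{\omega^2 + 2i\omega\Omega}{c^2}\, \mathbf{E}_0 = 0 \tag{6.207}$$

The coefficient of $\mathbf{E}_0$ may be written

$$\frac{\omega^2 + 2i\omega\Omega}{c^2} = \frac{1}{\lambda^2}\, e^{2i\gamma} \tag{6.208}$$

where

$$\frac{1}{\lambda^2} = \frac{\omega\sqrt{\omega^2 + 4\Omega^2}}{c^2} \tag{6.209}$$

and

$$\tan 2\gamma = \frac{2\Omega}{\omega} \tag{6.210}$$

The solution of (6.207) which remains finite as $z \to \infty$ is

$$\mathbf{E}_0(z) = \mathbf{E}_0(0)\; e^{i\cos\gamma\,(z/\lambda)}\; e^{-\sin\gamma\,(z/\lambda)} \tag{6.211}$$

The wave then decays inside the medium with an attenuation length

$$\lambda_0 = \frac{\lambda}{\sin\gamma} \tag{6.212}$$

This length is also referred to as the "skin depth." By simple algebra it
may be shown to be explicitly

$$\lambda_0 = \frac{\sqrt{2}\, c}{\sqrt{\omega}}\; \frac{1}{\sqrt{\sqrt{\omega^2 + 4\Omega^2} - \omega}} \tag{6.213}$$

For frequency $\omega \ll \Omega$ this is approximately

$$\lambda_0 \approx \frac{c}{\sqrt{\omega\Omega}} \tag{6.214}$$

while for high frequency ($\omega \gg \Omega$) it becomes

$$\lambda_0 \approx \frac{c}{\Omega} \tag{6.215}$$

and is then independent of the frequency.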
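The two limiting forms of the skin depth can be compared with the general expression numerically. The sketch below assumes the explicit form $\lambda_0 = \sqrt{2}\,c / \sqrt{\omega\,(\sqrt{\omega^2 + 4\Omega^2} - \omega)}$ and, as a cross-check, the equivalent form $\lambda/\sin\gamma$ with $1/\lambda^2 = \omega\sqrt{\omega^2+4\Omega^2}/c^2$ and $\tan 2\gamma = 2\Omega/\omega$. The function names and sample values are illustrative assumptions.

```python
import math

def skin_depth(omega, Omega, c):
    """General attenuation length. The difference sqrt(omega^2 + 4 Omega^2) - omega
    is rewritten as 4 Omega^2 / (sqrt(...) + omega) to avoid cancellation."""
    root = math.sqrt(omega * omega + 4.0 * Omega * Omega)
    diff = 4.0 * Omega * Omega / (root + omega)
    return math.sqrt(2.0) * c / math.sqrt(omega * diff)

def skin_depth_from_angle(omega, Omega, c):
    """Same length computed as lambda / sin(gamma)."""
    lam = c / math.sqrt(omega * math.sqrt(omega * omega + 4.0 * Omega * Omega))
    gamma = 0.5 * math.atan2(2.0 * Omega, omega)
    return lam / math.sin(gamma)

if __name__ == "__main__":
    c, Omega = 1.0, 1.0                       # illustrative units
    for omega in (1e-4, 1.0, 1e4):
        print(omega, skin_depth(omega, Omega, c),
              c / math.sqrt(omega * Omega),   # low-frequency estimate
              c / Omega)                      # high-frequency estimate
```

At low frequency the general value approaches $c/\sqrt{\omega\Omega}$, and at high frequency it saturates at $c/\Omega$.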


Problem 6-25: An oscillating current of circular
frequency $\omega$ flows along a cylindrical conducting wire of radius $a$. Find
how the current is distributed over the wire; show, in particular, that if
$\omega \ll \Omega$ and $a$ is sufficiently large it decreases approximately exponentially
from the surface with a decay length (skin depth)

$$\frac{c}{\sqrt{\Omega\omega}} = \frac{1}{\sqrt{\tfrac{1}{2}\sigma\mu\omega}}$$


18. Skin Effect— General 
Considerations 


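Consider a conducting medium occupying a region surrounded by a
nonconducting one in which there is an arbitrary field. If $\psi$ is a potential
(either $\phi$ or a component of $\mathbf{A}$), or the component of a field, it satisfies the
equation

$$-\nabla^2\psi + \sigma\mu\, \frac{\partial\psi}{\partial t} + \epsilon\mu\, \frac{\partial^2\psi}{\partial t^2} = \sigma\mu\, \Lambda\, \frac{\partial\psi}{\partial t} \tag{6.216}$$

where $\Lambda = 1$ in the external region, zero in the conducting part. We may
now treat the right-hand side as a source term, and so write the equation
of the problem in a form involving Green's functions:

$$\psi(\mathbf{r}, t) = \sigma\mu \int_{-\infty}^{t} dt' \int \frac{\partial\psi(\mathbf{r}', t')}{\partial t'}\, \Lambda(\mathbf{r}')\, G(\mathbf{r} - \mathbf{r}', t - t')\, d^3r' \tag{6.217}$$

where $G$ is given by (6.203). The integral on the right is, however,
confined to the region outside the conductor.
Consider fields with circular frequency $\omega$, that is, of the form

$$\psi(\mathbf{r}, t) = e^{-i\omega t}\, \psi_0(\mathbf{r}) \tag{6.218}$$

Then

$$\psi_0(\mathbf{r}) = -i\omega\sigma\mu \int dt' \int \psi_0(\mathbf{r}')\, e^{i\omega(t - t')}\, G(\mathbf{r} - \mathbf{r}', t - t')\, \Lambda(\mathbf{r}')\, d^3r'$$

But the $t'$ integral is just $2\pi$ times $G(\mathbf{r} - \mathbf{r}', \omega)$ as given by (6.191). Thus

$$\psi_0(\mathbf{r}) = -\frac{i\omega\sigma\mu}{4\pi} \int \Lambda(\mathbf{r}')\, \psi_0(\mathbf{r}')\, \frac{e^{i\sqrt{\epsilon\mu\omega^2 + i\sigma\mu\omega}\,|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\, d^3r' \tag{6.219}$$

In the notation of (6.209) and (6.210), this becomes

$$\psi_0(\mathbf{r}) = -\frac{i\omega\sigma\mu}{4\pi} \int \Lambda(\mathbf{r}')\, \psi_0(\mathbf{r}')\, \frac{1}{|\mathbf{r} - \mathbf{r}'|}\, \exp\left(\frac{i\cos\gamma}{\lambda}\,|\mathbf{r} - \mathbf{r}'|\right) \exp\left(-\frac{\sin\gamma}{\lambda}\,|\mathbf{r} - \mathbf{r}'|\right) d^3r' \tag{6.220}$$

Thus, the field inside the conductor can be expressed in terms of that
outside, and we see that it decays into the conductor in general with the
attenuation length $\lambda_0 = (\lambda/\sin\gamma)$ given by (6.213).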

While the formulation of the problem as an integral equation which 
we have given here is not particularly useful for detailed calculation, it is 
clearly useful for obtaining an understanding of the physical phenomena 
involved. 


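Problem 6-26: Treating the conduction term as a
source, obtain the alternative integral equation for the problem

$$\psi(\mathbf{r}, t) = \chi(\mathbf{r}, t) - \frac{\sigma\mu}{4\pi} \int \left[\frac{\partial\psi(\mathbf{r}', t')}{\partial t'}\right]_{t' = t - (|\mathbf{r} - \mathbf{r}'|/c)} \times \frac{1 - \Lambda(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|}\, d^3r'$$

$\chi(\mathbf{r}, t)$ being a solution of the wave equation in a completely nonconducting
medium. For the case

$$\chi(\mathbf{r}, t) = \chi_0(\mathbf{r})\, e^{-i\omega t}$$

show that this becomes

$$\psi_0(\mathbf{r}) = \chi_0(\mathbf{r}) + \frac{i\omega\sigma\mu}{4\pi} \int \psi_0(\mathbf{r}')\, \frac{e^{i(\omega/c)|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\, \left[1 - \Lambda(\mathbf{r}')\right] d^3r'$$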

Problem 6-27: Consider a cylindrical wire of conductivity
$\sigma$ and radius $a$ in a nonconducting medium in which a plane
wave of circular frequency $\omega$ is propagated in the direction of the axis of
the wire. If $a$ is considerably smaller than the skin depth, show that $\psi_0(\mathbf{r})$
is given approximately by

$$\psi_0(\mathbf{r}) = \chi_0(\mathbf{r}) + \frac{i\omega\sigma\mu}{4\pi} \int \chi_0(\mathbf{r}')\, \frac{e^{i(\omega/c)|\mathbf{r} - \mathbf{r}'|}}{|\mathbf{r} - \mathbf{r}'|}\, \left[1 - \Lambda(\mathbf{r}')\right] d^3r'$$

Hence determine approximately the field of the wave perturbed by the
conducting wire.


19. The Equations of 
Superconducting Electrodynamics 


Other problems, rather similar to those encountered in the theory of 
diffusion of thermal neutrons, arise in connection with the propagation 
of electromagnetic waves in superconductors. The similarity extends, 
however, only to the treatment of stationary problems; the time de- 
pendence is quite different in the two cases. Basically, the problem is one 
of attenuated waves. We consider some of its aspects in the remainder of 
this chapter. 

In the original formulation of London, a superconductor was
characterized by a "supercurrent" proportional to the vector potential

$$\mathbf{J}_s = -\frac{1}{\mu\lambda^2}\, \mathbf{A} \tag{6.221}$$

$\lambda^2$ being a constant characteristic of the superconductor. Clearly, however,
such a hypothesis is not gauge invariant. The equation obtained by
taking the curl of both sides of the equation is, however,

$$\nabla\times\mathbf{J}_s = -\frac{1}{\mu\lambda^2}\, \mathbf{B} = -\frac{1}{\lambda^2}\, \mathbf{H} \tag{6.222}$$


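Gauge invariant results may then be obtained by using the curl of the
equation involving the current. The electrodynamic equation

$$\nabla\times\mathbf{H} = \mathbf{J} + \epsilon\, \frac{\partial\mathbf{E}}{\partial t} \tag{6.223}$$

takes, in the absence of external currents ("sources"), the form

$$\nabla\times\mathbf{H} = \sigma\mathbf{E} - \frac{1}{\mu\lambda^2}\, \mathbf{A} + \epsilon\, \frac{\partial\mathbf{E}}{\partial t}$$

Taking the curl and substituting for $\nabla\times\mathbf{E}$ from (3.16) yields

$$-\nabla^2\mathbf{H} = -\sigma\mu\, \frac{\partial\mathbf{H}}{\partial t} - \frac{1}{\lambda^2}\, \mathbf{H} - \epsilon\mu\, \frac{\partial^2\mathbf{H}}{\partial t^2} \tag{6.224}$$

For very many problems the conduction current $\sigma\mathbf{E}$ is negligible compared
with the supercurrent. For sufficiently high frequencies this is not
of course the case. However, we confine ourselves to problems in which
its neglect is justified, and deal only with the equation

$$-\nabla^2\mathbf{H} + \frac{1}{\lambda^2}\, \mathbf{H} + \epsilon\mu\, \frac{\partial^2\mathbf{H}}{\partial t^2} = 0 \tag{6.225}$$

A similar equation may be obtained for $\mathbf{A}$ in the radiation gauge, in
which (6.221) is valid, by expressing $\mathbf{H}$ and $\mathbf{E}$ in terms of the vector potential
in (3.17). The equation for $\mathbf{E}$ can also be obtained by taking the curl of
(3.16):

$$\nabla\times(\nabla\times\mathbf{E}) = -\mu\, \frac{\partial}{\partial t}\, \nabla\times\mathbf{H}$$

$$= \frac{1}{\lambda^2}\, \frac{\partial\mathbf{A}}{\partial t} - \epsilon\mu\, \frac{\partial^2\mathbf{E}}{\partial t^2}$$

$$= -\frac{1}{\lambda^2}\, \mathbf{E} - \frac{1}{c^2}\, \frac{\partial^2\mathbf{E}}{\partial t^2}$$

It is, of course, most convenient to work with the vector potential $\mathbf{A}$
in that it yields both the electric and magnetic fields, which are necessarily
correctly related to each other. Let us in fact calculate the Green's
function for $\mathbf{A}$. If we neglect the conduction current and assume an
external (source) current $\mathbf{j}_0$, the equation for $\mathbf{A}$ is

$$-\nabla^2\mathbf{A} + \frac{1}{\lambda^2}\, \mathbf{A} + \frac{1}{c^2}\, \frac{\partial^2\mathbf{A}}{\partial t^2} = \mu\, \mathbf{j}_0 \tag{6.226}$$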


Problem 6-28: At a large distance from a superconducting
sphere of radius $a$ there is a constant magnetic field $H_0$. Show
that the field as a function of position is given outside the superconductor
by $\mathbf{H} = -\nabla\psi$, where

$$\psi = -H_0\, r\cos\theta \left[ 1 + \frac{a^3}{2r^3} \left( 1 + \frac{3}{\beta^2a^2} - \frac{3}{\beta a}\coth\beta a \right) \right]$$

while inside it is $\mathbf{H} = \nabla\times\mathbf{A}$, the $\mathbf{A}$ being given by

$$A_\phi = \frac{3}{2}\, \frac{H_0\, a}{\sinh\beta a} \left( \frac{\cosh\beta r}{\beta r} - \frac{\sinh\beta r}{\beta^2r^2} \right) \sin\theta$$

where $\beta = 1/\lambda$ and $\theta$ the angle between $\mathbf{H}$ and $\mathbf{r}$.

Where is the largest field in the superconductor, and what is its
magnitude?

Problem 6-29: Determine the pattern of currents 
flowing in the superconductor in the previous problem. 


Problem 6-30: Show that an electromagnetic wave
of sufficiently high frequency impinging normally on the plane surface of a
London superconductor can be propagated into it. Calculate the transmission
probability through a superconducting slab of thickness $l \gg \lambda$.


Problem 6-31: Verify the self-consistency of a persistent
current in a London superconductor as follows:
(a) If there is a magnetic field $H_0$ in the $y$ direction at the surface
$z = 0$ of a semi-infinite superconductor, show that the field inside is given
by the vector potential


A, = — diye", A,=A,=9 
This then gives rise to a surface current distribution in the superconductor 


of ‘i 
— -*9 5-2/X 
Ss Ne e 


(b) Determine the resulting field outside the superconductor due to 
this current, using the relation 


and show that it has the value $H_0$ at the surface and is in the $y$ direction.


Problem 6-32: Solve the similar problem for a current
flowing in the skin depth along a cylindrical superconducting wire of
radius $R \gg \lambda$. Verify that the mechanism is as follows: An external field
$\mathbf{H}$, having value $H_0$ at the surface, penetrates into the wire, setting up a
current in the $z$ direction according to London's equation (6.221). This
current then produces an external field whose surface value is $H_0$.




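We can define the vector Green's function by the equation

$$-\nabla^2\mathbf{G} + \frac{1}{\lambda^2}\, \mathbf{G} + \frac{1}{c^2}\, \frac{\partial^2\mathbf{G}}{\partial t^2} = \mathbf{u}\, \delta(\mathbf{r} - \mathbf{r}')\, \delta(t - t') \tag{6.227}$$

$\mathbf{u}$ being an arbitrary unit vector.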

Some time after the work of London, it was pointed out by Pippard
that the London equation (6.221) should be replaced by a "nonlocal"
relation (that is, one in which the current at $\mathbf{r}$ should be determined not
only by the field at $\mathbf{r}$, but by that at neighboring points). To describe this
"coherence effect" he proposed for the supercurrent the relation

$$\mathbf{J}_s(\mathbf{r}) = C \int \frac{(\mathbf{r} - \mathbf{r}')\left[(\mathbf{r} - \mathbf{r}')\cdot\mathbf{A}(\mathbf{r}')\right]}{|\mathbf{r} - \mathbf{r}'|^4}\; e^{-\beta|\mathbf{r} - \mathbf{r}'|}\, d^3r' \tag{6.228}$$

where $\xi = (1/\beta)$ is known as the "coherence length." In this case
$(1/\lambda^2)\mathbf{A}$ in (6.226) must be replaced by $-\mu\mathbf{J}_s$ as defined. Equation (6.226)
and the Green's function equation (6.227) then become integro-differential
equations. The constant $C$ is to be determined by the condition that the
present form reduces to the previous one when one integrates over the
function multiplying $\mathbf{A}$.

The additional difficulties introduced by Pippard's hypothesis seem
less formidable if we take Fourier transforms in the Green's function
equation

$$\mathbf{G}(\mathbf{k}, \omega) = \left(\frac{1}{2\pi}\right)^4 \int \mathbf{G}(\mathbf{r}, t)\, e^{-i(\mathbf{k}\cdot\mathbf{r} - \omega t)}\, d^3r\, dt \tag{6.229}$$

Since $\mathbf{J}_s$ is a convolution in the spatial variables, its transform is $(2\pi)^3$
times the product of that of $\mathbf{A}$ and of $(x_ix_j/r^4)\,e^{-\beta r}$ where $x_i$ ($i = 1, 2, 3$)
are the coordinates. But the transform of the latter is


Problem 6-33: Show that the Fourier transform of
$(x_ix_j/r^4)\,e^{-\beta r}$ is given by Eqs. (6.231) to (6.233).


Ge) delta 9088 


dank (s —_ kiki 
a i tan B 5 (du k2 ) (6.230) 
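$$= f_1(k)\, \delta_{ij} + f_2(k)\, \frac{k_ik_j}{k^2} \tag{6.231}$$

$f_1$ and $f_2$ being given by

$$f_1 = \frac{2\pi}{(2\pi)^3\, k} \left[ \left(1 + \frac{\beta^2}{k^2}\right) \tan^{-1}\frac{k}{\beta} - \frac{\beta}{k} \right] \tag{6.232}$$

and

$$f_2 = \frac{2\pi}{(2\pi)^3\, k} \left[ \frac{3\beta}{k} - \left(1 + \frac{3\beta^2}{k^2}\right) \tan^{-1}\frac{k}{\beta} \right] \tag{6.233}$$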
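Since the transform of (6.228) is

$$(2\pi)^3\, C\, \bar{A}_j(\mathbf{k}, \omega) \left[ f_1\, \delta_{ij} + f_2\, \frac{k_ik_j}{k^2} \right]$$

and that of the corresponding London term is

$$-\frac{1}{\mu\lambda^2}\, \bar{\mathbf{A}}(\mathbf{k}, \omega)$$

we may determine $C$. For the integral of $(x_ix_j/r^4)\,e^{-\beta r}$ is $(2\pi)^3$ times its
transform for $k = 0$, viz., $(4\pi/3\beta)\,\delta_{ij}$. Consequently,

$$C = -\frac{3\beta}{4\pi\mu\lambda^2} \tag{6.234}$$

and the transform of $\mathbf{J}_s(\mathbf{r})$ is

$$\bar{\mathbf{J}}_s(\mathbf{k}, \omega) = -\frac{3}{2\mu\lambda^2} \left[ \bar{f}_1\, \bar{\mathbf{A}}(\mathbf{k}, \omega) + \frac{1}{k^2}\, \bar{f}_2\, \mathbf{k}\left(\mathbf{k}\cdot\bar{\mathbf{A}}(\mathbf{k}, \omega)\right) \right] \tag{6.235}$$

where $\bar{f}_1$, $\bar{f}_2$ are $4\pi^2\beta$ times $f_1$ and $f_2$, respectively.
In the case of the London equation, the transform $\mathbf{G}(\mathbf{k}, \omega)$ of $\mathbf{G}(\mathbf{r}, t)$ is
given by

$$\left(k^2 + \frac{1}{\lambda^2} - \frac{\omega^2}{c^2}\right) \mathbf{G}(\mathbf{k}, \omega) = \left(\frac{1}{2\pi}\right)^4 \mathbf{u}\, e^{-i(\mathbf{k}\cdot\mathbf{r}' - \omega t')} \tag{6.237}$$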


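On the other hand, the Pippard modification leads to

$$\left[k^2 - \frac{\omega^2}{c^2} + \frac{3}{2\lambda^2}\, \bar{f}_1(k)\right] \mathbf{G}(\mathbf{k}, \omega) + \frac{3}{2\lambda^2}\, \frac{\bar{f}_2(k)}{k^2}\, \mathbf{k}\left[\mathbf{k}\cdot\mathbf{G}(\mathbf{k}, \omega)\right] = \left(\frac{1}{2\pi}\right)^4 \mathbf{u}\, e^{-i(\mathbf{k}\cdot\mathbf{r}' - \omega t')} \tag{6.238}$$

Taking the scalar product of the last equation with $\mathbf{k}$, we find

$$\mathbf{k}\cdot\mathbf{G}(\mathbf{k}, \omega) = \left(\frac{1}{2\pi}\right)^4 \frac{(\mathbf{u}\cdot\mathbf{k})\, e^{-i(\mathbf{k}\cdot\mathbf{r}' - \omega t')}}{k^2 - (\omega^2/c^2) + (3/2\lambda^2)(\bar{f}_1 + \bar{f}_2)} \tag{6.239}$$

If we substitute this in (6.238) we obtain the Fourier transform of the
Green's function:

$$\mathbf{G}(\mathbf{k}, \omega) = \frac{1}{k^2 - (\omega^2/c^2) + (3/2\lambda^2)\, \bar{f}_1(k)} \times \left[ \mathbf{u} - \frac{3\, \bar{f}_2(k)\, \mathbf{k}(\mathbf{k}\cdot\mathbf{u})}{2\lambda^2k^2 \left[ k^2 - (\omega^2/c^2) + (3/2\lambda^2)(\bar{f}_1 + \bar{f}_2) \right]} \right] \times \left(\frac{1}{2\pi}\right)^4 e^{-i(\mathbf{k}\cdot\mathbf{r}' - \omega t')} \tag{6.240}$$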


Clearly, the inversion of this formula presents some difficulty in general. 
For practical purposes, it could be carried out by doing the angular 
integral explicitly in the Fourier integral, and doing the k integral 
numerically. 

Though explicit solutions are difficult to obtain in the Pippard model,
they are not in the London one. Let us first consider the Green's function
for the static case. This is obtained from (6.237) by multiplying by $2\pi$
and putting $\omega = 0$. The Green's function may be found by evaluating the
Fourier integral by contour integration, to give

$$\mathbf{G}(\mathbf{r}, \mathbf{r}') = \frac{\mathbf{u}}{4\pi|\mathbf{r} - \mathbf{r}'|}\; e^{-|\mathbf{r} - \mathbf{r}'|/\lambda} \tag{6.241}$$


Problem 6-34: Derive the integral equation

$$\mathbf{A}(\mathbf{r}) = \frac{1}{4\pi\lambda^2} \int \Lambda(\mathbf{r}')\, \mathbf{A}(\mathbf{r}')\, \frac{e^{-|\mathbf{r} - \mathbf{r}'|/\lambda}}{|\mathbf{r} - \mathbf{r}'|}\, d^3r'$$

for the vector potential in a medium consisting of a superconductor embedded
in a normal material. $\Lambda = 1$ in the normal material, 0 in the superconductor.
Hence show that fields penetrate into the superconductor
only for distances comparable with $\lambda$ (the "penetration depth").


Let us turn next to the dynamic problem. By inverting (6.237) we
obtain

$$\mathbf{G}(\mathbf{r}, t) = \left(\frac{1}{2\pi}\right)^4 \mathbf{u} \int \frac{e^{i(\mathbf{k}\cdot\mathbf{r} - \omega t)}}{k^2 + (1/\lambda^2) - (\omega^2/c^2)}\, d^3k\, d\omega \tag{6.242}$$

If we try to invert with respect to the time variable we meet the difficulty
encountered previously for ordinary electrodynamics, of having
poles on the contour of integration. It is in fact convenient to have recourse
to the conduction current to circumvent this difficulty. It introduces,
on the left-hand side of (6.226), the additional term $\sigma\mu(\partial\mathbf{A}/\partial t)$.
A similar term in the $\mathbf{G}$ equation adds to the denominator of the integral
of (6.242) the quantity $-i\omega\sigma\mu$, $\sigma$ being the conductivity. The poles in the
$\omega$ integral then have negative imaginary parts. Consequently, for $t > 0$,
when we deform the contour of integration around the lower half-plane,
we get a nonzero result. On the other hand, as would be expected, for
$t < 0$ the result is zero, since the contour must then circle the upper half-plane,
in which there are no singularities. The dissipative term has once
again distinguished the direction of time, and ensured the satisfaction of
the causality requirement.

After evaluating the integral, it is of course possible to let $\sigma \to 0$. If
we proceed in this way, we obtain

$$\mathbf{G}(\mathbf{r}, t) = \frac{c}{(2\pi)^3}\, \mathbf{u} \int \frac{1}{\sqrt{k^2 + (1/\lambda^2)}}\, \sin\left( ct\sqrt{k^2 + \frac{1}{\lambda^2}} \right) e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3k \tag{6.243}$$

We can now integrate over the angles of $\mathbf{k}$, leaving an integral in the
magnitude $k$ only:

$$\mathbf{G}(\mathbf{r}, t) = \frac{c}{2\pi^2 r}\, \mathbf{u} \int_0^\infty \frac{1}{\sqrt{k^2 + (1/\lambda^2)}}\, \sin\left( ct\sqrt{k^2 + (1/\lambda^2)} \right) \sin kr\; k\, dk$$

$$= -\frac{c}{2\pi^2 r}\, \mathbf{u}\, \frac{\partial}{\partial r} \int_0^\infty \frac{1}{\sqrt{k^2 + (1/\lambda^2)}}\, \sin\left( ct\sqrt{k^2 + (1/\lambda^2)} \right) \cos kr\, dk \tag{6.244}$$
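The advantage of the second form is that the integral is convergent,
whereas in the case of the first, the behavior is delta-function-like, as is
evident from considering the behavior of the integrand for large $k$.

Let us make, in (6.244), the substitution

$$k = \frac{1}{2\lambda}\left(\zeta - \frac{1}{\zeta}\right) \tag{6.245}$$

which transforms the integral [which we call $I(r, t)$] to the form

$$I(r, t) = \int_1^\infty \sin\left[\frac{ct}{2\lambda}\left(\zeta + \frac{1}{\zeta}\right)\right] \cos\left[\frac{r}{2\lambda}\left(\zeta - \frac{1}{\zeta}\right)\right] \frac{d\zeta}{\zeta} \tag{6.246}$$

$$= \frac{1}{2} \int_0^\infty \sin\left[\frac{ct}{2\lambda}\left(\zeta + \frac{1}{\zeta}\right)\right] \cos\left[\frac{r}{2\lambda}\left(\zeta - \frac{1}{\zeta}\right)\right] \frac{d\zeta}{\zeta}$$

$$= \frac{1}{4}\, \mathrm{Im} \int_0^\infty \left\{ \exp\frac{i}{2\lambda}\left[\zeta(ct + r) + \frac{1}{\zeta}(ct - r)\right] + \exp\frac{i}{2\lambda}\left[\zeta(ct - r) + \frac{1}{\zeta}(ct + r)\right] \right\} \frac{d\zeta}{\zeta} \tag{6.247}$$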

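We must now consider two cases separately, (i) $r > ct$ and (ii) $r < ct$.
(i) Make, in the first term of the integral, the substitution

$$\zeta\, \sqrt{\frac{r + ct}{r - ct}} = \eta$$

and in the second

$$\zeta\, \sqrt{\frac{r - ct}{r + ct}} = \eta$$

Then

$$I(r, t) = \frac{1}{4}\, \mathrm{Im} \int_0^\infty \left\{ \exp\left[\frac{i}{2\lambda}\sqrt{r^2 - c^2t^2}\left(\eta - \frac{1}{\eta}\right)\right] + \exp\left[-\frac{i}{2\lambda}\sqrt{r^2 - c^2t^2}\left(\eta - \frac{1}{\eta}\right)\right] \right\} \frac{d\eta}{\eta} \tag{6.248}$$

Since the integrand is real, for $r > ct$

$$I(r, t) = 0 \tag{6.249}$$

in keeping with the requirement of the causality condition.
(ii) When $r < ct$, we put in the first term

$$\zeta\, \sqrt{\frac{ct + r}{ct - r}} = \eta$$

and in the second

$$\zeta\, \sqrt{\frac{ct - r}{ct + r}} = \eta$$

to obtain

$$I(r, t) = \frac{1}{2}\, \mathrm{Im} \int_0^\infty \exp\left[\frac{i}{2\lambda}\sqrt{c^2t^2 - r^2}\left(\eta + \frac{1}{\eta}\right)\right] \frac{d\eta}{\eta} \tag{6.250}$$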


We note, with respect to this formula, that when $\eta$ has a small
positive imaginary part, the integral becomes very large; a small negative
imaginary part, however, leaves it perfectly well-behaved. It is therefore
expedient to deform the contour so that, in the neighborhood of the origin,
it makes a small semicircular detour into the lower half-plane as shown in
Fig. 6.4. Since the integrand becomes vanishingly small as the radius $\epsilon$ of
the "detour" $\to 0$, this does not affect the value of the integral. It may now
be deformed around the upper half-plane, the integral in a semicircle of
infinite radius going to zero exponentially.

The resulting contour integral around the origin can be shown on
using (4.390) to be

$$I(r, t) = \frac{1}{4}\, \mathrm{Im}\left[ 2\pi i\, J_0\!\left(\frac{1}{\lambda}\sqrt{c^2t^2 - r^2}\right) \right] = \frac{\pi}{2}\, J_0\!\left(\frac{1}{\lambda}\sqrt{c^2t^2 - r^2}\right) \tag{6.251}$$


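Figure 6.4


We can then write down, for all ranges of the variables, the formula
for the Green's function

$$\mathbf{G}(\mathbf{r}, t) = -\frac{c}{4\pi r}\, \mathbf{u}\, \frac{\partial}{\partial r} \left[ S(ct - r)\, J_0\!\left(\frac{1}{\lambda}\sqrt{c^2t^2 - r^2}\right) \right] \tag{6.252}$$

where $S$ is, as usual, 1 for positive argument, 0 for negative argument. On
differentiation, this becomes

$$\mathbf{G}(\mathbf{r}, t) = \mathbf{u} \left[ \frac{1}{4\pi r}\, \delta\!\left(t - \frac{r}{c}\right) - \frac{c}{4\pi\lambda}\, S(ct - r)\, \frac{J_1\!\left((1/\lambda)\sqrt{c^2t^2 - r^2}\right)}{\sqrt{c^2t^2 - r^2}} \right] \tag{6.253}$$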


20. Meson Equation 


Equation (6.226) has the same form as that for a vector meson
field in particle physics. The corresponding scalar (or pseudoscalar)
equation would describe corresponding meson fields. We can, therefore,
use our results to obtain the Green's function of the classical meson field.
For the meson field, the constant

$$\lambda = \frac{\hbar}{mc} \tag{6.254}$$

$\hbar$ being $1/2\pi$ times Planck's constant, $m$ the meson mass, and $c$ the
velocity of light.

It should be noted that, either in the case of the meson field or that of 
the superconducting electrodynamics, a disturbance is propagated with a 
continuous spectrum of velocities extending from zero up to the velocity 
of light. 
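One way to see this spread of velocities is through the dispersion relation implied by the London (or meson) equation. Assuming plane-wave solutions obey $\omega^2 = c^2(k^2 + 1/\lambda^2)$, the group velocity $d\omega/dk = c^2k/\omega$ runs from zero at $k = 0$ up toward $c$ as $k \to \infty$. The sketch below evaluates it; the function names and the sample values are illustrative assumptions.

```python
import math

def omega_of_k(k, lam, c):
    """Frequency for wave number k, assuming omega^2 = c^2 (k^2 + 1/lam^2)."""
    return c * math.sqrt(k * k + 1.0 / (lam * lam))

def group_velocity(k, lam, c):
    """d(omega)/dk = c^2 k / omega; always below c for finite k."""
    return c * c * k / omega_of_k(k, lam, c)

if __name__ == "__main__":
    lam, c = 1.0, 1.0                      # illustrative units
    for k in (0.0, 0.1, 1.0, 10.0, 1000.0):
        print(k, group_velocity(k, lam, c))
```

Long wavelengths (small $k\lambda$) travel slowly, while components with $k\lambda \gg 1$ travel at nearly the velocity of light, consistent with the sharp front at $r = ct$.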


PRELUDE TO CHAPTER 7 


The theme of this chapter is probability and random 
processes in physics. 

After some preliminary remarks, we introduce 
first the notion of probability generating functions 
(p.g.f.) for problems in which the distribution of the 
random variable is discrete, and obtain formulas for 
the moments of the distribution in terms of it. As a 
simple illustration, we consider the problem of coin 
tossing. It is shown that, as the number of trials 
becomes very large (“‘large number limit’) the distri- 
bution approaches a Gaussian. 
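The large number limit for coin tossing can be illustrated numerically. The sketch below compares the exact probability of $k$ heads in $n$ tosses of a fair coin with a Gaussian of mean $n/2$ and variance $n/4$. The function names and the choice $n = 400$ are illustrative assumptions.

```python
import math

def binomial_pmf(n, k):
    """Exact probability of k heads in n tosses of a fair coin."""
    return math.comb(n, k) / 2.0 ** n

def gaussian_approx(n, k):
    """Gaussian with the same mean (n/2) and variance (n/4)."""
    mean, var = n / 2.0, n / 4.0
    return math.exp(-(k - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)

if __name__ == "__main__":
    n = 400
    for k in (180, 190, 200, 210, 220):
        print(k, binomial_pmf(n, k), gaussian_approx(n, k))
```

Near the mean the two agree to a fraction of a percent for this $n$, and the agreement improves as $n$ grows.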

A generalization is made to a similar experiment 
in which the probabilities of the two outcomes of a 
single trial are not equal. 

A related physical problem is that of density 
fluctuations in a gas. The relation of the extent of 
density fluctuations to the size of the volume sampled 
is calculated. 

The problem of the Poisson distribution, which
is the distribution of the number of random events 
taking place in a given time interval when there is a 
given probability per unit time per event, is then 
solved. Again, it is shown that the distribution is 
Gaussian in the large number limit. The problem is 
then modified, in the case of radioactive counting, to 
take account of the “dead time” of the counter 
following the registration of a count. 

The problems considered up to this point have 
concerned discrete random variables. When the result 
of the probabilistic experiment has a continuous spec- 
trum, modified methods must be used. It is shown that 
the Fourier transform of the probability distribution 
is then a p.g.f. 

We next calculate the distribution of the sum of
independent random variables. The problem is easily
solved using Fourier transforms and the convolution
theorem. In the large number limit, a Gaussian distribution
is determined, and the root mean square (rms)
deviation calculated in terms of those for the individual
variables. This is the expression of the so-called
"central limit theorem." The corresponding problem
is solved for the case in which the independent
variables are vectors.

In the next section, the result just mentioned is 
combined with the Poisson distribution to provide a 
solution to the “random walk” (random flight) 
problem. The solution obtained is, at large distances 
from the origin, that of a diffusion equation, as might 
be expected. 

We then consider the problem of fluctuations in 
systems involving multiplication (for example, 
neutron chain reactors, cosmic ray showers, biological 
populations). We solve the cosmic ray shower problem 
according to a simple model. It is shown that the 
fluctuations are no longer relatively negligible in the 
large number limit. This depends on the fact that a 
fluctuation can propagate itself. 

The next section is concerned with the Markoff 
method, which relates probabilities to volumes in a
“sample space” using Fourier transform methods. As
applications, the two-dimensional random walk
problem is solved, and the three-dimensional one 
re-solved. A further application is that of the random 
distribution of a fixed amount of energy among an 
arbitrary number of particles (this problem can arise 
in nuclear reaction theory, or in the statistical theory 
of the microcanonical ensemble). 

Finally, we include a brief discussion on sto- 
chastic processes, using the relation developed in 
Chapter 4 between the autocorrelation function and 
the power spectrum of a time series. 


REFERENCES 
Feller, W., Probability Theory and Its Applications. New York: Wiley, 1950.
Miller, K. S., Engineering Mathematics. New York: Rinehart, 1956, Chapter 7.
Parzen, E., Modern Probability Theory and Its Applications. New York: Wiley, 1960.
Wax, N., Selected Papers on Noise and Stochastic Processes. New York: Dover, 1954.


PROBABILITY AND 
STOCHASTIC 
PROCESSES 


“Man can believe the impossible, but can never
believe the improbable”


Oscar Wilde 


1. Introduction 


We discuss in this chapter some elementary notions of probability 
and of random (stochastic) processes. All of the problems to be considered 
are characterized by variables whose values in a specific experiment or 
trial are uncertain, but are governed by an a priori probability distribution. 
For purposes of illustration, it is useful to think of the problem of tossing a 
coin. An experiment has two possible results—the coin may come up 
heads or tails. The usual assumption is that either outcome is equally 
likely. (Though the outcome of a particular trial would presumably be 
completely predictable from complete information about the conduct of 
the experiment.) What the assumption means is that our knowledge is 
quite incomplete, but the variation in the conditions of repeated trials is
sufficient that both outcomes are equally likely. Presumably, if one flipped 
coins by machine, and eliminated all variation in external conditions, one 
would always get the same result. The randomness in the usual coin- 
flipping is introduced by variations in the behavior of the person perform- 
ing the trials, and the fact that the outcome of individual experiments is 
sufficiently sensitive to the range of variation which the performer displays.
We might be uncertain whether, in fact, a bias might not exist in the
actual range of variation of performance of an individual. How could such
a bias be detected? Strictly speaking it could not be, at least with certainty. 
Suppose the experiment were repeated a very large number of times, and 
one result was found to take place significantly more often than the other; 
for instance, heads might turn up 10% more often than tails in a thousand 
trials. If the result of an individual experiment were assumed to be per- 
fectly random, there would be a small but finite probability of the outcome 
in question. Therefore, we would not have demonstrated a bias. But the 
probability of the outcome arising from a perfectly random process would 
be very small (about 1 part in 10/8 in this case). The probability would be
further reduced if the same 10% preference persisted through many more 
repetitions of the experiment, but could obviously never be reduced to 
zero. 

In some sense, every experiment in probability is a test of assump- 
tions about a priori probabilities. The results enable us to evaluate the 
validity of these assumptions. A commonplace example would be the 
throwing of loaded dice. If the probability for the distribution obtained 
in a series of trials would be negligibly small for unloaded dice, we can 
assume loading with a high degree of probability. 

There are many phenomena in physics concerning which we do not 
have complete information. These may be analyzed in terms of plausible 
assumptions about a priori probabilities, and the credibility to be assigned 
to these assumptions may thus be evaluated. Some concepts of physics 
are intrinsically probabilistic (e.g., “local” density of a gas in a large
container, pressure, temperature, etc.). In fact in macroscopic matter we 
never have complete knowledge, and our theories must be formulated 
probabilistically, making what seem tentatively to be reasonable assump- 
tions about a priori probabilities. The success of the resulting theories in 
describing nature constitutes a test of our assumptions. 


2. Probability Generating Functions


Consider an experiment involving the determination of a variable z 
whose possible values are a set of consecutive integers s = 0,1,2,...,n. 


Let the a priori probabilities of the various outcomes be designated $p_s$.
Then the function

$$f(x) = p_0 + p_1 x + p_2 x^2 + \cdots + p_n x^n = \sum_{s=0}^{n} p_s x^s \tag{7.1}$$

is designated the “probability generating function” (p.g.f.) of z. We note
certain properties of this function.

(a)
$$f(1) = p_0 + p_1 + p_2 + \cdots + p_n = \sum_{s=0}^{n} p_s = 1 \tag{7.2}$$


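(b)
$$\left(\frac{df}{dx}\right)_{x=1} = f'(1) = \sum_{s=1}^{n} s\,p_s \tag{7.3}$$

is the mean value of z, which we call $\langle z \rangle$.

(c) In terms of the p.g.f., the probabilities of the different values of z are

$$p_s = \frac{1}{s!}\left(\frac{d^s f}{dx^s}\right)_{x=0} \tag{7.4}$$

(d) The “rth moment” of the distribution function for z is

$$M_r = \sum_{s} s^r p_s = \left[\left(x\frac{d}{dx}\right)\left(x\frac{d}{dx}\right)\cdots(r\ \text{times})\cdots\left(x\frac{d}{dx}\right) f(x)\right]_{x=1} \tag{7.5}$$

This moment may also be written

$$M_r = \langle z^r \rangle \tag{7.6}$$

Notice that

$$\langle z - \langle z \rangle \rangle = 0 \tag{7.7}$$

by definition. A measure of the dispersion of values about the mean is
given by the “root mean square deviation” (rms deviation)

$$\Delta z = \sqrt{\langle (z - \langle z \rangle)^2 \rangle} \tag{7.8}$$

In terms of the p.g.f., the rms deviation is obtained as follows:

$$f''(1) = \sum_{s} s(s-1)\,p_s = \langle z^2 \rangle - \langle z \rangle$$

But

$$(\Delta z)^2 = \langle (z - \langle z \rangle)^2 \rangle = \langle z^2 \rangle - \langle z \rangle^2 \tag{7.9}$$

Thus

$$\Delta z = \sqrt{f''(1) + f'(1) - [f'(1)]^2} \tag{7.10}$$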

Let us illustrate the above with respect to the coin-tossing problem.
Let an experiment consist of n tosses of a coin, and define the variable z
as the number of heads which come up. The probability generating function is

$$f(x) = \frac{(1+x)^n}{2^n} = \frac{1}{2^n}\,(1+x)(1+x)\cdots(n\ \text{times})\cdots(1+x) \tag{7.11}$$


This is demonstrated by the following argument: 

(i) If we consider every possible outcome of a series of n tosses we 
assume that each is equally probable (assumption of equal a priori 
probability). 

(ii) The probability that the outcome of a given experiment is z = s is
equal to the number of experiments (out of all the possible ones) for which
z = s, divided by the total number of possible outcomes.

Since there are two possibilities for the result of each toss, the total
number of results from the experiment is $2^n$.




(iii) f(x) is evaluated in a power series in x in the following manner:
All possible products are formed by choosing from each factor (1 + x)
either the 1 or the x. The choice of a 1 may be taken to correspond to the
throw of a tail; that of an x to the throw of a head. All of the resulting
terms containing $x^s$ correspond to throwing s heads. The coefficient of $x^s$
in the expansion of $(1+x)^n$ is simply the number of terms in $x^s$. Therefore,

$$p_s = \text{coefficient of } x^s \text{ in } f(x) = \frac{\text{number of terms in } x^s}{2^n} \tag{7.12}$$

From this statement, (7.11) follows immediately.

A trivial result is that the mean number of heads is

$$f'(1) = \frac{n}{2} \tag{7.13}$$
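
The counting argument in (i) to (iii) can be checked by brute force for a small number of tosses. The sketch below is only an illustration: the function name `head_count_distribution` and the choice n = 10 are assumptions made here, not part of the text. It enumerates every sequence of tosses, counts heads, and compares the result with the coefficients of $x^s$ in $(1+x)^n/2^n$.

```python
from itertools import product
from math import comb

def head_count_distribution(n):
    """Return [p_0, ..., p_n] by enumerating all 2**n equally likely toss sequences."""
    counts = [0] * (n + 1)
    for outcome in product((0, 1), repeat=n):   # 1 = head (choose x), 0 = tail (choose 1)
        counts[sum(outcome)] += 1
    return [c / 2**n for c in counts]

n = 10
p = head_count_distribution(n)

# Coefficient of x**s in (1 + x)**n / 2**n, for comparison with the enumeration
binomial = [comb(n, s) / 2**n for s in range(n + 1)]

mean_heads = sum(s * ps for s, ps in enumerate(p))
print(p == binomial)   # the two lists agree term by term
print(mean_heads)      # equals n / 2
```

Enumeration grows as $2^n$, so this is practical only for small n; it is meant as a check on the reasoning, not as a way of computing the distribution.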


Note, however, that this seems equivalent to saying that, on an individual 
toss, heads and tails were equally likely. This, of course, was implied in 
our original assumption that all outcomes of the experiment were equally 
likely. If the coin had been subtly weighted so that the a priori probability 
of getting a head was greater than that of getting a tail on a particular toss, 
the various outcomes of the experiment of n tosses would not be equally 
probable; for example, the outcome of n heads would be more probable than that
of n tails. We see below how the p.g.f. would have to be modified in such a
case. 

Before going into that problem, however, let us calculate the root 
mean square deviation: 


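$$(\Delta z)^2 = f''(1) + f'(1) - [f'(1)]^2 = \frac{n(n-1)}{4} + \frac{n}{2} - \frac{n^2}{4} = \frac{n}{4}$$

so that

$$\Delta z = \tfrac{1}{2}\sqrt{n} \tag{7.14}$$

It should be noted that the root mean square deviation bears to the mean
the ratio

$$\frac{\Delta z}{\langle z \rangle} = \frac{1}{\sqrt{n}} \tag{7.15}$$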
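
As a numerical check of the p.g.f. recipe, the mean $f'(1)$ and the rms deviation $\sqrt{f''(1) + f'(1) - [f'(1)]^2}$ can be evaluated directly from a list of probabilities. This is a minimal sketch; the helper name `pgf_moments` and the value n = 100 are choices made for this illustration only.

```python
from math import comb, sqrt

def pgf_moments(p):
    """Mean and rms deviation from the p.g.f. derivatives at x = 1.

    p[s] is the probability of the value s, so that f(x) = sum(p[s] * x**s).
    """
    f1 = sum(s * ps for s, ps in enumerate(p))            # f'(1)
    f2 = sum(s * (s - 1) * ps for s, ps in enumerate(p))  # f''(1)
    return f1, sqrt(f2 + f1 - f1**2)

n = 100
fair_coin = [comb(n, s) / 2**n for s in range(n + 1)]
mean, rms = pgf_moments(fair_coin)
print(mean, rms, 0.5 * sqrt(n))   # rms should agree with sqrt(n)/2
```

The same function applies unchanged to any distribution over the integers 0, 1, ..., n, which is the point of working with the generating function rather than with the binomial coefficients directly.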


CASE OF LARGE n 
The probability of a given outcome z = s may be written explicitly
in terms of a binomial coefficient:

$$p_s = \frac{n!}{s!\,(n-s)!}\,\frac{1}{2^n} \tag{7.16}$$


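For large values of n and s we may use Stirling’s formula (4.245) to write

$$\begin{aligned}
\ln p_s &= \ln n! - \ln s! - \ln (n-s)! - n \ln 2 \\
&= (n+\tfrac{1}{2}) \ln n - n + \tfrac{1}{2} \ln 2\pi - (s+\tfrac{1}{2}) \ln s + s - \tfrac{1}{2} \ln 2\pi \\
&\quad - (n-s+\tfrac{1}{2}) \ln (n-s) + (n-s) - \tfrac{1}{2} \ln 2\pi - n \ln 2
\end{aligned} \tag{7.17}$$

The probability is a maximum where $(\partial/\partial s) \ln p_s = 0$. But

$$\frac{\partial \ln p_s}{\partial s} = -\left(s+\tfrac{1}{2}\right)\frac{1}{s} - \ln s + \left(n-s+\tfrac{1}{2}\right)\frac{1}{n-s} + \ln (n-s)$$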

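Neglecting, as in Stirling’s formula, terms which approach zero as
n, s → ∞, we then have

$$\ln (n-s) - \ln s = 0$$

or

$$s = \frac{n}{2} \tag{7.18}$$

a not very unexpected result. It says that the most likely single outcome
of the experiment is also the mean value.

Let us now expand $\ln p_s$ about the mean value n/2 by putting

$$s = \frac{n}{2} + \sigma \tag{7.19}$$

Then (7.17) becomes

$$\ln p(\sigma) = \left(n+\tfrac{1}{2}\right) \ln n - \left[\frac{n+1}{2} + \sigma\right] \ln\left(\frac{n}{2}+\sigma\right) - \left[\frac{n+1}{2} - \sigma\right] \ln\left(\frac{n}{2}-\sigma\right) - n \ln 2 - \tfrac{1}{2} \ln 2\pi$$

If this is expanded in powers of σ, there is no linear term. In fact

$$\begin{aligned}
\ln p(\sigma) &= \left(n+\tfrac{1}{2}\right) \ln n - \left[\frac{n+1}{2} + \sigma\right]\left(\ln\frac{n}{2} + \frac{2\sigma}{n} - \frac{1}{2}\left(\frac{2\sigma}{n}\right)^2 + \cdots\right) \\
&\quad - \left[\frac{n+1}{2} - \sigma\right]\left(\ln\frac{n}{2} - \frac{2\sigma}{n} - \frac{1}{2}\left(\frac{2\sigma}{n}\right)^2 + \cdots\right) - n \ln 2 - \tfrac{1}{2} \ln 2\pi \\
&= \ln 2 - \tfrac{1}{2} \ln 2\pi n - \frac{2\sigma^2}{n}
\end{aligned}$$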

where again we have omitted terms whose ratio to the ones kept approaches
zero as n → ∞. Hence

$$p(\sigma) = \sqrt{\frac{2}{\pi n}}\; e^{-(2/n)\sigma^2}$$

Therefore,

$$p_s = \sqrt{\frac{2}{\pi n}}\; e^{-(2/n)(s-\langle z \rangle)^2} = \frac{1}{\sqrt{2\pi}\,\Delta z}\; e^{-(s-\langle z \rangle)^2/2(\Delta z)^2} \tag{7.20}$$


This is a convenient analytical approximation to the distribution function 
when n is large, that is, when the experiment consists of a large number 
of independent random events. 
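
To get a feeling for how good the Gaussian form (7.20) is, one can compare it with the exact binomial probability (7.16) for a moderately large n. The sketch below assumes n = 1000 and a few sample values of s chosen only for illustration; the function names are hypothetical.

```python
from math import comb, exp, pi, sqrt

def exact_binomial(n, s):
    """Exact probability of s heads in n fair tosses, Eq. (7.16)."""
    return comb(n, s) / 2**n

def gaussian_approx(n, s):
    """Gaussian form of Eq. (7.20) with <z> = n/2 and rms deviation sqrt(n)/2."""
    mean, dz = n / 2, sqrt(n) / 2
    return exp(-(s - mean) ** 2 / (2 * dz**2)) / (sqrt(2 * pi) * dz)

n = 1000
for s in (500, 520, 550):
    exact, approx = exact_binomial(n, s), gaussian_approx(n, s)
    print(s, exact, approx, abs(approx - exact) / exact)
```

The relative error stays small within a few rms deviations of the mean and grows in the far tails, where the neglected higher powers of σ/n begin to matter.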

A quantity of interest is that deviation from the mean within which
there is a 90% probability that the result of an experiment will lie. Calling
this quantity $s_0$, it is defined by

$$\frac{1}{\sqrt{2\pi}\,\Delta z} \int_{-s_0+\langle z \rangle}^{s_0+\langle z \rangle} \exp\left[-\frac{(s-\langle z \rangle)^2}{2(\Delta z)^2}\right] ds = 0.90$$

From tables of the integral, the standard “error integral,” it is found that
$s_0 = 1.645\,\Delta z$.
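
The tabulated value 1.645 is the 90% half-width measured in units of the rms deviation, and it can be reproduced without tables. The sketch below uses Python's standard-library `statistics.NormalDist`; the helper `ninety_percent_limits` and the coin-tossing numbers are assumptions added for illustration.

```python
from statistics import NormalDist

# 90% central probability leaves 5% in each tail, so the half-width in units
# of the rms deviation is the 95th percentile of a unit Gaussian.
half_width = NormalDist().inv_cdf(0.95)

def ninety_percent_limits(mean, rms):
    """Interval about the mean that holds 90% of a Gaussian with the given rms deviation."""
    return mean - half_width * rms, mean + half_width * rms

print(round(half_width, 3))                 # 1.645
print(ninety_percent_limits(500.0, 15.81))  # n = 1000 fair tosses, rms = sqrt(n)/2
```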

As a generalization of the coin-tossing problem, consider the 
problem in which the a priori probabilities of the two outcomes of a 
single trial are, respectively, p and q = 1 − p. The p.g.f. is then

$$f(x) = (q + px)^n = (1 - p + px)^n \tag{7.21}$$

This reduces to the previous case when p = q = ½. That (7.21) is the
correct generating function is evident from the fact that the coefficient
corresponding to the choice of s x’s and (n − s) 1’s is weighted with the
factor $p^s q^{n-s}$, corresponding to the fact that each event associated with
the choice of x had probability p and each one associated with the choice
of a 1 had probability q. Let the two events, associated respectively
with x and 1, be designated A and B (heads and tails in the previous
problem). The mean number of A’s in n trials is then


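$$f'(1) = np = \langle z \rangle \tag{7.22}$$

and the root mean square deviation is

$$\begin{aligned}
\Delta z &= \{f''(1) + f'(1) - [f'(1)]^2\}^{1/2} \\
&= \{n(n-1)p^2 + np - n^2p^2\}^{1/2} \\
&= \sqrt{np(1-p)} \\
&= \sqrt{npq}
\end{aligned} \tag{7.23}$$

Therefore the rms deviation as a fraction of the mean is

$$\frac{\Delta z}{\langle z \rangle} = \sqrt{\frac{q}{np}} \tag{7.24}$$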
Problem 7-1: In an election, 55% of the voters favor
candidate A and 45% candidate B. A public opinion polling organization
samples 1000 randomly chosen voters. What is the rms deviation of the
number of voters they report for each candidate? What is it as a percentage
of the mean number in each case?
Answer: About 16, or 2.9% for A and 3.5% for B.
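
The quoted answer follows from $\Delta z = \sqrt{npq}$ with n = 1000. A short arithmetic check is sketched below; the function name `rms_deviation` is introduced here only for illustration.

```python
from math import sqrt

def rms_deviation(n, p):
    """rms deviation sqrt(n p q) of the number of successes in n trials."""
    return sqrt(n * p * (1 - p))

n = 1000
for name, p in (("A", 0.55), ("B", 0.45)):
    dz = rms_deviation(n, p)
    percent_of_mean = 100 * dz / (n * p)
    print(name, round(dz, 1), round(percent_of_mean, 1))
# A 15.7 2.9
# B 15.7 3.5
```

The rms deviation is the same for both candidates because pq is symmetric in p and q; only the mean, and hence the percentage, differs.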


Problem 7-2: Suppose that there are three candidates
A, B, and C who are favored by 45, 35, and 20% of the electorate, respectively.
Show that the rms deviations $\Delta z_A$, $\Delta z_B$, and $\Delta z_C$ of the numbers
reported for each candidate in a sample of 1000 are (to the nearest
integers) 16, 15, and 13, respectively, so that the 90% “confidence limits”
are for A, 42.4 to 47.6%; for B, 32.5 to 37.5%; and for C, 17.9 to 22.1%.


Problem 7-3: The lifetime of an artificially created
radioactive nucleus is determined by a delayed coincidence method.
Pulses from creation events are fed into a coincidence circuit after a
time delay. Pulses from decays are fed in without delay.

Suppose now that the coincidence counting rates $c_1$ and $c_2$ are measured
for delay times $\tau_1$ and $\tau_2$.

For best results, what proportional times should be spent observing
coincidences for each delay?

Show that, if a total time $t_0$ is available for counting, the root mean
square deviation in the derived decay constant in the best circumstances
is

$$\frac{1}{\sqrt{t_0}}\;\frac{c_1^{-1/2} + c_2^{-1/2}}{\tau_2 - \tau_1}$$


3. Density Fluctuations in a Gas 


Suppose we have a gas containing N molecules, each of mass m, in a
container of volume V. We assume that each molecule is as likely to be
at any point in the container as at any other. Consider a fixed subvolume v
within the larger container. We may consider the probability of having
various numbers n of particles in v, and consequently, the probability
distribution for the density of gas in volume v.


Problem 7-4: Consider two separate subvolumes v
in a large volume V (v ≪ V), the whole containing N molecules. Show
that the probability generating function is approximately

$$P(x, y) = p_1(x)\,p_1(y)$$

where

$$p_1(x) = e^{-N(v/V)(1-x)}$$

and therefore that the probability distributions in the two subvolumes are
effectively independent. Explain why this is so. [The result of this problem
illustrates an important principle of statistical mechanics, viz., the
mathematical independence of subsystems in equilibrium with a much
larger system, even though they are interacting and not isolated from each
other.]


This is simply an example of the previous problem, where N is the 
number of “trials,” p= (v/V) is the probability of one result (finding a 
molecule in v) and g = 1— (v/V) the other (finding a molecule outside v). 
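The root mean square deviation of the number in v is then

$$\Delta n = \sqrt{N\,\frac{v}{V}\left(1 - \frac{v}{V}\right)} \tag{7.25}$$

while the mean number is

$$\langle n \rangle = N\,\frac{v}{V} \tag{7.26}$$

Therefore the mean density in v is

$$\rho = m\,\frac{\langle n \rangle}{v} = m\,\frac{N}{V} \tag{7.27}$$

which is independent of the volume selected for measurement. However,
the rms deviation is the fraction

$$\frac{\Delta n}{\langle n \rangle} = \sqrt{\frac{1 - (v/V)}{\langle n \rangle}}$$

of the mean. If v/V ≪ 1, that is, if the volume sampled is only a small
fraction of the whole gas, this is

$$\frac{\Delta n}{\langle n \rangle} \approx \frac{1}{\sqrt{\langle n \rangle}} \tag{7.28}$$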


Therefore, so long as the volume v contains many molecules, the frac- 
tional density fluctuations are small. 
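
To put numbers to this statement, the fractional fluctuation can be evaluated for a few sampling volumes. The sketch below is illustrative only: the total N = 2.7 × 10^25 (roughly the number of molecules in a cubic metre of gas at standard conditions) and the volume fractions are assumptions made here, not values from the text.

```python
from math import sqrt

def fractional_fluctuation(N, v_over_V):
    """rms deviation of the number in v divided by the mean number in v."""
    p = v_over_V
    mean = N * p
    rms = sqrt(N * p * (1 - p))
    return rms / mean

N = 2.7e25   # assumed total number of molecules (illustrative)
for v_over_V in (1e-18, 1e-12, 1e-6):
    exact = fractional_fluctuation(N, v_over_V)
    small_volume_limit = 1 / sqrt(N * v_over_V)   # 1/sqrt(<n>), valid for v << V
    print(v_over_V, exact, small_volume_limit)
```

Even for a sampling volume containing only about 10^7 molecules, the fractional density fluctuation comes out at a few parts in 10^4.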


4. The Poisson Distribution 


Consider the problem of a slow radioactive emitter, which emits
quanta or particles of radiation at random times, the probability per unit
time of an emission being λ. Let us calculate the distribution of number of
particles emitted in a time interval of length t. This is called a Poisson
distribution.

We start from the following statement: The probability that n particles
have been emitted by time t + δt is the probability that n were emitted by
time t and none in the infinitesimal interval δt thereafter, plus the
probability that (n − 1) had been emitted by time t, and one was emitted in the
subsequent interval δt. [The probability of two being emitted in δt is of
order (δt)², etc.; since our aim is to derive a differential equation for
$P_n(t)$ we need only keep terms up to order δt.] The mathematical formulation
of the statement is

$$P_n(t+\delta t) = P_n(t)\,(1 - \lambda\,\delta t) + P_{n-1}(t)\,\lambda\,\delta t + O(\delta t^2) \tag{7.29}$$


Dividing through by δt and taking the limit as δt → 0, we obtain the
equation

$$\frac{dP_n(t)}{dt} = -\lambda P_n(t) + \lambda P_{n-1}(t) \tag{7.30}$$


Giving n all possible values, n = 0, 1, 2, ..., we have a chain of coupled
equations. They may be solved, however, by the simple device of introducing
the probability generating function

$$f(x, t) = \sum_{n=0}^{\infty} P_n(t)\,x^n \tag{7.31}$$

If we now multiply (7.30) by $x^n$ and add the equations for all possible n’s,
we get

$$\frac{\partial f(x,t)}{\partial t} = -\lambda (1-x)\, f(x,t) \tag{7.32}$$

The solution of this equation is

$$f(x, t) = f_0(x)\, e^{-\lambda(1-x)t}$$

But at t = 0, $P_0 = 1$ and all other $P_n$’s = 0, so that $f_0(x) = 1$; therefore,

$$f(x, t) = e^{-\lambda(1-x)t} \tag{7.33}$$

The $P_n$’s may now be obtained by expanding f(x, t) in powers of x:

$$f(x,t) = \sum_{n=0}^{\infty} P_n(t)\,x^n = e^{-\lambda t} \sum_{n=0}^{\infty} \frac{(\lambda t)^n x^n}{n!} \tag{7.34}$$

so that

$$P_n(t) = \frac{(\lambda t)^n}{n!}\, e^{-\lambda t} \tag{7.35}$$

which is the Poisson distribution.

The mean value at time t is

$$\langle n \rangle = \left(\frac{\partial f}{\partial x}\right)_{x=1} = \lambda t \tag{7.36}$$

which is in any case intuitively evident.
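
The chain of coupled equations (7.30) can also be integrated numerically as a check on the closed form (7.35). The sketch below uses a plain forward Euler step, which is a choice made here for brevity and not the method of the text; the names `integrate_chain` and `poisson` and the parameter values are assumptions.

```python
from math import exp, factorial

def integrate_chain(lam, t_end, n_max, steps=20000):
    """Forward-Euler integration of dP_n/dt = -lam*P_n + lam*P_{n-1}, n = 0..n_max."""
    dt = t_end / steps
    P = [1.0] + [0.0] * n_max          # P_0(0) = 1, all other P_n(0) = 0
    for _ in range(steps):
        new = P[:]
        for n in range(n_max + 1):
            inflow = lam * P[n - 1] if n > 0 else 0.0
            new[n] = P[n] + dt * (inflow - lam * P[n])
        P = new
    return P

def poisson(lam, t, n):
    """Closed-form Poisson probability (lam*t)**n * exp(-lam*t) / n!."""
    return (lam * t) ** n * exp(-lam * t) / factorial(n)

lam, t = 2.0, 1.5
numeric = integrate_chain(lam, t, n_max=12)
for n in (0, 3, 6):
    print(n, numeric[n], poisson(lam, t, n))
```

Because the equation for $P_n$ involves only $P_n$ and $P_{n-1}$, truncating the chain at `n_max` introduces no error in the retained probabilities; the only error is that of the finite time step.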


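The mean square deviation is

$$(\Delta n)^2 = \lambda t \tag{7.37}$$

which gives us once again the square root law

$$\frac{\Delta n}{\langle n \rangle} = \frac{1}{\sqrt{\langle n \rangle}} = \frac{1}{\sqrt{\lambda t}} \tag{7.38}$$

The distribution function is sharply peaked and narrow when λt is
large. This may be seen by taking the logarithm of (7.35) and using
Stirling’s approximation

$$\ln P_n(t) = n \ln \lambda t - \lambda t - (n+\tfrac{1}{2}) \ln n + n - \tfrac{1}{2} \ln 2\pi \tag{7.39}$$

Taking ∂/∂n and omitting terms which approach zero as n → ∞, we find

$$\frac{\partial}{\partial n} \ln P_n(t) = \ln \lambda t - \ln n$$

which is zero at

$$n = \lambda t$$

The distribution function therefore has a maximum at this value. If we put

$$n = \lambda t + \nu \tag{7.40}$$

we can expand $\ln P_n(t)$ in powers of ν:

$$\begin{aligned}
\ln P_n(t) &= (\lambda t+\nu) \ln \lambda t - \lambda t - \left(\lambda t+\nu+\tfrac{1}{2}\right)\left(\ln \lambda t + \frac{\nu}{\lambda t} - \frac{\nu^2}{2(\lambda t)^2} + \cdots\right) + \lambda t + \nu - \tfrac{1}{2} \ln 2\pi \\
&= -\tfrac{1}{2} \ln \lambda t - \tfrac{1}{2} \ln 2\pi - \frac{\nu^2}{2\lambda t}
\end{aligned} \tag{7.41}$$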


where we have omitted terms which go to zero as λt → ∞ and higher
powers in ν/λt. In this approximation

$$P_n(t) = \frac{1}{\sqrt{2\pi\lambda t}} \exp\left(-\frac{\nu^2}{2\lambda t}\right) = \frac{1}{\sqrt{2\pi\lambda t}} \exp\left[-\frac{(n-\lambda t)^2}{2\lambda t}\right] \tag{7.42}$$

which is again a Gaussian distribution whose width is ≈ √(λt). If we had
kept terms ~ ν³/(λt)², Eq. (7.42) would have been multiplied by another
negative exponential with an exponent proportional to this quantity. But
since the distribution function is already negligible for n − λt = ν ≫ √(λt),
and for smaller values the exponent in the new term is ≲ (λt)^{3/2}/(λt)² =
1/√(λt), which → 0 as λt becomes large, the approximation (7.42) is a good
one for large λt.


Problem 7-5: Suppose that exactly n calls are made
through a telephone switchboard in an hour. What is the probability that
ν calls will be made in any given minute?

If the number of calls per hour follows a Gaussian distribution with
mean n and rms deviation σ, what is the probability for ν calls in a given
minute?


Problem 7-6: Consider a fast radioactive emitter, so
that one must take account of the decrease with time of the number of
decaying nuclei. If there are N nuclei present initially, show that the
probabilities $P_n(t)$ of n decays in time t satisfy

$$\frac{dP_n(t)}{dt} = -\lambda_0\left[(N-n)P_n - (N-n+1)P_{n-1}\right]$$

and that the equation for the generating function is

$$\frac{\partial f(x,t)}{\partial t} = -N\lambda_0 (1-x)\, f(x,t) + \lambda_0\, x(1-x)\, \frac{\partial f(x,t)}{\partial x}$$

where $\lambda_0$ is the probability per unit time of the decay of a given nucleus.
Obtain the solution to the equation in the form

$$f(x, t) = e^{-N\lambda_0 t}\left[1 + x\left(e^{\lambda_0 t} - 1\right)\right]^N$$

and from it determine $P_n(t)$, $\langle n \rangle$, and $\Delta n$.


5. A Problem in Counting of Radioactive Decays


An elaboration of the problem of fluctuations in rate of radioactive
decay is provided by considering the counting of these decays electronically,
the counter having a “dead time” τ; that is, after a count is registered,
another cannot be detected until a time τ has elapsed. The question
is, then, what is the probability of n counts actually being recorded in
time t?

Let us introduce a new probability $Q_n(t)$ such that $Q_n(t)\,dt$ is the
probability that the nth count takes place in the interval (t, t + dt). Then

$$Q_{n+1}(t) = \int_0^t Q_n(t')\,p_0(t-t')\,dt' \tag{7.43}$$


where $p_0(t-t')$ is the probability per unit time that the (n + 1)st count
takes place in the time interval (t, t + dt) given that the nth took place at
t'. But we know that

$$p_0(t-t')\,dt = \begin{cases} 0, & t-t' < \tau \\ \lambda\,dt \times [\text{probability of no counts in interval } (t-t')] = \lambda\,dt\; e^{-\lambda(t-t')}, & t-t' > \tau \end{cases} \tag{7.44}$$


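If the Laplace transform of $Q_n(t)$ is $q_n(s)$, and that of $p_0(t)$ is $\pi_0(s)$, by the
convolution theorem the transform of Eq. (7.43) is

$$q_{n+1}(s) = q_n(s)\,\pi_0(s) \tag{7.45}$$

We may thus determine $q_n$ in terms of $q_1$:

$$q_n(s) = [\pi_0(s)]^{n-1}\, q_1(s) \tag{7.46}$$

But the transforms $\pi_0(s)$ and $q_1(s)$ are easily evaluated.

$$\pi_0(s) = \lambda \int_\tau^\infty e^{-\lambda t}\, e^{-st}\, dt = \frac{\lambda\, e^{-(\lambda+s)\tau}}{\lambda+s} \tag{7.47}$$

and

$$q_1(s) = \int_0^\infty Q_1(t)\, e^{-st}\, dt = \int_0^\infty \lambda\, e^{-\lambda t}\, e^{-st}\, dt = \frac{\lambda}{\lambda+s} \tag{7.48}$$

Therefore,

$$q_n(s) = \frac{\lambda^n\, e^{-(n-1)(\lambda+s)\tau}}{(\lambda+s)^n}$$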

The inverse transform is easily calculated by the Laplace inverse theorem;
it is

$$Q_n(t) = \lambda^n\, e^{-(n-1)\lambda\tau}\; \frac{1}{2\pi i} \int_{s_0-i\infty}^{s_0+i\infty} \frac{e^{-(n-1)s\tau}}{(\lambda+s)^n}\; e^{st}\, ds \tag{7.49}$$

$s_0$ is any quantity > −λ. The integral may be evaluated by extending the
contour into a semicircle of infinite radius on the negative side of $s_0$. On
evaluating the residue at the singularity we find that


$$Q_n(t) = \begin{cases} \lambda\, \dfrac{\{\lambda[t-(n-1)\tau]\}^{n-1}}{(n-1)!}\; e^{-\lambda t}, & t > (n-1)\tau \\[2ex] 0, & t < (n-1)\tau \end{cases} \tag{7.50}$$

Putting τ = 0, we obtain the Poisson distribution.

Before continuing, let us note the relation between $Q_n(t)$, which is
the probability distribution in time for the nth count, and $P_n(t)$, which is
the distribution in number of counts up to a given time. The connection
follows from the relation that the probability $Q_n(t)\,dt$ that the nth count
takes place in the interval (t, t + dt) is the probability $P_{n-1}(t-\tau)$ that up to
time (t − τ) there had been (n − 1) counts, multiplied by the probability
λ dt that another count took place in the interval in question. Thus

$$Q_n(t) = \lambda\, P_{n-1}(t-\tau) \tag{7.51}$$
or 


Pal) =~ One(t +7) 


_ [At nz)]" ene 


nI (7.52) 


As τ → 0 this reduces to the Poisson distribution obtained earlier. The
Laplace transform method therefore provides an alternative derivation
to that of probability generating functions.


Problem 7-7: For a given t, show that the most
likely n is given by the transcendental equation

$$\lambda\tau u = e^{1/u}$$

where u = (t − nτ)/nτ. For λτ ≪ 1, verify that

$$n \approx \frac{\lambda t}{1 + 2\lambda\tau}$$


6. Probability Distributions for Continuous Variables


Suppose that we consider a random continuous variable z, such that
the probability that its value is in the range (z, z + dz) is p(z) dz. Then

$$\int_{-\infty}^{\infty} p(z)\, dz = 1 \tag{7.53}$$

[If the range of values of z is limited, we define p(z) = 0 outside the
limits.] The mean value of z is

$$\langle z \rangle = \int_{-\infty}^{\infty} z\, p(z)\, dz \tag{7.54}$$

We define the nth moment of the distribution as

$$M_n = \langle z^n \rangle = \int_{-\infty}^{\infty} z^n p(z)\, dz \tag{7.55}$$

Also, the mean value of f(z) is

$$\langle f(z) \rangle = \int_{-\infty}^{\infty} f(z)\, p(z)\, dz \tag{7.56}$$


Two variables $z_1$ and $z_2$ are said to be independent if the distribution
of one does not depend on the value of the other. The joint distribution
$p(z_1, z_2)$ is defined so that $p(z_1, z_2)\,dz_1\,dz_2$ is the probability that the first lies in the
range $(z_1, z_1 + dz_1)$ and the second in the range $(z_2, z_2 + dz_2)$. If they are
independent

$$p(z_1, z_2) = p_1(z_1)\, p_2(z_2) \tag{7.57}$$

The mean value of $f_1(z_1) + f_2(z_2)$ is

$$\langle f_1(z_1) + f_2(z_2) \rangle = \langle f_1(z_1) \rangle + \langle f_2(z_2) \rangle \tag{7.58}$$

and that of $f_1(z_1) f_2(z_2)$ is

$$\langle f_1(z_1)\, f_2(z_2) \rangle = \langle f_1(z_1) \rangle\, \langle f_2(z_2) \rangle \tag{7.59}$$


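Problem 7-8: Suppose that α and β are independent
random variables with a Gaussian distribution with rms deviation γ.
Define

$$\xi(t) = \alpha \cos \omega t + \beta \sin \omega t$$

and

$$\eta(t) = \frac{d\xi}{dt} = -\alpha\omega \sin \omega t + \beta\omega \cos \omega t$$

Show that ξ(t), η(t) are independent random variables. Show also that
their distribution functions are

$$p_1(\xi) = \frac{1}{\sqrt{2\pi}\,\gamma}\; e^{-\xi^2/2\gamma^2}$$

$$p_2(\eta) = \frac{1}{\sqrt{2\pi}\,\gamma\omega}\; e^{-\eta^2/2\gamma^2\omega^2}$$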


The moments of the distribution of z are easily expressed in terms 
of the Fourier transform of p(z), which is in fact a sort of generalization 
of the notion of probability generating functions to continuous variables. 
Let the Fourier transform of p(z) be 


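$$\pi(x) = \int_{-\infty}^{\infty} p(z)\, e^{ixz}\, dz \tag{7.60}$$

p(z) may be written in terms of its transform as

$$p(z) = \frac{1}{2\pi} \int_{-\infty}^{\infty} \pi(x)\, e^{-ixz}\, dx \tag{7.61}$$

We note that π(0) = 1. Also, from (7.60) it follows that

$$\langle z \rangle = -i\,\pi'(0) \tag{7.62}$$

and in general

$$M_n(z) = \langle z^n \rangle = (-i)^n \left[\frac{d^n \pi(x)}{dx^n}\right]_{x=0} = (-i)^n\, \pi^{(n)}(0) \tag{7.63}$$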


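In particular, the mean square deviation of z from its mean is

$$(\Delta z)^2 = \langle z^2 \rangle - \langle z \rangle^2 = -\pi''(0) + [\pi'(0)]^2 \tag{7.64}$$

We note that

$$\pi(x) = \langle e^{ixz} \rangle = 1 + ix\langle z \rangle - \frac{x^2}{2!}\langle z^2 \rangle + \cdots + \frac{(ix)^n}{n!}\langle z^n \rangle + \cdots \tag{7.65}$$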
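
The relations (7.62) and (7.64) between moments and derivatives of the transform at x = 0 can be tried out numerically. The sketch below takes, purely as an assumed example, a variable uniformly distributed on [0, 1], whose transform is $(e^{ix} - 1)/(ix)$, and estimates the derivatives by central differences; the function names and the step size are choices made here.

```python
import cmath

def char_fn_uniform(x):
    """pi(x) = <exp(i x z)> for z uniform on [0, 1] (an assumed example distribution)."""
    if x == 0:
        return 1.0 + 0j
    return (cmath.exp(1j * x) - 1) / (1j * x)

def moments_from_transform(pi, h=1e-3):
    """Mean and mean square deviation from central differences of pi at x = 0."""
    d1 = (pi(h) - pi(-h)) / (2 * h)            # approximates pi'(0)
    d2 = (pi(h) - 2 * pi(0) + pi(-h)) / h**2   # approximates pi''(0)
    mean = (-1j * d1).real                     # Eq. (7.62)
    variance = (-d2 + d1**2).real              # Eq. (7.64)
    return mean, variance

mean, variance = moments_from_transform(char_fn_uniform)
print(mean, variance)   # close to 1/2 and 1/12 for the uniform distribution
```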


7. Distribution of a Sum of Independent Continuous Random Variables


Consider the distribution of

$$S_N = z_1 + z_2 + \cdots + z_N \tag{7.66}$$

where $z_i$ is a random variable with a distribution $p_i(z_i)$. We may relate the
distribution $P_N(S_N)$ of $S_N$ to that of $S_{N-1}$:

$$P_N(S_N) = \int P_{N-1}(S_{N-1})\; p_N(S_N - S_{N-1})\; dS_{N-1} \tag{7.67}$$

Letting the transform of $P_N$ be $\Pi_N(x)$, taking the transform of (7.67), and
using the convolution theorem, we get the equation

$$\Pi_N(x) = \Pi_{N-1}(x)\, \pi_N(x) \tag{7.68}$$

Problem 7-9: Prove the following:
(a) If the distribution of z is p(z), that of Z = f(z) is p(z)/f'(z),
where the right-hand side must be expressed in terms of Z.
(b) The distribution of $z^2 = Z$ is $p(\sqrt{Z})/2\sqrt{Z}$.
(c) The distribution of sin z = Z is $p(\sin^{-1} Z)/\sqrt{1-Z^2}$.


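Problem 7-10: If $z_n$ has a Lorentzian distribution

$$p_n(z_n) = \frac{1}{\pi}\, \frac{\lambda_n}{\lambda_n^2 + z_n^2}$$

show that $S_N$ has the Lorentzian distribution

$$P_N = \frac{1}{\pi}\, \frac{\Lambda}{\Lambda^2 + S_N^2}$$

where

$$\Lambda = \sum_{n=1}^{N} \lambda_n$$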


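Problem 7-11: If $z_n$ has a Gaussian distribution

$$p_n(z_n) = \frac{1}{\sqrt{2\pi}\,\Delta_n} \exp\left[-\frac{(z_n - \langle z_n \rangle)^2}{2\Delta_n^2}\right]$$

show that $S_N$ has the distribution

$$P_N(S_N) = \frac{1}{\sqrt{2\pi}\,\Delta S_N} \exp\left[-\frac{(S_N - \langle S_N \rangle)^2}{2(\Delta S_N)^2}\right]$$

where

$$\langle S_N \rangle = \sum_{n=1}^{N} \langle z_n \rangle$$

and

$$(\Delta S_N)^2 = \sum_{n=1}^{N} \Delta_n^2$$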

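It follows that

$$\Pi_N(x) = \pi_1(x)\, \pi_2(x) \cdots \pi_N(x) \tag{7.69}$$

Taking the inverse transform, we may in principle determine the distribution
of the sum. But (7.69) may now be written as

$$\Pi_N(x) = e^{\sum_n \ln \pi_n(x)} \tag{7.70}$$

$$= \exp \sum_n \ln\left[1 + ix\langle z_n \rangle - \frac{x^2}{2}\langle z_n^2 \rangle - \frac{ix^3}{6}\langle z_n^3 \rangle + \cdots\right] \tag{7.71}$$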


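To simplify the calculation slightly, let us choose the random variables
$z_n$ so that their mean values are zero (i.e., subtract from each its
mean value). Then

$$\Pi_N(x) = \exp \sum_n \ln\left[1 - \frac{x^2}{2}\langle z_n^2 \rangle - \frac{ix^3}{6}\langle z_n^3 \rangle + \cdots\right]$$

We note that $\pi_n(x)$ is a maximum at x = 0, where it has the value 1. For

$$\left(\frac{d\pi_n}{dx}\right)_{x=0} = i \int z\, p_n(z)\, dz = 0$$

and

$$\left(\frac{d^2\pi_n}{dx^2}\right)_{x=0} = -\int z^2 p_n(z)\, dz < 0$$

It follows that $\Pi_N(x)$ has a maximum (also unity) at x = 0. If we now
expand the logarithm in powers of x

$$\Pi_N(x) = \exp\left[-\frac{x^2}{2} \sum_n \langle z_n^2 \rangle - \frac{ix^3}{6} \sum_n \langle z_n^3 \rangle + \text{terms of order } x^4, \text{ etc.}\right]$$

$$= \exp\left[-N a_2 \frac{x^2}{2} - i N a_3 \frac{x^3}{6} + \cdots\right] \tag{7.72}$$

where

$$a_2 = \frac{1}{N} \sum_n \langle z_n^2 \rangle, \qquad a_3 = \frac{1}{N} \sum_n \langle z_n^3 \rangle \tag{7.73}$$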


Suppose now that A is a large enough quantity that $e^{-A}$ is negligible; then,
$\Pi_N(x)$ has a negligibly small factor if

$$x > \sqrt{\frac{2A}{a_2 N}}$$

But at the largest value of x which we need consider,

$$\left|\frac{i N a_3 x^3}{6}\right| = \frac{|a_3|}{6} \left(\frac{2}{a_2}\right)^{3/2} \frac{A^{3/2}}{N^{1/2}}$$

Continuing to the $x^4$ term, it is

$$-\frac{N a_4 x^4}{24}$$

where

$$a_4 = \frac{1}{N} \sum_n \left[3\langle z_n^2 \rangle^2 - \langle z_n^4 \rangle\right] \tag{7.74}$$

At $x = \sqrt{2A/a_2 N}$ it has the value $-\tfrac{1}{6}(a_4/a_2^2)(A^2/N)$. We note that, in


the limit of large N, the terms subsequent to the $x^2$ one are all very small
in the range in which $\Pi_N$ is not negligible. Therefore, in this limit

$$\Pi_N(x) = \exp\left[-\frac{x^2}{2} (\Delta S_N)^2\right] \tag{7.75}$$

where

$$(\Delta S_N)^2 = \sum_{n=1}^{N} (\Delta z_n)^2 \tag{7.76}$$

Inverting the Fourier transform, we obtain the distribution of the sum $S_N$,

$$P_N(S_N) = \frac{1}{\sqrt{2\pi}\,\Delta S_N} \exp\left[-\frac{S_N^2}{2(\Delta S_N)^2}\right] \tag{7.77}$$

This has been calculated for variables $z_n$ whose mean values $\langle z_n \rangle$ are
zero. Otherwise $S_N^2$ in the above formula must be replaced by $(S_N - \langle S_N \rangle)^2$
where $\langle S_N \rangle = \sum_n \langle z_n \rangle$.

The result (7.77) is generally called the “Central Limit Theorem.”
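
The approach to a Gaussian can be watched numerically by applying the convolution recursion (7.67) on a grid. The sketch below assumes, only for illustration, that each $z_n$ is uniform on an interval of unit length (variance 1/12) and that N = 12; the grid size M and all names are choices made here, and the grid replaces the integral by a sum.

```python
from math import pi, sqrt

def convolve(P, p, h):
    """Grid version of Eq. (7.67): both densities are sampled with spacing h."""
    out = [0.0] * (len(P) + len(p) - 1)
    for i, Pi in enumerate(P):
        for j, pj in enumerate(p):
            out[i + j] += Pi * pj * h
    return out

M, N = 50, 12                 # grid points per unit interval, number of variables
h = 1.0 / M
single = [1.0] * M            # uniform density of unit width (assumed example)
total = single[:]
for _ in range(N - 1):
    total = convolve(total, single, h)

# Measure S_N from the centre of the grid, i.e. with the mean subtracted.
centre = (len(total) - 1) / 2
variance = sum(((k - centre) * h) ** 2 * P * h for k, P in enumerate(total))
peak = total[len(total) // 2]
gaussian_peak = 1 / sqrt(2 * pi * N / 12)   # 1/(sqrt(2 pi) * Delta S_N)

print(variance, N / 12)       # variances add, Eq. (7.76)
print(peak, gaussian_peak)    # central density is already near the Gaussian value
```

The variance of the sum equals the sum of the variances for any N; it is only the shape of the distribution that requires N to be large before the Gaussian form is accurate.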

A generalization of the preceding problem is that in which $z_n$ is
replaced by a vector $\mathbf{r}_n$. If the distribution function $p_n(\mathbf{r}_n)$ for this vector
is known, the problem is to find that for $P_N(\mathbf{R}_N)$ where

$$\mathbf{R}_N = \sum_{n=1}^{N} \mathbf{r}_n \tag{7.78}$$

If $\pi_n(\mathbf{k})$ is the transform of $p_n(\mathbf{r}_n)$ and $\Pi_N(\mathbf{k})$ that of $P_N(\mathbf{R}_N)$, an equation
analogous to (7.69) can be obtained by exactly the same procedure as
used in the one-dimensional case:

$$\Pi_N(\mathbf{k}) = \pi_1(\mathbf{k})\, \pi_2(\mathbf{k}) \cdots \pi_N(\mathbf{k}) \tag{7.79}$$


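But

$$\pi_n(\mathbf{k}) = \int p_n(\mathbf{r}_n)\, e^{i\mathbf{k}\cdot\mathbf{r}_n}\, d^3r_n \tag{7.80}$$

$$= 1 + i\mathbf{k}\cdot\langle \mathbf{r}_n \rangle - \tfrac{1}{2} \sum_{\alpha,\beta} k_\alpha k_\beta \langle x_{n\alpha} x_{n\beta} \rangle + O(k^3) \tag{7.81}$$

where $x_{n\alpha}$ (α = 1, 2, 3) are components of the vector $\mathbf{r}_n$. Therefore,

$$\Pi_N(\mathbf{k}) = \exp \sum_{n=1}^{N} \ln\left\{1 + i\mathbf{k}\cdot\langle \mathbf{r}_n \rangle - \tfrac{1}{2} \sum_{\alpha,\beta} k_\alpha k_\beta \langle x_{n\alpha} x_{n\beta} \rangle + \cdots\right\}$$

If we choose the $\mathbf{r}_n$’s to have mean value zero, to a good approximation
for large N

$$\Pi_N(\mathbf{k}) = \exp\left\{-\tfrac{1}{2} \sum_{\alpha,\beta} k_\alpha k_\beta M^{(N)}_{\alpha\beta}\right\} \tag{7.82}$$

where

$$M^{(N)}_{\alpha\beta} = \sum_{n=1}^{N} \langle x_{n\alpha} x_{n\beta} \rangle \tag{7.83}$$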


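To evaluate (7.82) it is necessary to transform axes so as to diagonalize
$M^{(N)}_{\alpha\beta}$. The inverse transform is then calculated as the product of three
one-dimensional Gaussian transforms, one over the components in the
direction of each principal axis.

There is some simplification in the case in which $p_n(\mathbf{r}_n)$ depends only
on $r_n = |\mathbf{r}_n|$. Then $\langle \mathbf{r}_n \rangle = 0$ automatically and

$$\langle x_{n\alpha} x_{n\beta} \rangle = \int p(r_n)\, x_{n\alpha} x_{n\beta}\, d^3r_n = \tfrac{1}{3} \langle r_n^2 \rangle\, \delta_{\alpha\beta} \tag{7.84}$$

This follows from the fact that, when α ≠ β, the integrand is odd in the
variables $x_{n\alpha}$ and $x_{n\beta}$, and also that $\langle x_{n1}^2 \rangle = \langle x_{n2}^2 \rangle = \langle x_{n3}^2 \rangle = \tfrac{1}{3}\langle r_n^2 \rangle$ due to
spherical symmetry. In this case

$$\Pi_N(\mathbf{k}) = \exp\left[-\tfrac{1}{6} k^2 \langle R_N^2 \rangle\right] \tag{7.85}$$

where

$$\langle R_N^2 \rangle = \sum_{n=1}^{N} \langle r_n^2 \rangle \tag{7.86}$$

Taking the inverse transform

$$P_N(\mathbf{R}_N) = \left[\frac{3}{2\pi \langle R_N^2 \rangle}\right]^{3/2} \exp\left[-\frac{3 R_N^2}{2 \langle R_N^2 \rangle}\right] \tag{7.87}$$

This is the probability per unit volume at $\mathbf{R}_N$.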


The problem just discussed is one version of the so-called “random
walk” problem.
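
A direct simulation gives a feel for the random walk result that the mean square displacement after N steps is the sum of the mean square step lengths. The sketch below is a Monte Carlo illustration under assumptions made here: every step has the same fixed length a, directions are isotropic in three dimensions, and the seed, N = 50, and 2000 walks are arbitrary choices.

```python
import random
from math import cos, pi, sin, sqrt

def random_step(a, rng):
    """A step of length a in a uniformly random direction in three dimensions."""
    cos_theta = rng.uniform(-1.0, 1.0)     # uniform in cos(theta) gives isotropy
    phi = rng.uniform(0.0, 2.0 * pi)
    sin_theta = sqrt(1.0 - cos_theta**2)
    return a * sin_theta * cos(phi), a * sin_theta * sin(phi), a * cos_theta

def mean_square_displacement(n_steps, a, n_walks, seed=0):
    """Average of R_N**2 over n_walks independent walks of n_steps steps each."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_walks):
        x = y = z = 0.0
        for _ in range(n_steps):
            dx, dy, dz = random_step(a, rng)
            x, y, z = x + dx, y + dy, z + dz
        total += x * x + y * y + z * z
    return total / n_walks

msd = mean_square_displacement(n_steps=50, a=1.0, n_walks=2000)
print(msd)   # should lie close to N * a**2 = 50, within sampling error
```

With 2000 walks the statistical uncertainty of the estimate is of order 2%, so agreement is expected only to that level.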


8. Relation between Random Walk Problem and Diffusion


Consider the following problem: A point is moved in random steps,
the length of a step being determined by the probability distribution p(r),
and the probability per unit time of a step being λ. What is the probability
distribution for the resulting total displacement in time t?

If the number of steps taken in time t were known, the solution
would follow immediately from (7.87). However, the number of steps is in
fact given by the Poisson distribution, the probability of N steps being
$[(\lambda t)^N/N!]\, e^{-\lambda t}$. Now the transform of the distribution for N steps is
given by (7.79)

$$\Pi_N(\mathbf{k}) = [\pi(\mathbf{k})]^N \tag{7.88}$$

π(k) being the transform of the probability distribution for one step.
Thus, the transform of the distribution of displacement in time t is

$$\Pi(\mathbf{k}, t) = \sum_{N=0}^{\infty} \frac{(\lambda t)^N}{N!}\, e^{-\lambda t}\, [\pi(\mathbf{k})]^N = e^{-\lambda t[1 - \pi(\mathbf{k})]} \tag{7.89}$$


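If λt is large, most of the contribution comes from near k = 0; we can
therefore use the expansion of π(k) which follows from (7.81):

$$\pi(\mathbf{k}) = 1 - \tfrac{1}{6} k^2 \langle r^2 \rangle + \cdots \tag{7.90}$$

so that, approximately

$$\Pi(\mathbf{k}, t) = \exp\left[-\tfrac{1}{6} \lambda t\, k^2 \langle r^2 \rangle\right] \tag{7.91}$$

This is readily inverted to give the probability distribution P(R, t) for
displacement in time t:

$$P(\mathbf{R}, t) = \left[\frac{3}{2\pi \lambda t \langle r^2 \rangle}\right]^{3/2} \exp\left[-\frac{3R^2}{2\lambda t \langle r^2 \rangle}\right] \tag{7.92}$$

This result is clearly the same as that for diffusion from a unit point
source, given by (6.25). The identification of the parameters in the two
problems

$$B = \tfrac{1}{6} \lambda \langle r^2 \rangle \tag{7.93}$$

gives a “first principles” expression for the diffusion constant B.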
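
The Poisson average in Eq. (7.89) and the small-k form (7.91) are easy to compare numerically. The sketch below assumes, as in Problem 7-15, steps of fixed length a in a random direction, for which π(k) = sin(ka)/(ka) and the mean square step is a²; the parameter values and function names are illustrative choices.

```python
from math import exp, sin

def single_step_transform(k, a):
    """pi(k) = sin(ka)/(ka) for steps of fixed length a in a random direction."""
    return 1.0 if k == 0 else sin(k * a) / (k * a)

def poisson_averaged(k, a, lam_t, n_terms=200):
    """Sum over N of Poisson weight times pi(k)**N, truncated after n_terms terms."""
    pi_k = single_step_transform(k, a)
    term = exp(-lam_t)            # N = 0 term
    total = term
    for N in range(1, n_terms):
        term *= lam_t * pi_k / N  # ratio of successive terms of the series
        total += term
    return total

k, a, lam_t = 0.3, 1.0, 20.0
summed = poisson_averaged(k, a, lam_t)
closed_form = exp(-lam_t * (1 - single_step_transform(k, a)))   # right side of (7.89)
gaussian = exp(-lam_t * k**2 * a**2 / 6)                        # Eq. (7.91), <r^2> = a^2
print(summed, closed_form, gaussian)
```

The first two numbers agree to rounding error, while the Gaussian form differs slightly because of the neglected terms of order k⁴ in the expansion of π(k).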


Problem 7-12: In two dimensions, show that if each
step is of length a but in an arbitrary direction,

$$\pi(\mathbf{k}) = J_0(ka)$$

Hence determine the mean square displacement after N steps for arbitrary
N. Show that this result may also be obtained by elementary means.


Problem 7-13: By considering the case N = 1 show
that

$$\left(\frac{1}{2\pi}\right)^2 \int J_0(ka)\, e^{-i\mathbf{k}\cdot\boldsymbol{\rho}}\, d^2k = \frac{1}{2\pi a}\, \delta(\rho - a)$$


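Problem 7-14: By calculating directly the probability
distribution for N = 2 show that

$$\left(\frac{1}{2\pi}\right)^2 \int [J_0(ka)]^2\, e^{-i\mathbf{k}\cdot\boldsymbol{\rho}}\, d^2k = \frac{1}{\pi^2 \rho \sqrt{4a^2 - \rho^2}}$$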


Problem 7-15: Show that, in three dimensions, if
each step is of length “a” but the direction is arbitrary

$$\pi(\mathbf{k}) = \frac{\sin ka}{ka}$$

(a) Show directly that the mean square displacement in N steps is
$Na^2$.
(b) Find explicitly P,(R). 


9. Problems Involving Multiplication 


Suppose that we have a system in which, by some method of repro- 
duction, elements give rise to “offspring.” If there is a fluctuation in the 
population, we would expect it somehow to propagate itself. The question 
poses itself, whether this might not give rise to much larger fluctuations 
than those discussed up to now, which became proportionately less and 
less significant in the large number limit. 

A simple example may be drawn from physics. Consider a simple
model of a cosmic ray shower. An electron or positron, on passing through
the atmosphere, has a finite probability per unit length of producing a
pair, which we call $\lambda$. How many electrons and positrons result from one
incident one in a distance $l$?

Let the probability that there are $n$ at a distance $x$ be $P_n(x)$. The
probability that there are $n$ at distance $x + \delta x$ = (probability of $n$ at
distance $x$) $\times\, (1 - n\lambda\,\delta x)$ + (probability of $n - 2$ at distance $x$)
$\times\, [(n - 2)\lambda\,\delta x]$, that is,

$$P_n(x + \delta x) = P_n(x)\,(1 - n\lambda\,\delta x) + P_{n-2}(x)\,(n - 2)\lambda\,\delta x$$


Dividing by $\delta x$ and taking the limit $\delta x \to 0$ we obtain the differential
equation

$$\frac{\partial P_n}{\partial x} = -n\lambda P_n + \lambda (n - 2) P_{n-2} \tag{7.94}$$


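We can introduce the generating function

$$f(x, \xi) = \sum_n P_n(x)\, \xi^n \tag{7.95}$$
and write (7.94) in terms of it by multiplying through by $\xi^n$:

$$\frac{\partial f(x, \xi)}{\partial x} = -\lambda \xi \frac{\partial f}{\partial \xi} + \lambda \xi^3 \frac{\partial f}{\partial \xi} \tag{7.96}$$
The introduction of a new variable
$$u = \int \frac{d\xi}{\xi (1 - \xi^2)} = \ln \xi - \tfrac{1}{2} \ln (1 - \xi^2) \tag{7.97}$$
transforms the equation to
$$\frac{\partial f}{\partial x} + \lambda \frac{\partial f}{\partial u} = 0 \tag{7.98}$$
for which
$$f = F(u - \lambda x) \tag{7.99}$$

is a solution, $F$ being an arbitrary function. To determine the function, we
use the initial condition $F(u) = \xi$ when $x = 0$. The problem is then to
express $\xi$ in terms of $u$; but

$$\frac{\xi}{(1 - \xi^2)^{1/2}} = e^u \tag{7.100}$$
Therefore,
$$\xi^2 = \frac{e^{2u}}{1 + e^{2u}}$$
or
$$\xi = \frac{e^u}{[1 + e^{2u}]^{1/2}} \tag{7.101}$$
It follows that the generating function is
$$f(x, \xi) = \frac{\xi}{\sqrt{1 - \xi^2}}\, \frac{e^{-\lambda x}}{[1 + \xi^2 e^{-2\lambda x}/(1 - \xi^2)]^{1/2}} = \frac{\xi\, e^{-\lambda x}}{[1 - \xi^2 (1 - e^{-2\lambda x})]^{1/2}} \tag{7.102}$$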


The probability of $n$ particles at distance $l$ is different from zero only if $n$ is
odd [that is, $(2s + 1)$, say] since only pairs are produced. Specifically

$$P_{2s+1} = e^{-\lambda l}\, (1 - e^{-2\lambda l})^s\, C_s \tag{7.103}$$

where $C_s$ is the coefficient of $y^s$ in the expansion of $(1 - y)^{-1/2}$. The mean number is

$$\langle n(l) \rangle = e^{2\lambda l} \tag{7.104}$$
A straightforward but slightly tedious calculation also gives
$$\Delta n(l) = \sqrt{2}\, (e^{4\lambda l} - e^{2\lambda l})^{1/2} \tag{7.105}$$


We see, then, that fluctuations larger than the mean value are quite 
probable, a peculiar feature of processes involving multiplication. 
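
The closed forms (7.103) to (7.105) can be checked numerically. What follows is only a minimal sketch, not part of the original text: the function names, the truncation `s_max`, and the sample values of $\lambda$ and $l$ are illustrative assumptions. It builds $P_{2s+1}$ from (7.103), generating the coefficient of $y^s$ in $(1 - y)^{-1/2}$ by the recurrence $C_{s+1} = C_s (2s+1)/(2s+2)$, and compares the resulting mean and spread with (7.104) and (7.105).

```python
import math


def shower_distribution(lam, length, s_max=400):
    """Return [P_1, P_3, P_5, ...] from (7.103), truncated after s_max pairs."""
    q = 1.0 - math.exp(-2.0 * lam * length)
    probs = []
    weight = math.exp(-lam * length)  # term with s = 0, where C_0 = 1
    for s in range(s_max + 1):
        probs.append(weight)
        # next term: multiply by q and by C_{s+1} / C_s = (2s + 1) / (2s + 2)
        weight *= q * (2 * s + 1) / (2 * s + 2)
    return probs


def shower_moments(probs):
    """Return (total probability, mean of n, standard deviation of n)."""
    total = sum(probs)
    mean = sum((2 * s + 1) * p for s, p in enumerate(probs))
    second = sum((2 * s + 1) ** 2 * p for s, p in enumerate(probs))
    return total, mean, math.sqrt(second - mean * mean)


if __name__ == "__main__":
    lam, length = 1.0, 0.5  # illustrative values only
    total, mean, spread = shower_moments(shower_distribution(lam, length))
    print("total probability:", total)
    print("mean:", mean, "vs (7.104):", math.exp(2.0 * lam * length))
    print("spread:", spread, "vs (7.105):",
          math.sqrt(2.0 * (math.exp(4.0 * lam * length) - math.exp(2.0 * lam * length))))
```

The truncation is harmless as long as $(1 - e^{-2\lambda l})^{s_{\max}}$ is negligible; for larger $\lambda l$ the cutoff would have to be raised.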


Problem 7-16: Suppose that a colony of amoebae
subdivide in two with a probability $\lambda$ per unit time, and die with a
probability $\mu$ per unit time. Starting with 1, determine the probability that $n$
will be alive at time $t$.


10. The Markoff Method 


The Markoff method is one based on Fourier transforms, and on a 
geometrical representation of the assumption regarding a priori prob- 
abilities. To illustrate it, let us return to the “random walk” problems in 
two and three dimensions. 

Consider first the two-dimensional problem. The steps of the walk
are characterized by angles $\theta_n$; let us assume that they are all of length
$a$. Consider the $N$-dimensional “space” of which all the $\theta_n$'s are
coordinates; it is a space of volume $(2\pi)^N$. The equations

$$a \sum_n \cos\theta_n \in (X, X + \delta X) \tag{7.106}$$
$$a \sum_n \sin\theta_n \in (Y, Y + \delta Y) \tag{7.107}$$


define a region between infinitesimally separated “surfaces” (of
dimensionality $N - 2$) in this space. The probability that the coordinates of the
displacement resulting from the $N$ random steps lie in the range
$(X, X + \delta X;\ Y, Y + \delta Y)$ is simply the volume of space lying between these
surfaces, divided by the total volume of the space. The fact that it is only
the volumes which matter is a consequence of the assumption of equal a
priori probability for each value of each of the $\theta_n$'s. (Otherwise, we would
have to calculate volumes weighted by the probabilities of the various
values of $\theta_n$.) That the probability does depend on the volumes as stated
follows from the fact that each point in the $\theta$ space represents a possible
“walk,” all such walks being, in fact, equally probable. The probability
required is then that fraction of the total volume corresponding to the
walks in question.

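The second step in the Markoff method is to express the volumes in
terms of Fourier transforms. The volume is in fact the integral over the
$\theta$'s of a function which is 1 in the regions defined by (7.106) and (7.107)
and zero elsewhere. Let us call this function $\Delta$. Its Fourier transform as a
function of $X$ and $Y$ is

$$\int \Delta\, e^{i(k_1 X + k_2 Y)}\, dX\, dY = e^{i k_1 a \sum_n \cos\theta_n}\, e^{i k_2 a \sum_n \sin\theta_n}\, \delta X\, \delta Y$$

Therefore by the inverse Fourier theorem,

$$\Delta = \delta X\, \delta Y \left(\frac{1}{2\pi}\right)^2 \int e^{-i k_1 (X - a \sum_n \cos\theta_n)}\, e^{-i k_2 (Y - a \sum_n \sin\theta_n)}\, dk_1\, dk_2 \tag{7.108}$$

Let us now write

$$k_1 = k \cos\alpha, \qquad k_2 = k \sin\alpha \tag{7.109}$$
We then have
$$\Delta = \delta X\, \delta Y \left(\frac{1}{2\pi}\right)^2 \int e^{-i(k_1 X + k_2 Y)}\, e^{i k a \sum_n \cos(\theta_n - \alpha)}\, k\, dk\, d\alpha \tag{7.110}$$

The integral of $\Delta$ over the $\theta_n$'s, divided by the volume of $\theta$ space, that is,
$(2\pi)^N$, is the required probability. It is

$$P_N(X, Y)\, \delta X\, \delta Y = \delta X\, \delta Y \left(\frac{1}{2\pi}\right)^2 \int e^{-i(k_1 X + k_2 Y)}\, [J_0(ka)]^N\, k\, dk\, d\alpha \tag{7.111}$$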


where

$$\rho = \sqrt{X^2 + Y^2} \tag{7.112}$$

Therefore the probability per unit area for displacement $\rho$ is
$$P_N(\rho) = \frac{1}{2\pi} \int_0^\infty J_0(k\rho)\, [J_0(ka)]^N\, k\, dk \tag{7.113}$$


This is the same result obtained by previous methods. 
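
The key ingredient of (7.111) and (7.113) is that the transform of the $N$-step distribution is $[J_0(ka)]^N$, which can be tested by direct simulation. The sketch below is an illustration under stated assumptions, not the author's procedure: the function names, the sample size, the seed, and the choice of $\mathbf{k}$ along the $x$ axis are all hypothetical, and $J_0$ is evaluated from its angular-average definition rather than from a library routine.

```python
import math
import random


def bessel_j0(z, points=400):
    """J_0(z) as the angular average of cos(z cos(theta)) over a full turn."""
    step = 2.0 * math.pi / points
    return sum(math.cos(z * math.cos(j * step)) for j in range(points)) / points


def walk_transform_mc(k, a, n_steps, n_walks, rng):
    """Monte Carlo estimate of <exp(i k X)> for walks of n_steps steps of length a.

    k is taken along the x axis; the imaginary part vanishes by symmetry,
    so only the cosine is accumulated.
    """
    total = 0.0
    for _ in range(n_walks):
        x = sum(a * math.cos(rng.uniform(0.0, 2.0 * math.pi)) for _ in range(n_steps))
        total += math.cos(k * x)
    return total / n_walks


if __name__ == "__main__":
    rng = random.Random(2024)  # fixed seed, an arbitrary choice
    k, a, n_steps = 0.7, 1.0, 5
    print("Monte Carlo :", walk_transform_mc(k, a, n_steps, 20000, rng))
    print("[J0(ka)]^N  :", bessel_j0(k * a) ** n_steps)
```

The two printed numbers should agree to within the statistical error of the simulation, which falls off as the inverse square root of the number of walks.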


Problem 7-17: Find $P_N(\rho)$ when the probability of a
step making a given angle $\theta$ with the $x$ axis is proportional to $\cos^2\theta$, that is,
is $(1/\pi)\cos^2\theta$. Show that in the large number limit it is

$$P_N(\rho) = \frac{2}{\sqrt{3}\, \pi N a^2} \exp\left[-\frac{2X^2}{3Na^2} - \frac{2Y^2}{Na^2}\right]$$
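We may also apply the Markoff method to the three-dimensional
problem. The generalization of (7.108) to three dimensions is clearly

$$\Delta = \delta X\, \delta Y\, \delta Z \left(\frac{1}{2\pi}\right)^3 \int e^{-i\mathbf{k}\cdot(\mathbf{R} - \sum_n \mathbf{r}_n)}\, d^3k \tag{7.114}$$

where
$$|\mathbf{r}_n| = a \tag{7.115}$$
To obtain the volume in the sample space corresponding to displacement
to a volume $d^3R$ at $\mathbf{R}$ we integrate over the polar angles of the $\mathbf{r}_n$'s. The
total available volume is $(4\pi)^N$. Therefore,
$$P_N(\mathbf{R})\, d^3R = \left(\frac{1}{2\pi}\right)^3 \left(\frac{1}{4\pi}\right)^N \int d\Omega_1 \cdots d\Omega_N \int e^{-i\mathbf{k}\cdot\mathbf{R}}\, e^{i\mathbf{k}\cdot\sum_n \mathbf{r}_n}\, d^3k\; d^3R \tag{7.116}$$

where the $\Omega$'s are the solid angles of the $\mathbf{r}_n$'s. Evaluation of the angle
integrals gives
$$P_N(\mathbf{R}) = \left(\frac{1}{2\pi}\right)^3 \int \left(\frac{\sin ka}{ka}\right)^N e^{-i\mathbf{k}\cdot\mathbf{R}}\, d^3k \tag{7.117}$$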


as before. 


11. Random Distribution of 
Energy among N Particles 


Suppose that a system of $N$ particles of mass $M$ has a fixed energy in
the range $(E_0, E_0 + \delta E_0)$ but the energy is shared among them randomly.
Let us find the probability distribution for the energy of any one particle.

We must, of course, make an assumption about a priori probabilities.
We do this in terms of the momenta of the particles. Consider a “phase
space” in which the coordinates are the $3N$ momentum components. We
shall assume that, a priori, every point in this space is equally probable.
However, the only available part of the space is that for which

$$\frac{1}{2M} \left(p_1^2 + p_2^2 + \cdots + p_N^2\right)$$
lies between $E_0$ and $E_0 + \delta E_0$. Its volume is that which lies between the
hyperspheres

$$\sum_{n=1}^{N} p_n^2 = 2ME_0 \tag{7.118}$$
and
$$\sum_{n=1}^{N} p_n^2 = 2M(E_0 + \delta E_0) \tag{7.119}$$

The probability that the energy of the first particle is between $E_1$ and
$E_1 + dE_1$ is that fraction of the above volume for which

$$E_1 < \frac{p_1^2}{2M} < E_1 + dE_1 \tag{7.120}$$


Although the volumes in question could be evaluated by the method
of Fourier transforms, as in the previous problems, they may in this case
be obtained more readily by a recursion method employing Laplace
transforms.
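The volume within a “sphere” of radius $\sqrt{u}$ in $\nu$ dimensions is

$$V_\nu(u) = \int_{p_1^2 + p_2^2 + \cdots + p_\nu^2 < u} dp_1\, dp_2 \cdots dp_\nu \tag{7.121}$$

For a given $p_\nu$, the sum of the squares of the other $(\nu - 1)$ coordinates
ranges over $(0, u - p_\nu^2)$. Therefore,

$$V_\nu(u) = \int V_{\nu-1}(u - p_\nu^2)\, dp_\nu \tag{7.122}$$

Taking Laplace transforms of both sides of this equation, and
defining $U_\nu(s)$ as the transform of $V_\nu(u)$,

$$U_\nu(s) = U_{\nu-1}(s) \int_{-\infty}^{\infty} e^{-s p_\nu^2}\, dp_\nu = U_{\nu-1}(s) \left(\frac{\pi}{s}\right)^{1/2} \tag{7.123}$$
Therefore,
$$U_\nu(s) = \left(\frac{\pi}{s}\right)^{(\nu-1)/2} U_1(s) \tag{7.124}$$
But
$$V_1(u) = 2\sqrt{u}$$
so that
$$U_1(s) = 2 \int_0^\infty \sqrt{u}\, e^{-su}\, du = \frac{2\,\Gamma(3/2)}{s^{3/2}} = \frac{1}{s} \sqrt{\frac{\pi}{s}} \tag{7.125}$$
Therefore, finally
$$U_\nu(s) = \frac{\pi^{\nu/2}}{s^{(\nu/2)+1}} \tag{7.126}$$

The function of which this is the transform is immediately identifiable as

$$V_\nu(u) = \frac{\pi^{\nu/2}\, u^{\nu/2}}{\Gamma((\nu/2) + 1)} \tag{7.127}$$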
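
As a sanity check on (7.127), one can compare it with the familiar cases $\nu = 2$ (area $\pi u$) and $\nu = 3$ (volume $\tfrac{4}{3}\pi u^{3/2}$), and confirm that it satisfies the recursion (7.122). The following sketch is offered under assumptions of our own: the function names and the midpoint-rule step count are hypothetical choices, and the recursion is applied for one step only, using the closed form for $V_{\nu-1}$.

```python
import math


def ball_volume(nu, u):
    """Volume enclosed by p_1^2 + ... + p_nu^2 < u, from (7.127)."""
    return math.pi ** (nu / 2) * u ** (nu / 2) / math.gamma(nu / 2 + 1)


def ball_volume_one_step(nu, u, steps=2000):
    """Same volume from one application of (7.122).

    Integrates V_{nu-1}(u - p^2) over the last coordinate p with the
    midpoint rule; V_{nu-1} itself is taken from the closed form.
    """
    if nu == 1:
        return 2.0 * math.sqrt(u)  # length of the interval (-sqrt(u), sqrt(u))
    radius = math.sqrt(u)
    h = 2.0 * radius / steps
    total = 0.0
    for j in range(steps):
        p = -radius + (j + 0.5) * h  # midpoint of the j-th slice
        total += ball_volume(nu - 1, u - p * p) * h
    return total


if __name__ == "__main__":
    for nu in (1, 2, 3, 4, 5):
        print(nu, ball_volume(nu, 2.0), ball_volume_one_step(nu, 2.0))
```

The midpoint rule converges slowly for $\nu = 2$ because the integrand has a square-root endpoint behaviour, so the two columns agree only to a few significant figures there.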


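Thus, the volume in the $3N$-dimensional space of the momentum
components corresponding to $u$ between $2ME_0$ and $2M(E_0 + \delta E_0)$ is

$$V_T = 2M\,\delta E_0 \left.\frac{d}{du} V_{3N}(u)\right|_{u = 2ME_0} = 2M\,\delta E_0\; \pi^{3N/2}\, \frac{3N}{2}\, (2ME_0)^{(3N/2)-1}\, \frac{1}{\Gamma((3N/2) + 1)} = \frac{(2M\pi)^{3N/2}\, E_0^{(3N/2)-1}\, \delta E_0}{\Gamma(3N/2)} \tag{7.128}$$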


That part of the volume corresponding to $p_1^2/2M$ in the range $(E_1, E_1 + dE_1)$
is

$$V_1 = \frac{d}{dE_0} V_{3(N-1)}\big(2M(E_0 - E_1)\big)\, \delta E_0\; d^3p_1 = \delta E_0\, \frac{\pi^{3(N-1)/2}}{\Gamma(3(N-1)/2)}\, (2M)^{3(N-1)/2}\, (E_0 - E_1)^{[3(N-1)/2]-1}\; 2\pi (2M)^{3/2} E_1^{1/2}\, dE_1 \tag{7.129}$$


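Therefore the probability that the energy of the first particle lies in the
range $(E_1, E_1 + dE_1)$ is

$$P(E_1)\, dE_1 = \frac{V_1}{V_T} = \frac{2}{\sqrt{\pi}}\, \frac{\Gamma(3N/2)}{\Gamma[3(N-1)/2]}\, \frac{(E_0 - E_1)^{[3(N-1)/2]-1}\, E_1^{1/2}}{E_0^{(3N/2)-1}}\, dE_1 \tag{7.130}$$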
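
A numerical look at (7.130) may help before attempting the problems that follow. The sketch below is a rough illustration only: the function names, the midpoint quadrature, and the sample values $N = 5$, $E_0 = 10$ are assumptions made here, not taken from the text. It evaluates the density through logarithms of gamma functions (to avoid overflow at large $N$) and integrates it to estimate the normalization, the mean, and the spread.

```python
import math


def energy_density(e1, e0, n):
    """P(E_1) of (7.130) for n particles sharing a total energy e0."""
    if not 0.0 < e1 < e0:
        return 0.0
    beta = 1.5 * (n - 1)  # this is 3(N-1)/2
    log_p = (math.log(2.0 / math.sqrt(math.pi))
             + math.lgamma(1.5 * n) - math.lgamma(beta)
             + (beta - 1.0) * math.log(e0 - e1)
             + 0.5 * math.log(e1)
             - (1.5 * n - 1.0) * math.log(e0))
    return math.exp(log_p)


def integrate(func, a, b, steps=20000):
    """Midpoint-rule quadrature of func over (a, b)."""
    h = (b - a) / steps
    return h * sum(func(a + (j + 0.5) * h) for j in range(steps))


def energy_moments(e0, n):
    """Return (normalization, mean, standard deviation) of E_1."""
    norm = integrate(lambda e: energy_density(e, e0, n), 0.0, e0)
    mean = integrate(lambda e: e * energy_density(e, e0, n), 0.0, e0)
    second = integrate(lambda e: e * e * energy_density(e, e0, n), 0.0, e0)
    return norm, mean, math.sqrt(second - mean * mean)


if __name__ == "__main__":
    e0, n = 10.0, 5  # illustrative values only
    print(energy_moments(e0, n))
    print("E_0 / N =", e0 / n)
```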


Problem 7-18: Verify directly that $\int P(E_1)\, dE_1 = 1$.


Problem 7-19: Show that $\langle E_1 \rangle = E_0/N$.
Problem 7-20: Show that

$$\Delta E_1 = E_0 \left[\frac{2(N - 1)}{N^2 (3N + 2)}\right]^{1/2}$$

(Note that the $\sqrt{N}$ law does not apply to this problem.) Prove that the
central limit theorem is not valid for this problem. Why?


Problem 7-21: Show that, for large $N$, the distribution
becomes approximately the Maxwell distribution

$$P(E_1) \approx \frac{2}{\sqrt{\pi}} \left(\frac{3N}{2E_0}\right)^{3/2} e^{-3NE_1/2E_0}\, E_1^{1/2}$$


12. Stochastic Processes 


By a random or stochastic process $x(t)$ we mean a process in which
the variation of $x$ with $t$ does not follow a definite law, but rather contains
an element of chance. We designate $x(t)$ as a “signal.” The quantity of
greatest interest in such a signal is usually its “power spectrum” or
intensity distribution, which is $|\varphi(\omega)|^2$, $\varphi$ being the Fourier transform of
$x(t)$. This power spectrum is related to the autocorrelation function


C(r) = ae . -£9 (7.131) 




introduced in Chapter 4, by the equation (4.58): 
|p(@)|? = 5— F (C(r)) (7.132) 


$\mathcal{F}(\ )$ designating the Fourier transform of the quantity in the brackets.
Since $n^2 = \int x^2(t)\, dt$ by definition, it in fact cancels out of (7.132).

The preceding considerations apply to any signal. The interest in 
their application to random ones is that it is often relatively easy to 
determine the autocorrelation function, though not x(t) directly, since 
x(t) is not analytically defined. 

An example of a random signal to which the theory can be applied is
the following: $x(t)$ is either equal to $x_0$ or $-x_0$, the probability of changing
from one to the other being $\lambda$ per unit time. Because $x^2(t) = x_0^2$, which is
not integrable, we take the signal to be zero outside the interval $(-T/2,
T/2)$; however, $T$ may be taken to be as large as we wish.

Although the value of this signal cannot be specified at any given
time, it is rather easy to calculate its autocorrelation function. For in a
time $\tau$, the probability of $n$ changes of sign is given by the Poisson
distribution $[(\lambda\tau)^n/n!]\, e^{-\lambda\tau}$. Now if $n$ is even the product $x(t)x(t+\tau)$ is $a^2$
(writing $a$ for $x_0$); if $n$ is odd, it is $-a^2$. The autocorrelation function is,
therefore, given by

$$C(\tau) = a^2 e^{-\lambda\tau} \left[\sum_{n\ \text{even}} \frac{(\lambda\tau)^n}{n!} - \sum_{n\ \text{odd}} \frac{(\lambda\tau)^n}{n!}\right] \tag{7.133}$$
The two sums, combined, are simply the expansion of $e^{-\lambda\tau}$ for $\tau > 0$.
Similar considerations hold for $\tau < 0$. Therefore,

$$C(\tau) = a^2 e^{-2\lambda|\tau|} \tag{7.134}$$
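
The step from the parity-weighted Poisson sum to the closed form $a^2 e^{-2\lambda|\tau|}$ is easy to confirm numerically. The following is a small sketch, with a hypothetical function name and an arbitrary cutoff `n_max` chosen for illustration: it sums the Poisson probabilities of $n$ sign changes with weight $+a^2$ for even $n$ and $-a^2$ for odd $n$.

```python
import math


def telegraph_autocorrelation(tau, lam, a, n_max=200):
    """Parity-weighted Poisson sum for the two-valued random signal.

    An even number of sign changes in the interval |tau| contributes +a^2,
    an odd number contributes -a^2.
    """
    lt = lam * abs(tau)
    term = math.exp(-lt)  # Poisson probability of n = 0 changes
    total = 0.0
    for n in range(n_max + 1):
        total += term if n % 2 == 0 else -term
        term *= lt / (n + 1)  # Poisson probability of n + 1 changes
    return a * a * total


if __name__ == "__main__":
    tau, lam, a = 0.7, 1.3, 2.0  # illustrative values only
    print(telegraph_autocorrelation(tau, lam, a))
    print(a * a * math.exp(-2.0 * lam * abs(tau)))
```

The two printed values should coincide to rounding error for moderate $\lambda|\tau|$; for very large $\lambda|\tau|$ the alternating sum loses accuracy and the closed form is preferable.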
The power spectrum is then
ob? (w) = F (ear ) 
2a? | ° , 
= ——; Re e AT e-iwt dr 
(27)? 0 
az dX 


~~ PANE + eo? (7.135) 


This is known as a “Lorentzian” spectrum. 


Problem 7-22: A random signal is generated in the
following way: A time interval $(-T/2, T/2)$ is divided into intervals of
length $t_0$. On each, $x(t)$ is equally likely to have the value $a$ or $-a$. Show
that the power spectrum is


é 2 
5 _ a. ,@lo 
$*(c) a> 2 




Problem 7-23: We add the following problem on 
probabilities as a diversion: A and B agree to play a game of chance with 
blank cards, the rules of which are as follows: (1) on N cards, A writes any 
arbitrary set of positive numbers, all different; B is not permitted to see 
what A has written. (2) The cards are shuffled, and A turns them over one 
at a time from the top of the pack. The aim is for B to try to name the 
highest-numbered card at the time at which it is turned up. 

If B were to bet on his ability to choose correctly the highest card, 
what odds do you believe that he should demand to have a better than 
even chance of winning? How do you think these odds should depend on 
the total number of cards? 

B chooses the strategy of letting $n_0$ cards go by, noting which of them
is the largest. He then makes as his choice the first card subsequently
turned up which is larger than that one. Show that the optimal value of $n_0$
is approximately $N/e$. What are his chances of success in this case?

[Try this out on your friends before they have a chance to work it 
out. You may find it very profitable. ] 


PRELUDE TO CHAPTER 8 


In this chapter we introduce the mathematical foun- 
dations of quantum mechanics. The approach, and the 
notation, are essentially those of Dirac. The under- 
lying features were introduced in Chapter 2 (Linear 
Vector Spaces). 

We first discuss the basic assumptions of quantum 
mechanics in terms of linear vector spaces, the 
representation of dynamical variables by Hermitian 
operators, of quantum states by eigenvectors of these 
operators, and of the results of measurement by their 
eigenvalues. The probabilistic hypothesis is intro- 
duced, and quantum mean value defined. The signifi- 
cance of commutators in relation to the simultaneous 
measurability of two variables is shown, and the 
generalized uncertainty principle derived. 

Unitary operators are defined, and it is shown that 
the eigenvalue spectrum is unchanged under a unitary 
transformation. 

We next show how to adapt the theory for vari- 
ables with a continuous spectrum. The most important 
such variables are coordinates. The coordinate repre- 
sentation is then used to introduce the Schrodinger 
wave functions (as the amplitudes of coordinate eigen- 
states). Operators corresponding to classical dynami- 
cal variables are derived. 

The momentum representation is then introduced;
it is shown that this is very similar in structure to the 
coordinate representation, and is related to it by a 
Fourier transformation. 

We next turn to dynamics, that is, the theory of 
the time evolution of quantum systems. The time 
evolution operator is shown to be a unitary operator. 
The Schrodinger representation (time-dependent 
state vectors, constant operators) and the Heisenberg 
representation (constant state vectors, time-dependent 
operators) are introduced and shown to be equivalent. 
We then develop the theory of the “interaction
representation,” which lies between the two. It is used to
write in an explicit (but not very useful) way, the state
vector at an arbitrary time.

We return to the time evolution operator to study 
the history of a system with a given (nonsoluble) 
Hamiltonian. The concept of a “propagator” is intro- 
duced through the Schrodinger wave mechanics, and 
calculated for free particles. A simple diagrammatic 
description is given for the evolution of a system acted 
upon by a perturbing potential. 

The next sections are concerned with time- 
dependent perturbation theory, but from a rather 
general viewpoint. Time-independent perturbations 
are considered first. Starting with the differential 
equation of the time-evolution operator, we make a 
Fourier transformation and obtain an algebraic opera- 
tor equation for the transform. A formal solution can 
be found as a sort of perturbation series. Several 
alternative formulations are given. 

A practical obstacle to the use of the perturbation 
series for purposes of calculation is that mathematical 
divergences occur associated with transitions back 
into the original state in its subsequent history. Using 
projection operators into and out of this state, we find, 
after some fairly complicated analysis, that all “‘closed 
loop’’ processes, in which the system returns to its 
initial state, can be incorporated into a “renormaliza- 
tion” of that state; a perturbation expansion may then 
be developed ignoring such processes. Finally, by 
taking inverse Fourier transforms, formal expressions 
for the matrix elements of the evolution operator 
between the initial state and arbitrary final states may 
be given. 

We then set about to calculate the total transition 
probability of a system in an arbitrary time interval. 
Working only to the second order, we find explicitly 
the shift in energy of the initial state and its “width” 
(probability per unit time of decay) as a result of the 
closed loop processes. The decay is in particular 
shown to be exponential. 

We then determine the transition probability to a
definite final state. For sufficiently large values of $t$, it
is found that the probability peaks for states whose
energy is close to that of the initial state. When we
then sum, or integrate, over all final states, we obtain a
formula for the total transition probability in terms of a
perturbation expansion.

It is pointed out that this, rather than the second-order
calculation carried out earlier, should be used to
determine the width of the initial state. 

We next carry through the theory for a time- 
dependent perturbation, specifically, for a periodic 
one. An explicit formula is obtained for the lowest 
order of perturbation. Broadening of the initial level is
here neglected. 

Finally, we turn to the theory of the perturbation 
of stationary states, when exact states cannot be 
determined. Complete perturbation series are derived 
using a method based on projection operators. Two 
different forms are given, that of Rayleigh-Schrodinger
and that of Wigner-Brillouin. The first gives the
perturbation energy explicitly; the second gives it
implicitly. The Wigner-Brillouin theory is usually (but
not always) more accurate to a given order. 

Special attention is devoted to the case of the 
perturbation of degenerate states. 


REFERENCES 
Davydov, A., Quantum Mechanics. New York: Pergamon, 1965.
Gottfried, K., Quantum Mechanics. New York: Benjamin, 1965, Vol. 1. 
Merzbacher, E., Quantum Mechanics. New York: Wiley, 1961. 
Messiah, A., Mécanique Quantique. Paris: Dunod, 1958. 


FUNDAMENTAL 
PRINCIPLES OF 
QUANTUM 
MECHANICS 


“It is agayns the proces of Nature” 
Chaucer, The Frankeleyn’s Tale 


1. Introduction 


The mathematical foundations required for the formulation of quan- 
tum mechanics were laid down in Chapter 2 (Linear Vector Spaces). In 
this and the following chapters we develop the basic structure of quantum 
mechanics, and apply it to a selection of representative problems. We 
cannot, of course, in a book of such a broad scope as this one, give a 
“complete” account of quantum mechanics. Such treatments can, in any 
case, be found in comprehensive treatises on quantum theory (such as, for 
example, that of Messiah). Our intention is, rather, to expose the mathe- 
matical structure of the theory, and to call attention to certain concepts 
and techniques of wide use in modern physics. Nor do we concern our- 
selves with problems of “rigor,” such as are dealt with in von Neumann’s 
classic work. Our aim is to provide a practical handbook for the use of 
those who wish to comprehend and to undertake quantum calculations, 
rather than a critical examination of fundamentals. Our approach is also 
more in the spirit of “mathematical physics” than of “theoretical physics” 
in that it is concerned more with fundamental techniques than with the 
evolving concepts in terms of which we attempt to understand various 
aspects of the physical world. 




Finally, it must be emphasized that our selection of topics for dis- 
cussion is somewhat arbitrary. The selection is designed to introduce the 
reader to as many as possible of the techniques and viewpoints which are 
likely to be met by physicists in any of the special areas of current interest. 


2. Hypotheses of Quantum Mechanics 


The underlying structure of the theory is based on the representation
of “states” of a system by (i.e., in one-to-one correspondence with)
vectors in some suitably chosen vector
space. Measurements of the attributes of these states (“dynamical
variables” or “observables”) are described in terms of operations on these
vectors. These operations are assumed to be linear. The operation of an
operator on a vector is intended to describe a physical operation (meas- 
urement) on the system. The relation between the mathematical operation 
and the physical “measurement” is assumed to be the following: that the 
result of a “measurement” of a dynamical variable must be an eigenvalue 
of the linear operator representing that dynamical variable. The state in 
which the dynamical variable has that value is represented by the
corresponding eigenvector. The above statement is, of course, an imprecise
one, and raises many questions. For instance, what is the vector space
used to describe a given physical system? How are the operators repre- 
senting given dynamical variables to be chosen? How does one describe 
the time evolution of a dynamical system? These, and many other ques- 
tions must be answered if we are to formulate a complete working theory. 
What we have done, however, is to specify the framework within which
the theory is to be developed; a framework having its origin in the 
historical evolution of the subject. The historical process was, however, a 
tortuous one, marked by contradiction (experimental data in conflict with 
then existing theoretical concepts), hypothesis (often ad hoc, sometimes 
inspired), synthesis and generalization. The end product is much clearer 
and more coherent than the process by which it was constructed. We, 
therefore, forego historical analysis and build our structure inductively 
but with the wisdom of hindsight, in order to avoid wasting time on transi- 
tional steps and misleading sidetracks. 

Let us, therefore, set about to construct, within our specified general 
framework, a detailed theory. 

Consider first the question of the vector space by which a physical 
system can be represented. The major point to be made here is that this 
space is determined, not by mathematical prescription, but by physics. 
Since states of systems are to be represented by vectors, the vector space 
must be determined by the totality of attributes of the system. This is, of 
course, a function of our understanding, and must be modified from time
to time in the light of discovery. This point may be readily illustrated by
reference to the phenomenon of electron spin. If (to keep matters simple) 
we consider as our system the hydrogen atom, prior to the discovery of 
spin the vector space needed to describe the system would have been 
one based on the independent states of the atom as specified by three 
“quantum numbers” specifying the energy of the atom, its total angular 
momentum, and one component thereof. However, once it was recog- 
nized that the electron possessed not only these characteristics but also 
intrinsic angular momentum (“‘spin’’), corresponding to each previously 
known state it was now necessary to recognize two, corresponding to the 
two states of electron spin which were found to exist. Thus, the recogni- 
tion of a new physical attribute made it necessary to double the “‘dimen- 
sionality” of the vector space used to describe the system. That is, at each 
point of the original “‘space’’ it was now necessary to construct a two- 
dimensional manifold. Such a situation is of course well known to mathe- 
maticians, in whose language one says that the new vector space was the 
“direct product” of the original one and the new two-dimensional one. 
From every combination of one vector of the original space and one of the 
“spin space” we construct a new vector of the product space. 

The next question to be answered is, how does one choose the 
operators to represent the various dynamical variables? 

Let us reformulate the problem in a more meaningful way. If we knew 
the complete spectrum of a dynamical variable, that is, the array of values 
which could result from a measurement of the variable, we could easily 
set up a set of vectors and a corresponding operator. Consider, for 
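example, vectors represented by column matrices. If $k_n$ were the possible
results of a measurement of the dynamical variable $K$, the matrix

$$K = \begin{pmatrix} k_1 & 0 & 0 & 0 & \cdots \\ 0 & k_2 & 0 & 0 & \cdots \\ 0 & 0 & k_3 & 0 & \cdots \\ \vdots & \vdots & \vdots & \ddots & \end{pmatrix} \tag{8.1}$$

could be used to represent the dynamical variable $K$.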

The eigenvalues would then be the results of measurement, as 
required; the eigenvectors, representing the corresponding states, would 
be those vectors which have all but one component equal to zero. Con- 
jugate vectors could be represented by row matrices, as shown in Chapter 
2. The squared magnitude of such a vector is in each case the squared
modulus of the nonzero element. If the nonzero element is chosen to be unity, the
magnitude is unity, and the vectors are normalized.

These eigenvectors can be used as a basis for a vector space, in
which the general vector is

$$\begin{pmatrix} a_1 \\ a_2 \\ a_3 \\ \vdots \end{pmatrix} = a_1 \begin{pmatrix} 1 \\ 0 \\ 0 \\ \vdots \end{pmatrix} + a_2 \begin{pmatrix} 0 \\ 1 \\ 0 \\ \vdots \end{pmatrix} + a_3 \begin{pmatrix} 0 \\ 0 \\ 1 \\ \vdots \end{pmatrix} + \cdots \tag{8.2}$$

[These vectors in general have an infinite number of components.]
If the $a$'s are complex numbers, we have a complex vector space. This
is required to formulate the quantum theory.

We have interpreted the “unit vectors” as representing states in 
which the dynamical variable K has specific values. What, then, is the 
meaning of a general vector; what sort of state does it represent? If we 
designate the eigenvectors (eigenstates) of K as “pure states’ with 
respect to this dynamical variable, a general state is a superposition of 
pure states. It is necessary to interpret the components of the vector in 
relation to the measurement of the dynamical variable. For this purpose, 
we introduce a new and fundamental hypothesis. If we normalize the 
vector, that is to say, multiply it by such a constant value that the square 
of the magnitude of the resulting vector is unity, this hypothesis may be 
stated in the following terms: that the squares of the magnitudes of the 
components (that is, of the coefficients of the eigenvectors) represent the 
probabilities that a measurement of the dynamical variable K on the 
system in question will yield the corresponding eigenvalue. Thus, the 
probability that the result of a measurement yields the value $k_n$ is given
by $|a_n|^2$. The normalization of the vector assures the condition

$$\sum_n |a_n|^2 = 1 \tag{8.3}$$
that the probabilities add up to unity.


In the notation of Dirac, as outlined in Chapter 2, the dynamical 
variable (operator) $K$ has eigenvectors $|k_n\rangle$ and eigenvalues $k_n$; that is,

$$K|k_n\rangle = k_n |k_n\rangle \tag{8.4}$$


There is, however, a possibility that there is more than one state in which
the operator $K$ has the eigenvalue $k_n$. In this case, the system is said to
have a degenerate spectrum with respect to $K$. If there were several such
states, they would have to be differentiated by some other attribute. Thus,
there would have to be another dynamical variable or observable of the
system such that the result of operating on a member of the degenerate
set of eigenvectors of $K$ with its operator would produce different
results. Taking account of the totality of independent states of the
system, as specified by all its attributes, the operator (8.1) may have to be
expanded so that some of the eigenvalues $k_n$ are repeated. The expanded
set of basis vectors delineates all the possible states of the system. They
also form a complete orthonormal set. An arbitrary state vector for the
system, $|\Psi\rangle$, is expansible in terms of these states, which we again
designate $|k_n\rangle$:

$$|\Psi\rangle = \sum_n |k_n\rangle \langle k_n|\Psi\rangle \tag{8.5}$$

and the probability that a measurement of the dynamical variable $K$ in
the state $|\Psi\rangle$ yields $k_n$ is $|\langle k_n|\Psi\rangle|^2$.

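We now demonstrate the following result: that the mean or expectation
value of the variable $K$ in the state $|\Psi\rangle$ is

$$\bar{K} = \langle\Psi|K|\Psi\rangle \tag{8.6}$$
This follows immediately from the expansion (8.5):
$$\langle\Psi|K|\Psi\rangle = \sum_{m,n} \langle\Psi|k_m\rangle \langle k_m|K|k_n\rangle \langle k_n|\Psi\rangle$$

But
$$\langle k_m|K|k_n\rangle = k_n \langle k_m|k_n\rangle = k_n \delta_{nm}$$
by virtue of the orthogonality of the states $|k_n\rangle$. Therefore,
$$\bar{K} = \sum_{m,n} \langle\Psi|k_m\rangle\, k_n \delta_{nm}\, \langle k_n|\Psi\rangle = \sum_n k_n \langle\Psi|k_n\rangle \langle k_n|\Psi\rangle = \sum_n k_n |\langle k_n|\Psi\rangle|^2 \tag{8.7}$$

by virtue of (2.11). But (8.7) defines the mean value.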
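
The probabilistic hypothesis and the mean value (8.7) can be illustrated with a few lines of code. The sketch below is only an illustration under assumptions of our own: the function names, the three sample eigenvalues, and the sample amplitudes are hypothetical. It normalizes a vector of complex amplitudes, forms the probabilities $|a_n|^2$, and evaluates $\langle\Psi|K|\Psi\rangle$ for an operator that is diagonal in its own eigenbasis, as in (8.1).

```python
import math


def normalize(amplitudes):
    """Scale the amplitudes so that the sum of |a_n|^2 equals 1, as in (8.3)."""
    norm = math.sqrt(sum(abs(a) ** 2 for a in amplitudes))
    return [a / norm for a in amplitudes]


def measurement_probabilities(amplitudes):
    """Probabilities |a_n|^2 of obtaining each eigenvalue k_n."""
    return [abs(a) ** 2 for a in normalize(amplitudes)]


def expectation_value(eigenvalues, amplitudes):
    """<Psi|K|Psi> for K diagonal with the given eigenvalues.

    Computed as the row vector of conjugated amplitudes times diag(k)
    times the column vector of amplitudes.
    """
    psi = normalize(amplitudes)
    return sum((a.conjugate() * k * a).real for k, a in zip(eigenvalues, psi))


if __name__ == "__main__":
    eigenvalues = [1.0, 2.0, 3.0]       # illustrative spectrum
    amplitudes = [1.0, 1j, 1.0 + 1j]    # unnormalized, illustrative
    probs = measurement_probabilities(amplitudes)
    print("probabilities:", probs)
    print("sum of k_n |a_n|^2:", sum(k * p for k, p in zip(eigenvalues, probs)))
    print("<Psi|K|Psi>:", expectation_value(eigenvalues, amplitudes))
```

The last two printed numbers are the two sides of (8.7) and should agree.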

By Theorem B of Chapter 2, the fact that the expectation value (8.7) 
is real permits us to conclude that K is a Hermitian operator. K is in 
fact any operator representing a dynamical variable (and which must 
therefore have real eigenvalues). Consequently, all physical dynamical 
variables must be represented by Hermitian operators. 

All of the foregoing results are valid for any particular dynamical 
variable. The problem which confronts us in constructing a theory is, 
however, the following: We must specify how the operators representing 
other dynamical variables operate on the eigenstates of K (or the con- 
verse). 

It is convenient to formulate this problem in terms of the commutator 


$$[K_1, K_2] = K_1 K_2 - K_2 K_1 \tag{8.8}$$

of pairs of operators $K_1$ and $K_2$. We begin by proving the following
theorem:
The necessary and sufficient condition that $K_1$ and $K_2$ possess
common eigenstates is that their commutator be zero, that is,
$$K_1 K_2 = K_2 K_1 \tag{8.9}$$


It is easily seen that if the states $|n\rangle$ are common eigenstates of $K_1$
and $K_2$, having the respective sets of eigenvalues $k_{1n}$ and $k_{2n}$, Eq. (8.9) is
satisfied for all operands. For since, by hypothesis, the vectors $|n\rangle$
comprise the totality of physically realizable states for the system, they form a
complete set. Hence any state vector $|\Psi\rangle$ may be expanded in terms of
them,

$$|\Psi\rangle = \sum_n |n\rangle \langle n|\Psi\rangle \tag{8.10}$$

But then
$$K_1 K_2 |\Psi\rangle = K_1 K_2 \sum_n |n\rangle \langle n|\Psi\rangle = K_1 \sum_n |n\rangle \langle n|\Psi\rangle\, k_{2n} = \sum_n |n\rangle \langle n|\Psi\rangle\, k_{1n} k_{2n}$$

But $K_2 K_1 |\Psi\rangle$ obviously has the same value. Thus we have shown that
$K_1$ and $K_2$ commute.

Consider now the converse proposition, that if $K_1$ and $K_2$ commute,
they must possess a common set of eigenvectors. The proof is again
immediate except in the case of states which belong to a degenerate
subset with respect to both $K_1$ and $K_2$. Excluding this case, suppose that
state $|n\rangle$ is a nondegenerate eigenstate of $K_1$. Then we know that

$$K_1 (K_2 |n\rangle) = K_2 K_1 |n\rangle = k_{1n} (K_2 |n\rangle)$$

Thus, $K_2|n\rangle$ is also an eigenstate of $K_1$ with eigenvalue $k_{1n}$. But since $|n\rangle$
is nondegenerate with respect to $K_1$, this must be in fact the state $|n\rangle$
itself, or rather, a multiple of it. Therefore,

$$K_2 |n\rangle = \text{constant}\; |n\rangle$$

which says that $|n\rangle$ is also an eigenstate of $K_2$. Since $K_2$ is Hermitian, the
constant must be real.

If $|n\rangle$ were nondegenerate with respect to $K_2$, whether or not it was
degenerate with respect to $K_1$, the above argument could be made with the
roles of $K_1$ and $K_2$ interchanged.

Consider then the only remaining case: Suppose that states $|n_\alpha\rangle$
($\alpha = 1, 2, \ldots, p$, say) form a degenerate (but orthogonal) set of
eigenstates of $K_1$ with eigenvalue $k_{1n}$, which are also degenerate with respect to
$K_2$. But then any linear combination

$$|n\rangle = \sum_\alpha g_\alpha |n_\alpha\rangle \tag{8.11}$$

is an eigenstate of both $K_1$ and $K_2$. Our theorem is therefore proven.
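
A small finite-dimensional example may make the theorem concrete. The sketch below is illustrative only: the $2 \times 2$ real symmetric matrices `K1`, `K2`, `K3` and the helper names are hypothetical choices, not taken from the text. `K1` and `K2` commute and share the eigenvectors $(1, 1)$ and $(1, -1)$; `K3` does not commute with `K1`, and the eigenvector $(1, 1)$ of `K1` is not an eigenvector of `K3`.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    size = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]


def commutator(a, b):
    """[a, b] = ab - ba, as in (8.8)."""
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[i][j] - ba[i][j] for j in range(len(a))] for i in range(len(a))]


def apply(op, vec):
    """Action of a matrix on a column vector."""
    return [sum(op[i][j] * vec[j] for j in range(len(vec))) for i in range(len(op))]


def eigenvalue_if_eigenvector(op, vec, tol=1e-12):
    """Return the eigenvalue if vec is an eigenvector of op, otherwise None."""
    image = apply(op, vec)
    pivot = max(range(len(vec)), key=lambda i: abs(vec[i]))
    lam = image[pivot] / vec[pivot]
    if all(abs(image[i] - lam * vec[i]) <= tol for i in range(len(vec))):
        return lam
    return None


K1 = [[0, 1], [1, 0]]    # hypothetical operator
K2 = [[2, 1], [1, 2]]    # commutes with K1
K3 = [[1, 0], [0, -1]]   # does not commute with K1

if __name__ == "__main__":
    print("[K1, K2] =", commutator(K1, K2))
    print("[K1, K3] =", commutator(K1, K3))
    for vec in ([1, 1], [1, -1]):
        print(vec, eigenvalue_if_eigenvector(K1, vec),
              eigenvalue_if_eigenvector(K2, vec),
              eigenvalue_if_eigenvector(K3, vec))
```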

Let us now interpret physically the above result. Let us call an
eigenstate of $K_1$ a “pure state” with respect to this dynamical variable. If $K_1$
and $K_2$ commute, this can also be a “pure state” with respect to $K_2$. Thus,
measurements of $K_1$ on a system in this state are certain to produce a
value $k_1$ while measurements of $K_2$ are certain to produce a value $k_2$.
Measurements of these two variables do not “interfere” with each other.

Consider, however, the alternative possibility, in which $|\Psi_n\rangle$ is an
eigenstate of $K_1$ with eigenvalue $k_{1n}$, but is not an eigenstate of $K_2$ due to
the fact that $K_1$ and $K_2$ do not commute. Then $|\Psi_n\rangle$ may be expanded in
terms of the eigenstates $|\varphi_m\rangle$ of $K_2$, whose eigenvalues are $k_{2m}$,

$$|\Psi_n\rangle = \sum_m |\varphi_m\rangle \langle\varphi_m|\Psi_n\rangle \tag{8.12}$$

Then, the result of a measurement of $K_2$ in an eigenstate of $K_1$ must
be expressed in terms of probabilities: In the state in which $K_1$ is known
to have the value $k_{1n}$, the probability that $K_2$ has the value $k_{2m}$ is
$|\langle\varphi_m|\Psi_n\rangle|^2$
(we have of course assumed each set of eigenvectors to be normalized).
Similarly, if we select a state in which the value of $K_2$ is known, it is not
“pure” with respect to $K_1$, and various results of a measurement of $K_1$ are
possible, only their respective probabilities being determined. In this case
we say that the measurements of these two variables “interfere” with
each other.

If a measurement of $K_1$ produces the value $k_{1n}$, then conversely, we can surmise that the system is in the state $|\Psi_n\rangle$. Thus, the measurement has, in a sense, prepared the system in that state, and the “measurement” of that variable has “selected” the system in the corresponding state. This selection is represented mathematically by the operation of projection on the eigenstate. If we start with a state

$$|\Psi\rangle = \sum_n |\Psi_n\rangle \langle\Psi_n|\Psi\rangle \tag{8.13}$$

after the measurement which has given the value $k_{1n}$ we have produced the state represented by the term $|\Psi_n\rangle \langle\Psi_n|\Psi\rangle$ in the expansion. The “projection operator”

$$P_n = |\Psi_n\rangle \langle\Psi_n| \tag{8.14}$$

is then the mathematical operator describing the effect of the measurement on the system.

A simple illustration of the interference of measurements arises in discussion of the Stern-Gerlach experiment. This experiment demonstrates the existence of an intrinsic magnetic moment (and therefore angular momentum) for an electron. One may discuss this experiment in terms of a single electron, taking account of the probabilities of different contingencies. Alternatively, one may consider an ensemble, or beam, of electrons with the same momentum, treating them, however, as though they were isolated systems. As is seen later, the electron can exist in two spin, or magnetic moment, states. Its component of spin in an arbitrary direction, which we call the z direction, may be $\tfrac{1}{2}\hbar$ or $-\tfrac{1}{2}\hbar$; the magnetic moment is $e/mc$ times these values, $e$ being the electron charge, $m$ its mass, and $c$ the velocity of light. $\hbar$ is the Planck constant divided by $2\pi$.

If we take a beam of electrons, with random spin states, and pass 
them through a magnetic field in the z direction, those with opposite 
spins interact differently with the field. Those whose magnetic moments 
in the z direction are positive have their energy lowered in the field, while 
those whose magnetic moments are negative have theirs raised. If the 
field is inhomogeneous, the former will be deflected in the direction of 
increasing field, the latter in that of decreasing field. This results in a 
splitting of the beam; it in fact performs a physical separation of the 
electrons in the two different spin states into two distinct beams. 

The electrons in the two beams now are individually “pure” with
respect to the z component of spin, or magnetic moment. Let us now try 
to measure another component (say the x component) of the spin, by 
similarly passing one of these beams through an inhomogeneous magnetic 
field in the x direction. If the x and z components of spin commuted, the 
beam would be “pure” also with respect to the x component of spin, and
no further splitting would occur. In fact, however, it is found that this 
beam is in turn split into two statistically equal parts. Thus, electrons 
whose z component of spin is known seem to have equal probability of 
having one or the other x component. 

If now we consider one of the beams split by this second field, the 
electrons in it will be in an eigenstate of the x component of spin. This 
beam will, then, again split equally when put through a second field in the
z direction. Although a selection was originally made of electrons with a 
given z component of spin, the measurement of the x component so dis- 
turbed the beam that after the measurement the two values of z com- 
ponent were equally likely. 
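
The sequence of splittings described above can be checked with a minimal numerical sketch. It assumes the standard two-component representation of spin-1/2 states in the basis of z-component eigenstates, which is only introduced later in the text; the names `UP_Z`, `UP_X`, `inner`, and `probability` are illustrative and not part of the original.

```python
import math

# Assumed representation: components of each state in the S_z eigenbasis.
UP_Z = (1 + 0j, 0j)
DOWN_Z = (0j, 1 + 0j)
UP_X = (1 / math.sqrt(2) + 0j, 1 / math.sqrt(2) + 0j)
DOWN_X = (1 / math.sqrt(2) + 0j, -1 / math.sqrt(2) + 0j)


def inner(bra, ket):
    """Scalar product <bra|ket> of two two-component vectors."""
    return sum(b.conjugate() * k for b, k in zip(bra, ket))


def probability(outcome, state):
    """Probability |<outcome|state>|^2 of selecting `outcome` from `state`."""
    return abs(inner(outcome, state)) ** 2


# Beam selected with spin up along z, then analysed along x.
p_x_up = probability(UP_X, UP_Z)
p_x_down = probability(DOWN_X, UP_Z)

# Sub-beam selected with spin up along x, then analysed along z again.
p_z_up_again = probability(UP_Z, UP_X)
p_z_down_again = probability(DOWN_Z, UP_X)

print(p_x_up, p_x_down, p_z_up_again, p_z_down_again)
```

Each of the four probabilities comes out equal to one half within rounding, which is the equal splitting described in the text: the selection along x erases the earlier selection along z.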

The above discussion serves to indicate the importance in quantum 
mechanics of the commutator of two operators. As we have seen, if 
they commute, they may simultaneously be ascribed exact values. Let 
us now see in a more precise quantitative way the consequences of having
a nonzero commutator. 




Let us note first that the commutator as defined by Eq. (8.8) is not a Hermitian operator. To see this, let us first evaluate a general matrix element of the product $K_1 K_2$,

$$\langle n|K_1 K_2|n'\rangle = \sum_{n''} \langle n|K_1|n''\rangle \langle n''|K_2|n'\rangle \tag{8.15}$$

In the case of a Hermitian operator, as was seen in (2.60), any matrix element is equal to its transpose conjugate. But the transpose conjugate of (8.15) is

$$\langle n'|K_1 K_2|n\rangle^* = \sum_{n''} \langle n'|K_1|n''\rangle^* \langle n''|K_2|n\rangle^*$$

which is, since $K_1$ and $K_2$ are Hermitian,

$$\sum_{n''} \langle n|K_2|n''\rangle \langle n''|K_1|n'\rangle = \langle n|K_2 K_1|n'\rangle$$

Similarly, the transpose conjugate of $\langle n|K_2 K_1|n'\rangle$ is $\langle n|K_1 K_2|n'\rangle$. Therefore,

$$\langle n'|K_1 K_2 - K_2 K_1|n\rangle = -\langle n|K_1 K_2 - K_2 K_1|n'\rangle^* \tag{8.16}$$

We may, however, very easily create a Hermitian operator by multiplying the commutator by $i$; that is,

$$C = i(K_1 K_2 - K_2 K_1) = i[K_1, K_2] \tag{8.17}$$

is Hermitian.


3. Uncertainty Principle 


We now prove the following result, known as the Heisenberg uncertainty principle: The product of the uncertainties of two dynamical variables is greater than or equal to half the magnitude of the expectation value of their Hermitian commutator $C$, that is,

$$\Delta K_1 \, \Delta K_2 \ge \tfrac{1}{2}|\bar{C}| \tag{8.18}$$


We must, however, explain what we mean in this context by the “uncertainty” $\Delta K$ of a variable $K$. The meaning is precise: it is the root mean square deviation from the mean. In a state $|\Psi\rangle$, the mean value of $K$ is

$$\bar{K} = \langle\Psi|K|\Psi\rangle$$

The mean value of the operator $(K - \bar{K})^2$ is then the mean square deviation from the mean:

$$(\Delta K)^2 = \langle\Psi|(K - \bar{K})^2|\Psi\rangle \tag{8.19}$$

and so the uncertainty is

$$\Delta K = \sqrt{\langle\Psi|(K - \bar{K})^2|\Psi\rangle} \tag{8.20}$$

If we introduce the operator

$$K' = K - \bar{K} \tag{8.21}$$

it follows that

$$(\Delta K)^2 = \overline{K'^2}$$

Furthermore,

$$C = i(K_1 K_2 - K_2 K_1) = i(K_1' K_2' - K_2' K_1')$$

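The uncertainty relation (8.18) may now be proven as follows: If $|\Psi\rangle$ is an arbitrary vector and $\lambda$ is an arbitrary real constant, the magnitude of the vector

$$(K_1' + i\lambda K_2')|\Psi\rangle$$

is greater than or equal to zero for all $\lambda$. Thus

$$\langle\Psi|(K_1' - i\lambda K_2')(K_1' + i\lambda K_2')|\Psi\rangle \ge 0 \tag{8.22}$$

$K_1$ and $K_2$ being Hermitian. But from this it follows that

$$\lambda^2 (\Delta K_2)^2 + \lambda \bar{C} + (\Delta K_1)^2 \ge 0 \tag{8.23}$$

for all $\lambda$. This may be written

$$\left(\lambda \, \Delta K_2 + \frac{\bar{C}}{2\,\Delta K_2}\right)^2 + \left[(\Delta K_1)^2 - \frac{\bar{C}^2}{4(\Delta K_2)^2}\right] \ge 0$$

The inequality must therefore hold in particular for the value of $\lambda$ which makes the first term zero, viz.,

$$\lambda = -\frac{\bar{C}}{2(\Delta K_2)^2} \tag{8.24}$$

It follows that the second term is non-negative, so that

$$\Delta K_1 \, \Delta K_2 \ge \tfrac{1}{2}|\bar{C}|$$

as was to be shown.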
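
The inequality (8.18) can be spot-checked numerically. The sketch below is an illustration under stated assumptions, not part of the original argument: it takes $K_1$ and $K_2$ to be the x and y spin components of a spin-1/2 particle in units with $\hbar = 1$ (the 2x2 matrices `SX` and `SY` are the standard ones, introduced here only for the example) and scans a grid of normalized states.

```python
import math

# Assumed example operators: spin-1/2 components S_x and S_y with hbar = 1.
SX = ((0, 0.5), (0.5, 0))
SY = ((0, -0.5j), (0.5j, 0))


def matmul(a, b):
    """Product of two 2x2 matrices stored as nested tuples."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def expect(op, psi):
    """Expectation value <psi|op|psi> for a normalized two-component psi."""
    op_psi = [sum(op[i][j] * psi[j] for j in range(2)) for i in range(2)]
    return sum(psi[i].conjugate() * op_psi[i] for i in range(2))


def uncertainty(op, psi):
    """Root mean square deviation from the mean, as in (8.20)."""
    mean = expect(op, psi).real
    mean_sq = expect(matmul(op, op), psi).real
    return math.sqrt(max(mean_sq - mean * mean, 0.0))


def hermitian_commutator(a, b):
    """C = i(ab - ba), as in (8.17)."""
    ab, ba = matmul(a, b), matmul(b, a)
    return tuple(
        tuple(1j * (ab[i][j] - ba[i][j]) for j in range(2)) for i in range(2)
    )


def check(theta, phi):
    """Return (Delta K1 * Delta K2, |mean of C| / 2) for one spin state."""
    psi = (
        complex(math.cos(theta / 2)),
        math.sin(theta / 2) * complex(math.cos(phi), math.sin(phi)),
    )
    lhs = uncertainty(SX, psi) * uncertainty(SY, psi)
    rhs = 0.5 * abs(expect(hermitian_commutator(SX, SY), psi).real)
    return lhs, rhs


results = [check(0.1 * a, 0.3 * b) for a in range(32) for b in range(21)]
print(all(lhs >= rhs - 1e-12 for lhs, rhs in results))
```

For the state with spin along z (`check(0.0, 0.0)`) the two sides coincide, so in this example the bound is attained rather than merely respected.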

We see then from the preceding considerations, that the result of a 
measurement of a dynamical variable in a physical system “prepared” by 
measurement of another variable depends on the commutator of the two 
variables. The essential substantive hypothesis of quantum mechanics 
must then concern the specification of such commutators. 




It is necessary, in making this hypothesis explicit, to distinguish between “classical” and “nonclassical” dynamical variables. The former are defined in terms of the classical generalized coordinates of the system $q_i$ and their conjugate momenta $p_i$. Thus, such a variable is a function $K(q_i, p_i)$ of all these coordinates and momenta. Energy, position, momentum, and the angular momentum of a particle about a point are examples of such variables.

The commutation rules for such variables can be expressed in terms of the Poisson bracket of classical mechanics. This we define, for two variables $K_1(q_i, p_i)$ and $K_2(q_i, p_i)$, as

$$(K_1, K_2) = \sum_i \left( \frac{\partial K_1}{\partial q_i} \frac{\partial K_2}{\partial p_i} - \frac{\partial K_1}{\partial p_i} \frac{\partial K_2}{\partial q_i} \right) \tag{8.25}$$

This Poisson bracket is another dynamical variable. The basic hypothesis of quantum mechanics is then this: that the commutator $[K_1, K_2]$ is $i\hbar$ times the dynamical variable $(K_1, K_2)$,

$$[K_1, K_2] = i\hbar (K_1, K_2) \tag{8.26}$$

both sides being operators.¹
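
The classical side of (8.26) can be explored numerically. The following sketch estimates the Poisson bracket (8.25) by central differences; the function names, the step size `h`, and the choice of angular momentum components as test variables are assumptions made for the illustration.

```python
def poisson_bracket(f, g, q, p, h=1e-5):
    """Central-difference estimate of (f, g) from (8.25) at the point (q, p).

    f and g take two sequences (coordinates, momenta) and return a float.
    """
    def partial(func, which, i):
        # which = 0 differentiates with respect to q_i, which = 1 with respect to p_i
        qp = [list(q), list(p)]
        qp[which][i] += h
        plus = func(qp[0], qp[1])
        qp[which][i] -= 2 * h
        minus = func(qp[0], qp[1])
        return (plus - minus) / (2 * h)

    total = 0.0
    for i in range(len(q)):
        total += (partial(f, 0, i) * partial(g, 1, i)
                  - partial(f, 1, i) * partial(g, 0, i))
    return total


# Illustrative variables for one particle in three dimensions.
def coord_x(q, p):
    return q[0]


def mom_x(q, p):
    return p[0]


def ang_x(q, p):  # L_x = y p_z - z p_y
    return q[1] * p[2] - q[2] * p[1]


def ang_y(q, p):  # L_y = z p_x - x p_z
    return q[2] * p[0] - q[0] * p[2]


def ang_z(q, p):  # L_z = x p_y - y p_x
    return q[0] * p[1] - q[1] * p[0]


point_q, point_p = [0.3, -1.2, 0.7], [1.1, 0.4, -0.5]
pb_xp = poisson_bracket(coord_x, mom_x, point_q, point_p)    # close to 1
pb_lxly = poisson_bracket(ang_x, ang_y, point_q, point_p)    # close to L_z
lz_value = ang_z(point_q, point_p)
print(pb_xp, pb_lxly, lz_value)
```

With the sign convention of (8.25), the bracket of a coordinate with its own momentum is 1 and the bracket of $L_x$ with $L_y$ is $L_z$, so by (8.26) the corresponding commutators are $i\hbar$ and $i\hbar L_z$.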

It is important to realise, however, that the operator on the right-hand side of this equation must be chosen to be Hermitian. Consider for example $pq$, where $q$ is a generalized coordinate and $p$ is its conjugate momentum. If $p$ and $q$ are individually Hermitian, neither $pq$ nor $qp$ is; in fact,

$$(pq)^\dagger = qp \tag{8.27}$$

We therefore represent the classical variable $pq$ by the quantum operator

$$\tfrac{1}{2}(pq + qp)$$

which is Hermitian. Equation (8.26) must, therefore, be understood in light of this restriction.

A system is of course specified by a set of independent dynamical 
variables, equal in number to the number of its “degrees of freedom.”
We see that the states of the system are determined only by the algebra of 
the operators as given by (8.26). It may be convenient to use particular 
representatives of the operators for purposes of calculation, but it is not 
necessary. 

It is shown by a rather tedious calculation that, for classical dynamical variables, (8.26) follows from the relations

$$[q_i, p_j] = i\hbar \, \delta_{ij} \tag{8.28}$$

Consequently, these relations may be taken as basic, and determine the commutation relations of all pairs of such classical variables.

¹P. A. M. Dirac, Principles of Quantum Mechanics. London: Oxford Clarendon Press, 4th ed., 1958, Chapter 4.

The study of quantum systems has, however, revealed that there 
exist “dynamical variables” which have no classical analogs. A well-known
one is the “spin” or intrinsic angular momentum of a particle.
Parity, isospin, and strangeness are others which occur in the description 
of fundamental particles. It is of course necessary to prescribe the 
commutation rules satisfied by these variables—both those satisfied 
between these variables themselves and those between each of them and 
the classical variables. 

These nonclassical variables divide into two categories. Consider, as an example of one type, the parity operator. This simply changes every classical coordinate and momentum into its negative. It does not, then, represent a new “degree of freedom,” but relates to a symmetry property connected with the classical ones. Its commutation rules with these classical variables are then determined by the definition of the operator itself.

The other type of nonclassical operator is one which relates to a 
new degree of freedom having no classical analog (e.g., spin). In this case 
the appropriate commutation relations must be chosen to represent the 
physical character of the operators. The new dynamical attribute must be 
represented by an appropriate vector space at each point of the “classical 
space,” i.e., of the space on which the classical dynamical variables
operate. Because they operate in different spaces, classical and non-classical 
variables are represented by independent operators and so commute 
with each other. The new assumptions which must be made then concern 
only the commutation of nonclassical operators among themselves. 

Consider the case in which the vector space of the system decomposes into two subspaces $S_1$ and $S_2$; the system is represented by independent pairs of vectors $|n_1\rangle$ and $|n_2\rangle$. We can represent by

$$|n_1 n_2\rangle = |n_1\rangle |n_2\rangle \tag{8.29}$$

the vectors of the “product space”; $|n_1\rangle$ and $|n_2\rangle$ are vectors within the separate spaces $S_1$ and $S_2$. To each new dynamical attribute which we discover there corresponds a new space independent of the others, and a new multiplicity in the “product space” which is the complete Hilbert space of the system.


Problem 8-1: A particle moves in a potential, and has energy eigenvalues $E_n$ in states $|n\rangle$. Prove that

$$\frac{2m}{\hbar^2} \sum_n |\langle n|x|0\rangle|^2 (E_n - E_0) = 1$$

where $|0\rangle$ is an arbitrary bound state. (Consider the quantity $[x, [x, H]]$.)




Problem 8-2: Prove that

$$[A, [B, C]] + [B, [C, A]] + [C, [A, B]] = 0$$

Problem 8-3: Prove that

$$[A, BC] = [A, B]C + B[A, C]$$

Problem 8-4: If $H = p^2/2m + V(x)$ show that

$$\dot{x} = \frac{i}{\hbar}[H, x] = \frac{1}{m} p$$

Hence show that, if $|a\rangle$ and $|b\rangle$ are arbitrary eigenstates,

$$\langle a|x|b\rangle = -\frac{i\hbar}{m} \frac{\langle a|p|b\rangle}{E_a - E_b}$$

Problem 8-5: Using (8-4), prove the “sum rules” 


1} 
E (alple) <elel6) eget eg} = 


and 


4. Unitary Operators 


We define an operator $U$ as unitary when its Hermitian conjugate is its inverse:

$$U^\dagger = U^{-1} \tag{8.30}$$

The following results represent important properties of unitary operators:

(a) Under a unitary operation the magnitude of a vector remains unchanged. For let

$$|\Psi'\rangle = U|\Psi\rangle \tag{8.31}$$

Then

$$\langle\Psi'| = \langle\Psi|U^\dagger = \langle\Psi|U^{-1}$$

Consequently

$$\langle\Psi'|\Psi'\rangle = \langle\Psi|U^{-1}U|\Psi\rangle = \langle\Psi|\Psi\rangle \tag{8.32}$$

(b) The inverse of (8.31) is

$$|\Psi\rangle = U^{-1}|\Psi'\rangle = U^\dagger|\Psi'\rangle \tag{8.33}$$

Now the matrix element of an operator $K$ between states $|\Psi_m\rangle$ and $|\Psi_n\rangle$ is

$$\langle\Psi_m|K|\Psi_n\rangle = \langle\Psi_m'|UKU^\dagger|\Psi_n'\rangle \tag{8.34}$$

If we define the unitary transform of $K$ to be

$$K' = UKU^\dagger = UKU^{-1} \tag{8.35}$$

then the matrix representing a dynamical variable $K$ remains invariant under the unitary transformation.


Problem 8-6: If $\lambda$ is a real constant and $M$ a Hermitian operator, the operator

$$U = e^{i\lambda M}$$

is unitary.


Problem 8-7: The product of two unitary operators 
U, and U, is also unitary. 


Problem 8-8: The operator (1 —iK)/(1+iK), where 
K is Hermitian, is a unitary operator. 


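(c) The transformation of an operator $K$ under the unitary transformation generated by the operator $e^{i\lambda M}$ is

$$K' = e^{i\lambda M} K e^{-i\lambda M}$$

It follows that

$$\frac{dK'}{d\lambda} = i e^{i\lambda M} [M, K] e^{-i\lambda M} \tag{8.36}$$

which we write as

$$\frac{dK'}{d\lambda} = i[M, K'] \tag{8.37}$$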


(d) A change of basis corresponds to a unitary transformation. Suppose we transform from a basis $|\varphi_n\rangle$ to another basis $|\chi_n\rangle$. The vectors of these two bases are taken to correspond in pairs, $|\varphi_n\rangle$ transforming into $|\chi_n\rangle$. The transformation is immediately seen to be represented by the operator

$$W = \sum_n |\chi_n\rangle \langle\varphi_n| \tag{8.38}$$

Its Hermitian conjugate is then

$$W^\dagger = \sum_m |\varphi_m\rangle \langle\chi_m| \tag{8.39}$$

Therefore,

$$WW^\dagger = \sum_{n,m} |\chi_n\rangle \langle\varphi_n|\varphi_m\rangle \langle\chi_m| = \sum_n |\chi_n\rangle \langle\chi_n| = 1 \tag{8.40}$$

so that $W$ is a unitary transformation.

(e) The eigenvalue spectrum of an operator and its expectation 
value in a given state are invariant under a unitary transformation. 

The eigenvalue equation for a variable $K$ is

$$K|\Psi\rangle = k|\Psi\rangle$$

Substituting for $|\Psi\rangle$ in terms of $|\Psi'\rangle$, obtained from it by the unitary transformation $U$, we obtain

$$KU^{-1}|\Psi'\rangle = kU^{-1}|\Psi'\rangle$$

Multiplying this equation by $U$ gives

$$UKU^{-1}|\Psi'\rangle = K'|\Psi'\rangle = k|\Psi'\rangle$$

by virtue of (8.35). Therefore, a unitary transformation does not change the eigenvalue problem.

As for mean values, the mean value of $K$ is

$$\bar{K} = \langle\Psi|K|\Psi\rangle = \langle\Psi'|U \, (U^{-1}K'U) \, U^{-1}|\Psi'\rangle = \langle\Psi'|K'|\Psi'\rangle$$

so that the mean value is invariant under a unitary transformation.
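
Properties (a), (b), and (e) can be illustrated with a small numerical sketch. It assumes a two-dimensional space and takes $U = e^{i\lambda\sigma_x} = \cos\lambda + i\sigma_x\sin\lambda$, with $\sigma_x$ the 2x2 matrix with ones off the diagonal; the matrices `U` and `K` and the vector `psi` are arbitrary choices made for the example, not quantities from the text.

```python
import math


def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def dagger(a):
    """Hermitian conjugate (transpose conjugate) of a square matrix."""
    n = len(a)
    return [[complex(a[j][i]).conjugate() for j in range(n)] for i in range(n)]


def apply(a, v):
    return [sum(a[i][j] * v[j] for j in range(len(v))) for i in range(len(a))]


def inner(u, v):
    return sum(complex(x).conjugate() * y for x, y in zip(u, v))


def trace(a):
    return complex(a[0][0] + a[1][1])


def det(a):
    return complex(a[0][0] * a[1][1] - a[0][1] * a[1][0])


lam = 0.7                       # assumed value of the real parameter
c, s = math.cos(lam), math.sin(lam)
U = [[c, 1j * s], [1j * s, c]]  # exp(i * lam * sigma_x)
K = [[1.0, 2 - 1j], [2 + 1j, -3.0]]  # an arbitrary Hermitian matrix
psi = [0.6, 0.8j]               # an arbitrary normalized vector

psi_prime = apply(U, psi)                      # (8.31)
K_prime = matmul(matmul(U, K), dagger(U))      # (8.35)

norm_before = inner(psi, psi).real
norm_after = inner(psi_prime, psi_prime).real
mean_before = inner(psi, apply(K, psi)).real
mean_after = inner(psi_prime, apply(K_prime, psi_prime)).real
print(norm_before, norm_after, mean_before, mean_after)
print(trace(K), trace(K_prime), det(K), det(K_prime))
```

For a 2x2 matrix the trace and determinant fix the two eigenvalues, so their agreement before and after the transformation is a compact check that the eigenvalue spectrum is unchanged.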


5. Representation of Vectors by 
Eigenvectors of Continuous 
Coordinates 


Consider a coordinate $q$, whose eigenvalues we assume to be arbitrary within some range $(q_1, q_2)$ (which could be $-\infty$ and $\infty$, respectively). Designating the eigenvectors by $|q\alpha\rangle$, where $\alpha$ refers to other quantum numbers, we then have

$$q|q'\alpha\rangle = q'|q'\alpha\rangle \tag{8.41}$$

We henceforth omit the $\alpha$ for convenience.


When vectors are specified by a continuous eigenvalue, the problem of normalization becomes a bit more involved than in the discrete case. The expansion in terms of a sum over discrete eigenstates becomes one in terms of an integral over the continuous states:

$$|\Psi\rangle = \int |q\rangle \langle q|\Psi\rangle \, dq \tag{8.42}$$

The magnitude squared of this vector is

$$\langle\Psi|\Psi\rangle = \iint \langle\Psi|q'\rangle \langle q'|q\rangle \langle q|\Psi\rangle \, dq \, dq'$$

On the other hand, taking the scalar product of (8.42) with $\langle\Psi|$,

$$\langle\Psi|\Psi\rangle = \int \langle\Psi|q\rangle \langle q|\Psi\rangle \, dq$$

Consequently,

$$\langle\Psi|q\rangle = \int \langle\Psi|q'\rangle \langle q'|q\rangle \, dq' \tag{8.43}$$

We see, then, that $\langle q'|q\rangle$ is precisely the quantity which is defined as the $\delta$ function $\delta(q' - q)$, that is,

$$\langle q'|q\rangle = \delta(q' - q) \tag{8.44}$$

Thus, the vectors $|q\rangle$ are mutually orthogonal. However, they are not unit vectors.

It is an immediate generalization that, if a system is characterized by a number of coordinates $q_i$ ($i = 1, 2, \ldots, N$), one may write

$$|\Psi\rangle = \int |q_1, \ldots, q_N\rangle \langle q_1, \ldots, q_N|\Psi\rangle \, dq_1 \cdots dq_N \tag{8.45}$$

where $|q_1, \ldots, q_N\rangle$ is a simultaneous eigenstate of the coordinates [which commute by virtue of (8.26) and (8.27)]. The normalization of these vectors is given by

$$\langle q_1', q_2', \ldots, q_N'|q_1, q_2, \ldots, q_N\rangle = \delta(q_1' - q_1) \, \delta(q_2' - q_2) \cdots \delta(q_N' - q_N) \tag{8.46}$$


6. Schrödinger Wave Equation


The wave mechanics of Schrödinger is a particular representation of the general theory of quantum mechanics, in which one chooses as basis vectors the eigenstates of the coordinates of the system considered. We must first specify a representation of the momentum operator $p$ corresponding to the coordinate $q$.

For this purpose we introduce another operator associated with a “translation,” that is to say, a change of $q$ by a constant $\delta$:

$$T_\delta|q\rangle = |q + \delta\rangle \tag{8.47}$$




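If this operates on a general vector $|\Psi\rangle$ we obtain

$$T_\delta|\Psi\rangle = T_\delta \int |q\rangle \langle q|\Psi\rangle \, dq = \int |q + \delta\rangle \langle q|\Psi\rangle \, dq = \int |q\rangle \langle q - \delta|\Psi\rangle \, dq \tag{8.48}$$

Let us then introduce the operator

$$\partial = \lim_{\delta \to 0} \frac{T_\delta - 1}{\delta} \tag{8.49}$$

If this operates on $|\Psi\rangle$, we obtain

$$\partial|\Psi\rangle = \lim_{\delta \to 0} \int |q\rangle \, \frac{1}{\delta} \{\langle q - \delta|\Psi\rangle - \langle q|\Psi\rangle\} \, dq = \int |q\rangle \left( -\frac{\partial}{\partial q} \langle q|\Psi\rangle \right) dq \tag{8.50}$$

It follows that

$$(\partial q - q \partial)|\Psi\rangle = \int |q\rangle \left[ -\frac{\partial}{\partial q} \big( q \langle q|\Psi\rangle \big) + q \frac{\partial}{\partial q} \langle q|\Psi\rangle \right] dq = -\int |q\rangle \langle q|\Psi\rangle \, dq = -|\Psi\rangle \tag{8.51}$$

Therefore, the commutation relation between $\partial$ and $q$ is

$$[\partial, q] = -1 \tag{8.52}$$

Comparing with (8.28), we find that $p$ has the same commutation relation with $q$ as $i\hbar\partial$ has. We may therefore use $i\hbar\partial$ as an operator representative of $p$. Specifically, then, we may write

$$p|\Psi\rangle = \int |q\rangle \left( -i\hbar \frac{\partial}{\partial q} \right) \langle q|\Psi\rangle \, dq \tag{8.53}$$

[see (8.50)].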
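
As a numerical illustration of (8.52) and (8.53), the sketch below applies $-i\hbar\,\partial/\partial q$ to a sample wave function by central differences and checks that $q p - p q$ acting on it returns $i\hbar$ times the same function, in agreement with (8.28). The Gaussian test function, the step size, and the units with $\hbar = 1$ are assumptions made for the example.

```python
import math

HBAR = 1.0  # assumed units


def psi(q):
    """Illustrative wave function <q|Psi>: a Gaussian times a phase factor."""
    return math.exp(-0.5 * q * q) * complex(math.cos(2 * q), math.sin(2 * q))


def momentum(wave, h=1e-4):
    """Return the function (-i hbar d/dq) wave, estimated by central differences."""
    def result(q):
        return -1j * HBAR * (wave(q + h) - wave(q - h)) / (2 * h)
    return result


def times_q(wave):
    """Return the function q * wave(q), the coordinate acting multiplicatively."""
    return lambda q: q * wave(q)


def commutator_qp(wave):
    """Return the function ([q, p] wave)(q) = q (p wave)(q) - (p (q wave))(q)."""
    qp = times_q(momentum(wave))
    pq = momentum(times_q(wave))
    return lambda q: qp(q) - pq(q)


samples = [-1.5, -0.4, 0.0, 0.9, 2.1]
errors = [abs(commutator_qp(psi)(q) - 1j * HBAR * psi(q)) for q in samples]
print(max(errors))
```

The residual printed at the end is of the order of the discretization error of the central difference, not exactly zero.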

Schrödinger wave mechanics is based on the representation of Hilbert space vectors in terms of the vectors $|q\rangle$. The scalar product $\langle q|\Psi\rangle$ is known as the “Schrödinger wave function.” The momentum operator is defined in terms of its operation on this wave function. Since the wave function is the coefficient in the expansion of the vector, the operator on the wave function performs an operation on the vector itself.

From the normalization condition for the state vector $|\Psi\rangle$, namely,

$$\langle\Psi|\Psi\rangle = 1$$

we may, on expanding in terms of coordinate eigenstates, obtain the normalization of the wave function. For

$$\langle\Psi|\Psi\rangle = \int dq \int dq' \, \langle\Psi|q'\rangle \langle q'|q\rangle \langle q|\Psi\rangle = \int dq \, \langle\Psi|q\rangle \langle q|\Psi\rangle$$

by virtue of (8.44). Therefore,

$$\langle\Psi|\Psi\rangle = \int dq \, |\langle q|\Psi\rangle|^2 = 1 \tag{8.54}$$

If we write an eigenvalue equation in terms of the eigenvectors $|q\rangle$, this is known as a “Schrödinger wave equation” (though this designation is sometimes confined to the energy equation).

The generalization to a system with $N$ coordinates $q_1, q_2, \ldots, q_N$ is straightforward. Corresponding to each coordinate is a momentum operator on the wave function

$$p_i = -i\hbar \frac{\partial}{\partial q_i} \tag{8.55}$$

The wave function is

$$\Psi(q_1, \ldots, q_N) = \langle q_1, q_2, \ldots, q_N|\Psi\rangle \tag{8.56}$$

where $|q_1, \ldots, q_N\rangle$ and $\langle q_1, \ldots, q_N|$ are the ket and bra vectors which are simultaneous eigenvectors of all the coordinates.
The wave equation for the energy may be obtained from the eigenvalue equation

$$H(q_i, p_i)|\Psi\rangle = E|\Psi\rangle \tag{8.57}$$

if no nonclassical variables are involved. $H(q_i, p_i)$ must, of course, be Hermitian. From general dynamical theory, it must be quadratic in the $p_i$'s. A term

$$f_{ij}(q_1, \ldots, q_N) \, p_i p_j$$

can be represented by the Hermitian operator

$$\tfrac{1}{2}(f_{ij} \, p_i p_j + p_i p_j \, f_{ij}) \tag{8.58}$$
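When an operator function of $q$, $f(q)$, operates on $|\Psi\rangle$ we get

$$f(q)|\Psi\rangle = f(q) \int |q'\rangle \langle q'|\Psi\rangle \, dq' = \int |q\rangle f(q) \langle q|\Psi\rangle \, dq \tag{8.59}$$

where the $f(q)$ in the integral is now an ordinary function. (The “$q$” here may stand for all of the coordinates $q_1, \ldots, q_N$.) The operator $p_i$, operating on (8.59), would give

$$p_i f(q)|\Psi\rangle = \int |q\rangle \left( -i\hbar \frac{\partial}{\partial q_i} \right) \big( f(q) \langle q|\Psi\rangle \big) \, dq \tag{8.60}$$

We may therefore write (8.57) as

$$H(q_i, p_i) \int |q'\rangle \langle q'|\Psi\rangle \, dq' = E \int |q'\rangle \langle q'|\Psi\rangle \, dq'$$

or

$$\int |q'\rangle \, H\!\left( q_i', -i\hbar \frac{\partial}{\partial q_i'} \right) \langle q'|\Psi\rangle \, dq' = E \int |q'\rangle \langle q'|\Psi\rangle \, dq' \tag{8.61}$$

the operator $H(q_i', -i\hbar(\partial/\partial q_i'))$ being so constructed as to be Hermitian; the construction is formally the same as that for the operators on the vectors $|\Psi\rangle$. If we now take scalar products with the bra vector $\langle q|$, and use the property of the $\delta$ function that

$$\int F(q') \, \delta(q' - q) \, dq' = F(q)$$

we obtain

$$H\!\left( q_i, -i\hbar \frac{\partial}{\partial q_i} \right) \langle q|\Psi\rangle = E \langle q|\Psi\rangle \tag{8.62}$$

The eigenvalue problem then becomes one of differential equations.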


7. Momentum Representation 


Let us consider in this section the particular case of Cartesian coordinates $x_i$. It is possible to exploit the considerable symmetry of the equations of dynamics between the coordinates and their conjugate momenta, to obtain another representation of the quantum equations in which the eigenstates of the $p_i$'s rather than those of the $x_i$'s are used as basis states. For simplicity in writing, we use the single symbols $x$, $p$ to stand for the $3N$ coordinates and momenta $x_i$, $p_i$ ($i = 1, 2, \ldots, 3N$) of the $N$ particles of the system. We also use the short notation

$$x \cdot p = \sum_{i=1}^{3N} x_i p_i \tag{8.63}$$


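In the coordinate representation the eigenvalue equation for the momentum is

$$-i\hbar \frac{\partial}{\partial x_i} \langle x|p\rangle = p_i \langle x|p\rangle \tag{8.64}$$

This has as a particular solution

$$\langle x|p\rangle = A e^{i p \cdot x/\hbar} = A \exp\Big( i \sum_i p_i x_i/\hbar \Big) \tag{8.65}$$

where $A$ is a normalization constant. In view of the fact that

$$\langle p|x\rangle = A^* e^{-i p \cdot x/\hbar} \tag{8.66}$$

we may write

$$\int \langle p'|x\rangle \langle x|p\rangle \, d^{3N}x = |A|^2 \int e^{i(p - p') \cdot x/\hbar} \, d^{3N}x = (2\pi\hbar)^{3N} \, \delta(p - p') \, |A|^2 \tag{8.67}$$

For normalization, therefore, let us take

$$\langle x|p\rangle = \left( \frac{1}{2\pi\hbar} \right)^{3N/2} e^{i p \cdot x/\hbar} \tag{8.68}$$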

Let us now write the Schrödinger function $\langle x|\Psi\rangle$ in terms of momentum eigenfunctions. We know that

$$\langle x'|\Psi\rangle = \sum_{p'} \langle x'|p'\rangle \langle p'|\Psi\rangle \tag{8.69}$$

where $p'$ represents the eigenvalues of momentum of a particle in an infinite volume. Thus

$$\langle x'|\Psi\rangle = \left( \frac{1}{2\pi\hbar} \right)^{3N/2} \int \langle p'|\Psi\rangle \exp(i p' \cdot x'/\hbar) \, d^{3N}p' \tag{8.70}$$

This is a Fourier series expansion, which in the limit has become an integral.

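Let us now see how to express dynamical operators in terms of their effect on the momentum wave functions, that is, on the amplitudes multiplying the momentum eigenvectors.

We may write

$$x_i|\Psi\rangle = x_i \int d^{3N}x' \, d^{3N}p' \, |x'\rangle \langle x'|p'\rangle \langle p'|\Psi\rangle$$

$$= \frac{1}{(2\pi\hbar)^{3N/2}} \int d^{3N}x' \, d^{3N}p' \, |x'\rangle \, x_i' \, e^{i p' \cdot x'/\hbar} \langle p'|\Psi\rangle$$

$$= \frac{1}{(2\pi\hbar)^{3N/2}} (-i\hbar) \int d^{3N}x' \, d^{3N}p' \, |x'\rangle \left( \frac{\partial}{\partial p_i'} e^{i p' \cdot x'/\hbar} \right) \langle p'|\Psi\rangle$$

This may be integrated by parts: Assuming that $\langle p'|\Psi\rangle$ vanishes rapidly enough at infinity that the integrated term vanishes there, we obtain

$$x_i|\Psi\rangle = \frac{1}{(2\pi\hbar)^{3N/2}} \int d^{3N}x' \, d^{3N}p' \, |x'\rangle \, e^{i p' \cdot x'/\hbar} \left( i\hbar \frac{\partial}{\partial p_i'} \right) \langle p'|\Psi\rangle$$

$$= \int d^{3N}x' \, d^{3N}p' \, |x'\rangle \langle x'|p'\rangle \left( i\hbar \frac{\partial}{\partial p_i'} \right) \langle p'|\Psi\rangle$$

$$= \int d^{3N}p' \, |p'\rangle \left( i\hbar \frac{\partial}{\partial p_i'} \right) \langle p'|\Psi\rangle \tag{8.71}$$

On the other hand, of course,

$$p_i|\Psi\rangle = p_i \int d^{3N}p' \, |p'\rangle \langle p'|\Psi\rangle = \int d^{3N}p' \, |p'\rangle \, p_i' \, \langle p'|\Psi\rangle \tag{8.72}$$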


The situation in the momentum representation is, then, exactly the reverse of that in the coordinate representation. In the latter, position coordinates are represented by multiplicative operators on the wave functions, and momentum variables by differential operators, as shown in (8.53). In the momentum representation, momentum operators are multiplicative (8.72) and coordinate operators are represented in terms of derivatives with respect to momentum variables [see (8.71)].

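Whether or not the momentum representation is useful for setting up the eigenvalue problem (it is in fact useful only in rather rare cases), the momentum wave function $\langle p|\Psi\rangle$ may be obtained directly from the coordinate wave function $\langle x|\Psi\rangle$ and may provide useful information about a system. The relation between the two wave functions is obtained as follows:

$$\langle x|\Psi\rangle = \int d^{3N}p' \, \langle x|p'\rangle \langle p'|\Psi\rangle = \left( \frac{1}{2\pi\hbar} \right)^{3N/2} \int d^{3N}p' \, e^{i p' \cdot x/\hbar} \langle p'|\Psi\rangle \tag{8.73}$$

and, conversely,

$$\langle p|\Psi\rangle = \int d^{3N}x' \, \langle p|x'\rangle \langle x'|\Psi\rangle = \left( \frac{1}{2\pi\hbar} \right)^{3N/2} \int d^{3N}x' \, e^{-i p \cdot x'/\hbar} \langle x'|\Psi\rangle \tag{8.74}$$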

These two wave functions are, then, effectively, Fourier transforms of each other. More precisely, in terms of the definition of transforms given in Chapter 5,


(a) ly) 


is the Fourier transform of $\langle x|\Psi\rangle$.
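
The relation (8.74) between the two wave functions can be tried out numerically. The sketch below is a one-dimensional analogue (a single coordinate in place of the $3N$ of the text, so the prefactor is $(2\pi\hbar)^{-1/2}$), with $\hbar = 1$ and a Gaussian test function; all of these choices, and the trapezoidal quadrature, are assumptions made for the illustration.

```python
import cmath
import math

HBAR = 1.0  # assumed units


def psi_x(x):
    """Illustrative coordinate wave function <x|Psi>: a normalized Gaussian."""
    return math.pi ** -0.25 * math.exp(-0.5 * x * x)


def psi_p(p, x_max=12.0, steps=4000):
    """Momentum wave function <p|Psi> from the one-dimensional form of (8.74),
    evaluated with the trapezoidal rule on [-x_max, x_max]."""
    dx = 2 * x_max / steps
    total = 0j
    for k in range(steps + 1):
        x = -x_max + k * dx
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * cmath.exp(-1j * p * x / HBAR) * psi_x(x)
    return total * dx / math.sqrt(2 * math.pi * HBAR)


momenta = (0.0, 0.8, -1.7)
computed = [psi_p(p) for p in momenta]
# Known closed form for this Gaussian when hbar = 1: the same Gaussian in p.
exact = [math.pi ** -0.25 * math.exp(-0.5 * p * p) for p in momenta]
print([abs(c - e) for c, e in zip(computed, exact)])
```

The Gaussian is a convenient test case because its momentum wave function is again a Gaussian, so the quadrature can be compared with a closed form.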


8. Time Evolution of a System 


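In classical mechanics, the equations of motion of a dynamical system with Hamiltonian $H(q_i, p_i)$ are

$$\dot{q}_i = \frac{\partial H}{\partial p_i} \tag{8.75}$$

and

$$\dot{p}_i = -\frac{\partial H}{\partial q_i} \tag{8.76}$$

the dots designating derivatives with respect to time. The rate of change of a dynamical variable $K(q_i, p_i, t)$ is

$$\frac{dK}{dt} = \frac{\partial K}{\partial t} + \sum_i \left( \frac{\partial K}{\partial q_i} \dot{q}_i + \frac{\partial K}{\partial p_i} \dot{p}_i \right) = \frac{\partial K}{\partial t} + \sum_i \left( \frac{\partial K}{\partial q_i} \frac{\partial H}{\partial p_i} - \frac{\partial K}{\partial p_i} \frac{\partial H}{\partial q_i} \right) = \frac{\partial K}{\partial t} + (K, H) \tag{8.77}$$

where $(K, H)$ is the Poisson bracket of $K$ and $H$ as defined by (8.25). But now, by the hypothesis (8.26) the commutator of two variables, $K$ and $H$ in particular, is $i\hbar$ times their Poisson bracket.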

Consider for the moment dynamical variables $K$ which do not contain the time explicitly, that is, for which

$$\frac{\partial K}{\partial t} = 0 \tag{8.78}$$

then

$$[K, H] = i\hbar (K, H) = i\hbar \frac{dK}{dt} \tag{8.79}$$

where the right-hand side of this equation represents $i\hbar$ times a new dynamical variable which is the time derivative of $K$. Thus, if $K$ were a coordinate, $dK/dt$ would be the corresponding velocity, and so forth.


Equation (8.79) may, of course, be inverted to give

$$\frac{dK}{dt} = -\frac{i}{\hbar}[K, H] = \frac{i}{\hbar}[H, K] \tag{8.80}$$

both sides of this equation being operators.

If we consider a system evolving with time, (8.80) may be used to describe its evolution. In fact, it may be verified that

$$K(t) = e^{iHt/\hbar} K(0) e^{-iHt/\hbar} \tag{8.81}$$

defines a time-dependent $K$ operator.

It was on this basis that Heisenberg described the time evolution of a system. Using a constant set of basis vectors, the time variation of the expectation value of any dynamical variable of the system in any state $|\Psi\rangle$ can be obtained by taking the expectation value with respect to $|\Psi\rangle$ of the time-dependent operator $K(t)$:

$$\langle\Psi|K(t)|\Psi\rangle = \langle\Psi|e^{iHt/\hbar} K(0) e^{-iHt/\hbar}|\Psi\rangle \tag{8.82}$$

(The time $t = 0$ is of course arbitrary.)

However, Eq. (8.82) can be interpreted in another way; if we define the time-dependent state vector

$$|\Psi(t)\rangle = e^{-iHt/\hbar}|\Psi\rangle = e^{-iHt/\hbar}|\Psi(0)\rangle \tag{8.83}$$

the expectation value of the variable $K$ can be written

$$\langle\Psi|K(t)|\Psi\rangle = \langle\Psi(t)|K(0)|\Psi(t)\rangle \tag{8.84}$$


Thus, the evolution of the system with time can be described by using a fixed operator and a time-dependent state vector. This method of describing time variation is known as the Schrödinger representation, as contrasted with that in which the time variation is incorporated in the dynamical variable, viz., the Heisenberg representation. It is evident that they provide completely equivalent descriptions of a physical system.
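
The equivalence of the two descriptions can be illustrated for a two-level system. In the sketch below the Hamiltonian is taken diagonal with energies $\pm\tfrac{1}{2}\hbar\omega$, the fixed operator $K(0)$ is the matrix with ones off the diagonal, and the initial state is an equal superposition; these choices, like the names `heisenberg_mean` and `schrodinger_mean`, are assumptions made for the example.

```python
import cmath
import math

HBAR = 1.0   # assumed units
OMEGA = 2.0  # assumed level splitting divided by hbar

ENERGIES = (0.5 * HBAR * OMEGA, -0.5 * HBAR * OMEGA)  # H is diagonal
K0 = ((0.0, 1.0), (1.0, 0.0))                         # K(0), real symmetric
PSI0 = (1 / math.sqrt(2), 1 / math.sqrt(2))           # state at t = 0


def phases(t):
    """Diagonal elements of exp(-iHt/hbar)."""
    return tuple(cmath.exp(-1j * e * t / HBAR) for e in ENERGIES)


def heisenberg_mean(t):
    """<Psi|K(t)|Psi> with K(t) = exp(iHt/hbar) K(0) exp(-iHt/hbar), as in (8.82)."""
    u = phases(t)
    k_t = [[u[i].conjugate() * K0[i][j] * u[j] for j in range(2)]
           for i in range(2)]
    return sum(PSI0[i] * k_t[i][j] * PSI0[j]
               for i in range(2) for j in range(2)).real


def schrodinger_mean(t):
    """<Psi(t)|K(0)|Psi(t)> with |Psi(t)> = exp(-iHt/hbar)|Psi(0)>, as in (8.84)."""
    u = phases(t)
    psi_t = [u[i] * PSI0[i] for i in range(2)]
    return sum(psi_t[i].conjugate() * K0[i][j] * psi_t[j]
               for i in range(2) for j in range(2)).real


times = [0.0, 0.3, 1.1, 2.5]
pairs = [(heisenberg_mean(t), schrodinger_mean(t)) for t in times]
print(pairs)
```

In this example both expressions reduce to $\cos\omega t$, which gives an independent check on the two routines.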

If we use the Schrödinger representation, it is clear that the time-dependent state vector $|\Psi(t)\rangle$ satisfies the equation

$$i\hbar \frac{\partial |\Psi(t)\rangle}{\partial t} = H|\Psi(t)\rangle \tag{8.85}$$

It is clear from (8.83) that if the state of the system, as specified by $|\Psi(0)\rangle$, is known at some initial time, its subsequent state is completely determined. It is therefore appropriate to formulate the problem of the evolution of the system in terms of the first-order differential equation (8.85). The time-dependent Schrödinger wave equation, that is, the equation satisfied by

$$\Psi(q_i, t) = \langle q_i|\Psi(t)\rangle \tag{8.86}$$

follows from (8.85). For let us expand $|\Psi(t)\rangle$:

$$|\Psi(t)\rangle = \int dq_1 \cdots dq_N \, |q_1, \ldots, q_N\rangle \langle q_1, \ldots, q_N|\Psi(t)\rangle = \int d^N q \, |q\rangle \langle q|\Psi(t)\rangle \tag{8.87}$$

where $q$ stands for the aggregate of generalized coordinates $q_i$. As we saw in (8.61)

$$H(q, p)|\Psi(t)\rangle = H(q, p) \int d^N q \, |q\rangle \langle q|\Psi(t)\rangle = \int d^N q \, |q\rangle \, H\!\left( q, -i\hbar \frac{\partial}{\partial q} \right) \langle q|\Psi(t)\rangle \tag{8.88}$$

It follows that, on substituting (8.87) and (8.88) into the two sides of (8.85) and equating coefficients of individual $|q\rangle$'s on the two sides, the Schrödinger wave function $\langle q|\Psi(t)\rangle$ satisfies

$$i\hbar \frac{\partial}{\partial t} \langle q|\Psi(t)\rangle = H\!\left( q, -i\hbar \frac{\partial}{\partial q} \right) \langle q|\Psi(t)\rangle \tag{8.89}$$

The time-dependent equation is not at all necessary, however, since 
all problems of time development of a system may be treated using either 
Eq. (8.81) or Eq. (8.83). 


Problem 8-9: Prove the “virial theorem” in quantum mechanics: If $V$ is a homogeneous function of the coordinates of order $n$, that is, $V(\lambda r) = \lambda^n V(r)$, then the mean values of the kinetic and potential energies ($T$ and $V$, respectively) of a particle are related by the equation

$$n \langle V \rangle = 2 \langle T \rangle$$

[HINT: change the scale of $r$ to write the Schrödinger wave equation in terms of $r' = \lambda r$. Differentiate the equation with respect to $\lambda$, multiply through by $\psi^*$, and integrate.]


9. Interaction Representation 


For many problems, a representation intermediate between those of Heisenberg and Schrödinger is found useful. It has been developed for problems in which it is possible to find an exact solution for a part of the Hamiltonian. We consider, then, a system described by a Hamiltonian

$$H = H_0 + H_1 \tag{8.90}$$

where we know the eigenstates of $H_0$:

$$H_0|n\rangle = E_n|n\rangle \tag{8.91}$$

We call $H_1$ the interaction part of the Hamiltonian, since the situation most often encountered is that in which $H_0$ describes parts or aspects of a system which may be considered separately if certain interactions are ignored, while $H_1$ describes those interactions.


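An equation in which $H_1$ may be considered as the effective Hamiltonian may be obtained by starting from the equation

$$i\hbar \frac{\partial |\Psi\rangle}{\partial t} = (H_0 + H_1)|\Psi\rangle \tag{8.92}$$

and making the substitution

$$|\psi(t)\rangle = e^{iH_0 t/\hbar}|\Psi(t)\rangle \tag{8.93}$$

or rather, its inverse. This leads to

$$e^{-iH_0 t/\hbar} \, i\hbar \frac{\partial}{\partial t}|\psi(t)\rangle = H_1 e^{-iH_0 t/\hbar}|\psi(t)\rangle$$

or

$$i\hbar \frac{\partial |\psi(t)\rangle}{\partial t} = H_1(t)|\psi(t)\rangle \tag{8.94}$$

where

$$H_1(t) = e^{iH_0 t/\hbar} H_1 e^{-iH_0 t/\hbar} \tag{8.95}$$

Equation (8.94) may be reformulated as an integral equation

$$|\psi(t)\rangle = |\psi(t_0)\rangle - \frac{i}{\hbar} \int_{t_0}^{t} H_1(t')|\psi(t')\rangle \, dt' \tag{8.96}$$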


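This equation may be integrated formally in a perturbation series in powers of $H_1$. A simple way to do this is to imagine $H_1$ to contain a “strength parameter” $\lambda$, such that

$$H_1 = \lambda \bar{H}_1 \tag{8.97}$$

We may then assume $|\psi(t)\rangle$ to be expanded in a power series in $\lambda$, viz.,

$$|\psi(t)\rangle = \sum_n \lambda^n |\psi_n(t)\rangle \tag{8.98}$$

and equate corresponding powers of $\lambda$ in (8.96). This leads to the relation

$$|\psi_n(t)\rangle = -\frac{i}{\hbar} \int_{t_0}^{t} dt' \, \bar{H}_1(t')|\psi_{n-1}(t')\rangle \tag{8.99}$$

But of course

$$|\psi_0(t)\rangle = |\psi(t_0)\rangle \tag{8.100}$$

We may therefore iterate (8.99) $n$ times to get

$$|\psi_n(t)\rangle = \left( -\frac{i}{\hbar} \right)^n \int_{t_0}^{t} dt_1 \, \bar{H}_1(t_1) \int_{t_0}^{t_1} dt_2 \, \bar{H}_1(t_2) \cdots \int_{t_0}^{t_{n-1}} dt_n \, \bar{H}_1(t_n)|\psi(t_0)\rangle \tag{8.101}$$

Thus, on recombining each factor $\lambda$ with $\bar{H}_1$ by (8.97), the formal perturbation solution

$$|\psi(t)\rangle = \Big[ 1 - \frac{i}{\hbar} \int_{t_0}^{t} dt_1 \, H_1(t_1) + \left( -\frac{i}{\hbar} \right)^2 \int_{t_0}^{t} dt_1 \, H_1(t_1) \int_{t_0}^{t_1} dt_2 \, H_1(t_2) + \cdots + \left( -\frac{i}{\hbar} \right)^n \int_{t_0}^{t} dt_1 \, H_1(t_1) \int_{t_0}^{t_1} dt_2 \, H_1(t_2) \cdots \int_{t_0}^{t_{n-1}} dt_n \, H_1(t_n) + \cdots \Big] |\psi(t_0)\rangle \tag{8.102}$$

can be deduced.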


10. Time Evolution Operator 
Let us define an operator $U(t, t')$ such that

$$|\Psi(t)\rangle = U(t, t')|\Psi(t')\rangle \tag{8.103}$$

Substituting in (8.85) we see that $U(t, t')$ must satisfy the equation

$$i\hbar \frac{\partial}{\partial t} U(t, t') = H U(t, t') \tag{8.104}$$


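It is simple to show that if $H$ is Hermitian, $U(t, t')$ is unitary. For in this case the equation conjugate to (8.85) is

$$-i\hbar \frac{\partial}{\partial t} \langle\Psi(t)| = \langle\Psi(t)|H \tag{8.105}$$

If we now multiply (8.85) by $\langle\Psi(t)|$ and (8.105) by $|\Psi(t)\rangle$ and subtract, we find that

$$i\hbar \left[ \langle\Psi(t)| \frac{\partial}{\partial t}|\Psi(t)\rangle + \left( \frac{\partial}{\partial t} \langle\Psi(t)| \right) |\Psi(t)\rangle \right] = 0$$

or

$$\frac{d}{dt} \langle\Psi(t)|\Psi(t)\rangle = 0 \tag{8.106}$$

Thus, the magnitude of $|\Psi(t)\rangle$ does not vary with time, and

$$\langle\Psi(t)|\Psi(t)\rangle = \langle\Psi(t')|U^\dagger(t, t') U(t, t')|\Psi(t')\rangle = \langle\Psi(t')|\Psi(t')\rangle$$

It follows, then, that

$$U^\dagger(t, t') U(t, t') = 1 \tag{8.107}$$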


Consider now a special case, the rather trivial one in which we know the eigenstates $|n\rangle$ and the eigenvalues $E_n$ of $H = H_0$. In this case, it follows immediately from (8.85) that

$$|\Psi(t)\rangle = e^{-iH_0(t - t')/\hbar}|\Psi(t')\rangle \tag{8.108}$$

so that

$$U(t, t') = U_0(t - t') = e^{-iH_0(t - t')/\hbar} \tag{8.109}$$

The unitarity of $U(t, t')$ is then manifest. The matrix elements of $U$, using the eigenstates of $H_0$ as a basis, are

$$\langle n|U(t, t')|m\rangle = \delta_{nm} \, e^{-iE_n(t - t')/\hbar} \tag{8.110}$$

Now by virtue of the completeness condition

$$U = \sum_{n,m} |n\rangle \langle n|U(t, t')|m\rangle \langle m| = \sum_n |n\rangle \langle n| \, e^{-iE_n(t - t')/\hbar} \tag{8.111}$$

We may easily determine the time evolution of the wave function. If $|q\rangle$ represent the eigenstates of all the coordinates of the system, multiplying (8.103) by $\langle q|$ and expressing $|\Psi(t')\rangle$ in terms of $|q'\rangle$ yields

$$\langle q|\Psi(t)\rangle = \langle q|U(t, t') \int |q'\rangle \langle q'|\Psi(t')\rangle \, dq' = \int \langle q|U(t, t')|q'\rangle \langle q'|\Psi(t')\rangle \, dq' \tag{8.112}$$

or

$$\Psi(q, t) = \int K(q, t; q', t') \, \Psi(q', t') \, dq' \tag{8.113}$$

The “propagator” $K(q, t; q', t')$ is then

$$K(q, t; q', t') = \langle q|U(t, t')|q'\rangle = \sum_n \langle q|n\rangle \langle n|q'\rangle \, e^{-iE_n(t - t')/\hbar} \tag{8.114}$$

on substituting (8.111) for $U$. Thus

$$K(q, t; q', t') = K(q, q'; t - t') = \sum_n \varphi_n(q) \, \varphi_n^*(q') \, e^{-iE_n(t - t')/\hbar} \tag{8.115}$$

where

$$\varphi_n(q) = \langle q|n\rangle \tag{8.116}$$


Problem 8-10: Show that, for free particles of mass 
m, the propagator is 


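$$K(\mathbf{r},\mathbf{r}';t-t') = \left[\frac{m}{2\pi i\hbar(t-t')}\right]^{3/2}\exp\left[\frac{im\,|\mathbf{r}-\mathbf{r}'|^2}{2\hbar(t-t')}\right]$$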


Problem 8-11: Show that the Fourier transform of 
K(xt; x'0) is 


* i] 
K (x, 3 x'0) = lim n(X) 7 (x’) 


=0 27ri w— E,/h— ie 


so that the energy eigenvalues of the system are given by the poles of K. 


Let us now consider again the general case in which 


$$H_1 = V(t) \tag{8.117}$$
and consequently 


lw) = |W) (8.118) 


Equation (8.102) may be written in terms of the “true wave function”
$|\Psi(t)\rangle$ and the potential V(t). If we introduce under the $t_i$ integration,
after $H_1(t_i)$, the factor

$$e^{-iH_0t_i/\hbar}\,e^{iH_0t_i/\hbar} = 1$$

substitute for $|\psi(t)\rangle$, $|\psi(t_0)\rangle$ in terms of $|\Psi(t)\rangle$ and $|\Psi(t_0)\rangle$, and put

$$H_1(t) = e^{iH_0t/\hbar}\,V(t)\,e^{-iH_0t/\hbar}$$

Eq. (8.101) may be written in terms of $U_0$, as follows:


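$$\begin{aligned}
|\Psi(t)\rangle = \Big[\,& U_0(t-t_0) - \frac{i}{\hbar}\int_{t_0}^{t}dt_1\,U_0(t-t_1)\,V(t_1)\,U_0(t_1-t_0) \\
& + \left(-\frac{i}{\hbar}\right)^{2}\int_{t_0}^{t}dt_1\,U_0(t-t_1)\,V(t_1)\int_{t_0}^{t_1}dt_2\,U_0(t_1-t_2)\,V(t_2)\,U_0(t_2-t_0) + \cdots \\
& + \left(-\frac{i}{\hbar}\right)^{n}\int_{t_0}^{t}dt_1\,U_0(t-t_1)\,V(t_1)\int_{t_0}^{t_1}dt_2\,U_0(t_1-t_2)\,V(t_2)\cdots(n\ \text{times})\cdots \\
& \qquad\times\int_{t_0}^{t_{n-1}}dt_n\,U_0(t_{n-1}-t_n)\,V(t_n)\,U_0(t_n-t_0) + \cdots\Big]\,|\Psi(t_0)\rangle
\end{aligned} \tag{8.119}$$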


$U_0(t_i - t_{i-1})$ may be referred to as the “propagator” from $t_{i-1}$ to $t_i$ for the
Hamiltonian $H_0$. It is then possible to give a diagrammatic representation
for each term of (8.119). The first term corresponds to propagation without
interaction. In the second, there is propagation from $t_0$ to $t_1$, where the
perturbing potential acts to make a transition; the resulting system is
then propagated to t. The next corresponds to a second-order transition
(“scattering”) where the transitions take place at $t_2$ and $t_1$, respectively,
and so forth. Using a straight line to indicate propagation, and a change of
direction to indicate “scattering,” these terms may be represented as
follows, where a point which is above another corresponds to a later
time:


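[Diagrams. “Zero order”: a single line from $t_0$ up to $t$, labeled $U_0(t-t_0)$. “First order”: a line from $t_0$ to a vertex $V(t_1)$ at $t_1$, labeled $U_0(t_1-t_0)$, followed by a line from $t_1$ to $t$, labeled $U_0(t-t_1)$. “Second order”: lines $U_0(t_2-t_0)$, $U_0(t_1-t_2)$, and $U_0(t-t_1)$ joined at the vertices $V(t_2)$ and $V(t_1)$.]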


The square bracket in (8.119) is the operator $U(t,t_0)$. It is immediately
verified that it satisfies the integral equation

$$U(t,t_0) = U_0(t-t_0) - \frac{i}{\hbar}\int_{t_0}^{t}dt_1\,U_0(t-t_1)\,V(t_1)\,U(t_1,t_0) \tag{8.120}$$


Since the time evolution operators are defined only in the “causal”
region

$$t > t_1 > t_0$$

we may take them to be zero for $t_1 > t$ and for $t_1 < t_0$. This enables us to extend the
range of integration in (8.120) from $-\infty$ to $\infty$.




11. Case of a Time-Independent 
Perturbation 


If V is independent of time, a formal solution of (8.120) may be obtained
by taking Fourier transforms. The Fourier transform of $U_0$ may
be written

$$u_0(\omega) = \int U_0(t)\,e^{i\omega t}\,dt \tag{8.121}$$

the inverse relationship being

$$U_0(t) = \frac{1}{2\pi}\int u_0(\omega)\,e^{-i\omega t}\,d\omega \tag{8.122}$$

To ensure convergence at the upper limit, we replace (8.121) by

$$u_0(\omega) = \lim_{\eta\to 0}\int_{-\infty}^{\infty}\theta(t)\,e^{-iH_0t/\hbar}\,e^{i\omega t-\eta t}\,dt$$

where $\theta(t) = 1$ for positive argument and zero for negative. Thus

$$u_0(\omega) = i\hbar\lim_{\eta\to 0}\frac{1}{\hbar\omega - H_0 + i\hbar\eta} \tag{8.123}$$


Since it is common practice nowadays to choose units in which

$$\hbar = c = 1 \tag{8.124}$$

(c being the velocity of light), we henceforth drop the $\hbar$'s, thus effecting
considerable economy in writing.

The factor $i\eta$ in (8.123) assures the “causal” property of $U_0(t)$
(the same device was used in Chapter 4 in a more general context). This
is due to the fact that, in evaluating (8.122),

$$U_0(t) = \frac{i}{2\pi}\lim_{\eta\to 0}\int_{-\infty}^{\infty}\frac{e^{-i\omega t}}{\omega - H_0 + i\eta}\,d\omega \tag{8.125}$$

for t > 0 the contour must be deformed around the lower half-plane,
which encompasses the singularity; whereas, when t < 0 it must be
deformed about the upper half-plane, so that there are no singularities
within the contour and the integral is zero.

A word should be said about the interpretation of these remarks, and
in fact about the meaning of a quantity like

$$\frac{1}{\omega - H_0 + i\eta}$$

$H_0$, after all, is an operator, so that this quantity is also an operator. Its
meaning is clear if we use a representation in which the basis vectors are
the eigenstates of $H_0$. If these states are designated by $|n\rangle$, and the
corresponding energies by $E_n$, the sequence of calculations proceeding
from (8.121) may be interpreted as follows: If $u_0(\omega)$ operates on $|n\rangle$, and
the exponential is interpreted in the sense of a power series, then

$$e^{-iH_0t/\hbar}\,|n\rangle = e^{-iE_nt/\hbar}\,|n\rangle$$

At each subsequent step in the calculations, when an operator involving
$H_0$ operates on a basis state $|n\rangle$, it produces the value $E_n$. Finally, if we
know the effect of an operator on the basis vectors, we know its effect on
any vector, since the basis vectors form a complete set.
Without loss of generality, we can take $t_0 = 0$. If we then take Fourier
transforms of (8.120), and designate the transform of U(t, 0) as $u(\omega)$, we
get

$$u(\omega) = u_0(\omega) - i\,u_0(\omega)\,V\,u(\omega) \tag{8.126}$$

where we have used the fact that the last term of (8.120) is a convolution.
Putting in the form of $u_0$,

$$u = \frac{i}{\omega - H_0 + i\eta} + \frac{1}{\omega - H_0 + i\eta}\,V\,u$$

Multiplying on the left by $\omega - H_0 + i\eta$, we obtain

$$(\omega - H + i\eta)\,u = i$$

or

$$u = i\,(\omega - H + i\eta)^{-1} \tag{8.127}$$


This result, though compact, is completely useless for purposes of 
calculation, except in so far as it might serve as the basis for an approxi- 
mation procedure. 

Some interesting results can be obtained by making a “perturbation”
expansion of (8.127). We write

$$H = H_0 + V$$

and we are then confronted with the problem of expanding the operator

$$\frac{1}{A+B}$$

in a “power series” in B, where in our particular case

$$A = \omega - H_0 + i\eta \tag{8.128}$$

and

$$B = -V \tag{8.129}$$




It is easy to show that 


$$\frac{1}{A+B} = \frac{1}{A} - \frac{1}{A}B\frac{1}{A} + \frac{1}{A}B\frac{1}{A}B\frac{1}{A} - \cdots \tag{8.130}$$
To verify this formula we multiply the operator on the right-hand side by 
A, then by B, and add. All terms cancel except the initial term (unity). 
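The expansion (8.130) can also be checked numerically. The sketch below compares its partial sums with a direct inverse of A + B. The matrices are hypothetical stand-ins (A plays the role of $\omega - H_0 + i\eta$ and B is a small random perturbation), and the series is only expected to converge when B is small compared with A, which is assumed here.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 5

# Hypothetical operators: A is well conditioned, B is a small perturbation.
A = np.diag(np.arange(1.0, n + 1)) + 0.5j * np.eye(n)
B = 0.05 * rng.normal(size=(n, n))

A_inv = np.linalg.inv(A)
exact = np.linalg.inv(A + B)

# Partial sums of 1/A - (1/A)B(1/A) + (1/A)B(1/A)B(1/A) - ...
term = A_inv.copy()
partial = np.zeros_like(A_inv)
errors = []
for order in range(8):
    partial = partial + term
    errors.append(np.abs(partial - exact).max())
    term = -A_inv @ B @ term  # the next term carries one more factor of -(1/A)B

print(errors)
```

The printed errors should fall roughly geometrically with the order, as expected for a series in powers of $A^{-1}B$.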
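Specifically, then, we have

$$\frac{1}{\omega - H + i\eta} = \frac{1}{\omega - H_0 + i\eta} + \frac{1}{\omega - H_0 + i\eta}V\frac{1}{\omega - H_0 + i\eta} + \frac{1}{\omega - H_0 + i\eta}V\frac{1}{\omega - H_0 + i\eta}V\frac{1}{\omega - H_0 + i\eta} + \cdots \tag{8.131}$$

Let us write this as

$$\frac{1}{\omega - H + i\eta} = \frac{1}{\omega - H_0 + i\eta} + \frac{1}{\omega - H_0 + i\eta}\,T\,\frac{1}{\omega - H_0 + i\eta} \tag{8.132}$$

or

$$u(\omega) = u_0(\omega) - i\,u_0(\omega)\,T\,u_0(\omega) \tag{8.133}$$

Then T is the operator

$$T = V + V\frac{1}{\omega - H_0 + i\eta}V + V\frac{1}{\omega - H_0 + i\eta}V\frac{1}{\omega - H_0 + i\eta}V + \cdots \tag{8.134}$$

$$T = V - iVu_0T = V - iVu(\omega)V$$

that is,

$$T = V + V\frac{1}{\omega - H + i\eta}V \tag{8.135}$$

It should be noted that

$$Tu_0 = Vu \tag{8.136}$$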


The form (8.135) for the T operator is compact, but again of no practical
use since eigenstates of H are not known, so that we cannot find a matrix
representation of $i/(\omega - H + i\eta) = u(\omega)$. The form (8.134) serves as the
basis for a perturbation expansion.

We note that, from (8.103),

$$|\Psi(t)\rangle = U(t,t_0)\,|\Psi(t_0)\rangle = U(t,0)\,|\Psi(0)\rangle$$

Now

$$U(t,0) = \frac{i}{2\pi}\int_{-\infty}^{\infty}\frac{1}{\omega - H + i\eta}\,e^{-i\omega t}\,d\omega = \frac{1}{2\pi}\int u(\omega)\,e^{-i\omega t}\,d\omega \tag{8.137}$$




In what follows, we do not write the $i\eta$ explicitly, but treat the frequency
$\omega$ as having a small positive imaginary part. We see that the evolution of
a system is determined once we know $u(\omega)$. If we consider the evolution
of the state $|\Psi(0)\rangle = |0\rangle$ as determined by (8.131), we observe that the
transition from $|0\rangle$ to some final state may take place through intermediate
steps which involve transitions back into the original state.
Aside from producing analytic complications, all such processes which
start and end in the state $|0\rangle$ describe not so much physical transitions as
modifications of the character of the initial state itself. In other words, if
after a time t, a system originally in state $|0\rangle$ is found again in that state,
we may treat all that has taken place in the intervening time as characteristic
only of that state itself.

Let us now see, then, how to separate out all such processes, and 
incorporate them in a modified (‘‘renormalized’’) initial state. 

For this purpose we introduce P as a projection operator onto the
original state, and $Q = 1 - P$ as an operator of projection out of that
state. Consider now Eq. (8.126), and put

$$u = Pu + Qu \tag{8.138}$$

We note that by orthogonality

$$Q\,u_0\,|0\rangle = 0 \tag{8.139}$$

An explicit representation of P and Q is given by

$$P = |0\rangle\langle 0|, \qquad Q = \sum_{r\neq 0}|r\rangle\langle r| \tag{8.140}$$

If, in the equation

$$Pu + Qu = u_0 - i\,u_0V(Pu + Qu) \tag{8.141}$$

we separate the components in and perpendicular to $|0\rangle$, we obtain the
equations

$$Pu = u_0 - i\,u_0PV(Pu) - i\,u_0PV(Qu) \tag{8.142}$$

and

$$Qu = -i\,u_0QV(Pu) - i\,u_0QV(Qu) \tag{8.143}$$

From the first of these equations

$$Pu = (1 + i\,u_0PV)^{-1}\,u_0\,[1 - iPV(Qu)] \tag{8.144}$$

Thus, the operator

$$u_1 = (1 + i\,u_0PV)^{-1}\,u_0 = \left(1 - \frac{1}{\omega - H_0}PV\right)^{-1}u_0$$




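operates only on $|0\rangle$. But

$$\begin{aligned}
\left(1 - \frac{1}{\omega - H_0}PV\right)^{-1}u_0\,|0\rangle
&= \left[1 + \frac{1}{\omega - H_0}PV + \frac{1}{\omega - H_0}PV\frac{1}{\omega - H_0}PV + \cdots\right]u_0\,|0\rangle \\
&= \sum_{n=0}^{\infty}\left(\frac{\langle 0|V|0\rangle}{\omega - E_0}\right)^{n}\frac{i}{\omega - E_0}\,|0\rangle \\
&= \frac{1}{1 - V_{00}/(\omega - E_0)}\,\frac{i}{\omega - E_0}\,|0\rangle \\
&= \frac{i}{\omega - H_0 - V_{00}}\,|0\rangle
\end{aligned} \tag{8.145}$$

where

$$V_{00} = \langle 0|V|0\rangle$$

is the mean potential in the initial state. It follows that

$$u_1 = \frac{i}{\omega - H_0 - V_{00}} \tag{8.146}$$

and that

$$Pu = u_1\,[1 - iPV(Qu)] \tag{8.147}$$

or

$$Pu = u_1\,(1 - iPV\tilde{u}) \tag{8.148}$$

where

$$\tilde{u} = Qu \tag{8.149}$$

In the same notation, Eq. (8.143) may be written

$$(u_0^{-1} + iQV)\,\tilde{u} = -iQV(Pu) \tag{8.150}$$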


We see that, in principle, (8.148) and (8.150) may be solved for Pu and
$\tilde{u} = Qu$. Equation (8.150) enables us to express $\tilde{u}$ in terms of Pu; putting this
back in (8.148) then permits a solution for Pu. From Pu we may obtain
$\langle 0|u|0\rangle$, which in turn permits us to determine the probability amplitude
for a system, starting in state $|0\rangle$ at time t = 0, to be still found in it at
time t. That is, we may calculate the decay from this state (or the scattering
from it, and so forth, depending on the particular problem considered).
$\tilde{u}$, in turn, permits us to determine the amplitudes for transition to various
possible “final” states.




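To carry out the calculations explicitly, we first take the diagonal
matrix element in the initial state of (8.148):

$$\langle 0|u|0\rangle = \frac{i}{\omega - E_0 - V_{00}}\left[1 - i\sum_s \langle 0|V|s\rangle\langle s|\tilde{u}|0\rangle\right] \tag{8.151}$$

Latin indices such as “s” are taken to comprise all states except the
initial state $|0\rangle$.

The matrix elements $\langle s|u|0\rangle$ may be obtained by writing (8.150) in
the form

$$\tilde{u} = -i\,(u_0^{-1} + iQV)^{-1}\,QV(Pu)$$

and expanding

$$(u_0^{-1} + iQV)^{-1} = u_0 - u_0\,iQV\,u_0 + u_0\,iQV\,u_0\,iQV\,u_0 - \cdots$$

The matrix element in question is then

$$\begin{aligned}
\langle s|u|0\rangle = \langle s|\tilde{u}|0\rangle
= \frac{1}{\omega - E_s}\Big[&\langle s|V|0\rangle + \sum_l \langle s|V|l\rangle\frac{1}{\omega - E_l}\langle l|V|0\rangle \\
&+ \sum_{l,m}\langle s|V|l\rangle\frac{1}{\omega - E_l}\langle l|V|m\rangle\frac{1}{\omega - E_m}\langle m|V|0\rangle + \cdots\Big]\,\langle 0|u|0\rangle
\end{aligned} \tag{8.152}$$

Therefore,

$$\sum_s \langle 0|V|s\rangle\langle s|u|0\rangle = F(\omega)\,\langle 0|u|0\rangle \tag{8.153}$$

where

$$\begin{aligned}
F(\omega) = \sum_s \langle 0|V|s\rangle\frac{1}{\omega - E_s}\Big\{&\langle s|V|0\rangle + \sum_l \langle s|V|l\rangle\frac{1}{\omega - E_l}\langle l|V|0\rangle \\
&+ \sum_{l,m}\langle s|V|l\rangle\frac{1}{\omega - E_l}\langle l|V|m\rangle\frac{1}{\omega - E_m}\langle m|V|0\rangle + \cdots\Big\}
\end{aligned} \tag{8.154}$$

In all the sums the summations are over all states but $|0\rangle$, which is
excluded. Substituting (8.154) in (8.151), we obtain

$$\langle 0|u|0\rangle\left[1 - \frac{F(\omega)}{\omega - E_0 - V_{00}}\right] = \frac{i}{\omega - E_0 - V_{00}}$$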




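from which it follows that

$$\langle 0|u|0\rangle = \frac{i}{\omega - E_0 - V_{00} - F(\omega)} \tag{8.155}$$

It is easily seen that

$$V_{00} + F(\omega) = \langle 0|\,V + V\frac{Q}{\omega - H_0}V + V\frac{Q}{\omega - H_0}V\frac{Q}{\omega - H_0}V + \cdots\,|0\rangle = \langle 0|K|0\rangle = K(\omega) \tag{8.156}$$

where

$$K = V + V\frac{Q}{\omega - H_0}V + V\frac{Q}{\omega - H_0}V\frac{Q}{\omega - H_0}V + \cdots \tag{8.157}$$

K clearly satisfies the equation

$$K = V + V\frac{Q}{\omega - H_0}K \tag{8.158}$$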
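The relations (8.155) through (8.158) are exact operator identities, so they can be tested on a small matrix model. The following sketch solves the linear equation (8.158) for K and compares the resulting $\langle 0|u|0\rangle$, and one off-diagonal element $\langle s|u|0\rangle = \langle s|(\omega - H_0)^{-1}K|0\rangle\langle 0|u|0\rangle$, with a direct inversion of $\omega - H$ as in (8.127). The six energy levels, the random Hermitian V, and the complex value of $\omega$ are hypothetical choices made only for this illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 6

# Hypothetical model: H_0 diagonal with levels E_n, V a random Hermitian matrix.
E = np.array([0.0, 0.4, 0.9, 1.3, 2.0, 2.6])
H0 = np.diag(E)
M = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
V = 0.2 * (M + M.conj().T) / 2

omega = 0.3 + 0.05j                      # frequency with a positive imaginary part
identity = np.eye(n)
P = np.zeros((n, n))
P[0, 0] = 1.0                            # projector onto |0>
Q = identity - P                         # projector out of |0>

G0 = np.diag(1.0 / (omega - E))          # 1/(omega - H_0)

# Solve K = V + V Q G0 K, i.e. (1 - V Q G0) K = V, as in (8.158).
K = np.linalg.solve(identity - V @ Q @ G0, V)

# Exact u = i/(omega - H), from (8.127).
resolvent = np.linalg.inv(omega * identity - H0 - V)

u00_from_K = 1j / (omega - E[0] - K[0, 0])        # (8.155) combined with (8.156)
u00_exact = 1j * resolvent[0, 0]

us0_from_K = (G0 @ K)[1, 0] * u00_from_K          # <s|u|0> for s = 1, via K
us0_exact = 1j * resolvent[1, 0]

print(abs(u00_from_K - u00_exact), abs(us0_from_K - us0_exact))
```

Both differences should vanish to rounding error. In a realistic problem K cannot be obtained by a direct solve, which is why the text goes on to approximate it.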


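It is possible to write $\langle s|u|0\rangle$ in terms of the same operator K:

$$\langle s|u|0\rangle = \langle s|\frac{1}{\omega - H_0}K|0\rangle\,\langle 0|u|0\rangle = \langle s|\frac{1}{\omega - H_0}K|0\rangle\,\frac{i}{\omega - E_0 - \langle 0|K|0\rangle} \tag{8.159}$$

The quantity $\langle 0|K(\omega)|0\rangle$ is a sort of “self-energy” of the system at
frequency $\omega$.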


12. Some Useful Relations 


It is easy to make a connection between the matrix elements of T and
of u, using (8.133). For if, in this equation, we project into and out of the
state $|0\rangle$, putting

$$T = PT + QT = T_1 + T_2 \tag{8.160}$$

we see immediately that

$$\langle 0|u|0\rangle = \langle 0|u_0|0\rangle + \langle 0|u_0|0\rangle\,\frac{\langle 0|T_1|0\rangle}{\omega - E_0} \tag{8.161}$$

Comparison with (8.155) and (8.156) reveals that

$$\langle 0|T_1|0\rangle = \frac{\langle 0|K_1|0\rangle}{1 - \dfrac{\langle 0|K_1|0\rangle}{\omega - E_0}} \tag{8.162}$$




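Also, equating Q components of (8.133) gives

$$\langle s|u|0\rangle = \frac{1}{\omega - E_s}\langle s|T_2|0\rangle\,\langle 0|u_0|0\rangle = \frac{1}{\omega - E_s}\langle s|T|0\rangle\,\langle 0|u_0|0\rangle \tag{8.163}$$

Comparing this with (8.159) we find that

$$\langle 0|u_0|0\rangle\,\langle s|T|0\rangle = \langle s|K|0\rangle\,\langle 0|u|0\rangle \tag{8.164}$$

From these equations it is straightforward to find $\langle 0|U(t,0)|0\rangle$ and
$\langle s|U(t,0)|0\rangle$, and thus the probability that the initial state $|0\rangle$ remains
after time t, and the probability that the system has made a transition
to state $|s\rangle$. First,

$$\langle 0|u(\omega)|0\rangle = \frac{i}{\omega + i\eta - E_0 - K(\omega + i\eta)} \tag{8.165}$$

whence

$$\langle 0|U(t,0)|0\rangle = \frac{i}{2\pi}\lim_{\eta\to 0}\int_{-\infty}^{\infty}\frac{e^{-i\omega t}}{\omega + i\eta - E_0 - K(\omega + i\eta)}\,d\omega \tag{8.166}$$

Then

$$\langle s|u(\omega)|0\rangle = \frac{1}{\omega + i\eta - E_s}\,\langle s|K(\omega + i\eta)|0\rangle\,\frac{i}{\omega + i\eta - E_0 - K(\omega + i\eta)} \tag{8.167}$$

from which it follows that

$$\langle s|U(t,0)|0\rangle = \frac{i}{2\pi}\lim_{\eta\to 0}\int_{-\infty}^{\infty}\frac{1}{(\omega + i\eta - E_s)(\omega + i\eta - E_0 - K)}\,\langle s|K(\omega + i\eta)|0\rangle\,e^{-i\omega t}\,d\omega \tag{8.168}$$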


13. Calculation of Total 
Transition Probability 


It is very difficult to evaluate (8.166) and (8.168) exactly, because of 
the complicated dependence of K(w) on w. In the standard form of per- 
turbation theory given in most text books, K is omitted completely in 
the denominator in (8.168). However, in (8.166), such a neglect does not 
lead to any decay of the initial system at all, and so is not acceptable. 
In this case, we may have recourse to an approximation. A simple such 
approximation, which permits us to complete the calculation, is that in 




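which we evaluate $K(\omega)$ only to the second order in the perturbation V:

$$\langle 0|K|0\rangle = \langle 0|V|0\rangle + \langle 0|V\frac{Q}{\omega + i\eta - H_0}V|0\rangle = \langle 0|V|0\rangle + \sum_s \frac{|\langle 0|V|s\rangle|^2}{\omega + i\eta - E_s} \tag{8.169}$$

where it is understood that we must, at the end of the calculation, let
$\eta \to 0$. But $1/(\omega + i\eta - E_s)$ approaches $1/(\omega - E_s)$ except in a very narrow
region near $\omega = E_s$. However, since

$$\frac{1}{\omega - E_s + i\eta} = \frac{(\omega - E_s) - i\eta}{(\omega - E_s)^2 + \eta^2}$$

for $|\omega - E_s| < \epsilon \ll \eta$ it is approximately equal to its imaginary part

$$\frac{-i\eta}{(\omega - E_s)^2 + \eta^2}$$

This imaginary part $\to 0$ as $\eta \to 0$ except for $\omega \approx E_s$, becomes infinite at
$\omega = E_s$, and gives, on integration over $E_s$,

$$-i\left[\tan^{-1}\frac{E_s - \omega}{\eta}\right]_{-\infty}^{\infty} = -\pi i$$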


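Thus it is possible to write

$$\frac{1}{\omega - E_s + i\eta} \to P\,\frac{1}{\omega - E_s} - \pi i\,\delta(E_s - \omega) \tag{8.170}$$

This signifies that, when we integrate $1/(\omega - E_s + i\eta)$ with a function
$A(E_s)$ of $E_s$, we get a real part which is the principal-part integral

$$P\int \frac{A(E_s)}{\omega - E_s}\,dE_s$$

and an imaginary part which is $-\pi$ times

$$\int A(E_s)\,\delta(E_s - \omega)\,dE_s = A(\omega)$$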


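If, now, the states $|s\rangle$ in (8.169) form a continuum, the sum may be
replaced by an integral giving

$$\sum_s \frac{|\langle 0|V|s\rangle|^2}{\omega + i\eta - E_s} \to \int \rho(E_s)\,\frac{|\langle 0|V|s\rangle|^2}{\omega + i\eta - E_s}\,dE_s \tag{8.171}$$

where $\rho(E_s)$ is the number of states per unit energy interval at energy $E_s$
(density of states at $E_s$). Then, using (8.170) we obtain the result

$$\int \rho(E_s)\,\frac{|\langle 0|V|s\rangle|^2}{\omega + i\eta - E_s}\,dE_s = P\int \rho(E_s)\,\frac{|\langle 0|V|s\rangle|^2}{\omega - E_s}\,dE_s - \pi i\,\rho(\omega)\,|\langle 0|V|s\rangle|^2_{E_s=\omega} \tag{8.172}$$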




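$|\langle 0|V|s\rangle|^2_{E_s=\omega}$ represents an average square matrix element over states for
which $E_s = \omega$.

In order to evaluate (8.166), we now make another approximation,
namely, that K is varying slowly enough that $K(\omega)$ may be replaced by
$K(E_0)$. If we put

$$K(\omega) = \delta E_0(\omega) - \tfrac{1}{2}i\,\Gamma_0(\omega) \tag{8.173}$$

this is equivalent to saying that $\delta E_0$ and $\Gamma_0$ are slowly varying functions
of $\omega$ at $E_0$, so that to a good approximation

$$K(\omega) \to \delta E_0(E_0) - \tfrac{1}{2}i\,\Gamma_0(E_0) = \delta E_0 - \tfrac{1}{2}i\,\Gamma_0 \tag{8.174}$$

$\delta E_0$ and $\Gamma_0$ being constants given by

$$\delta E_0 = \langle 0|V|0\rangle + P\int \rho(E_s)\,\frac{|\langle 0|V|s\rangle|^2}{E_0 - E_s}\,dE_s \tag{8.175}$$

and

$$\Gamma_0 = 2\pi\,\rho(E_0)\,|\langle 0|V|s\rangle|^2_{E_s=E_0} \tag{8.176}$$

Returning now to (8.166), we can evaluate the integral

$$\langle 0|U(t,0)|0\rangle = \frac{i}{2\pi}\int_{-\infty}^{\infty}\frac{e^{-i\omega t}}{\omega - E_0 - \delta E_0 + i\Gamma_0/2}\,d\omega \tag{8.177}$$

by deforming the contour around the lower half-plane to obtain

$$\langle 0|U(t,0)|0\rangle = e^{-i(E_0 + \delta E_0)t}\,e^{-\Gamma_0 t/2} \tag{8.178}$$

It follows that the probability that the system is still in the state $|0\rangle$ at
time t is

$$P_{00}(t) = |\langle 0|U(t,0)|0\rangle|^2 = e^{-\Gamma_0 t} \tag{8.179}$$

so that $\Gamma_0$ is the decay constant of the system.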

It should be emphasized that the formula (8.176) gives the decay 
constant only approximately. This formula is generally called the “golden 
rule” formula, following the terminology first introduced by Fermi. 
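The exponential law (8.179) with the decay constant (8.176) can be compared with an exact numerical solution for a simple model. The sketch below is not taken from the text: it assumes a single state $|0\rangle$ at $E_0 = 0$ coupled with a constant matrix element v to a band of equally spaced levels, which stands in for the continuum, so that $\rho = 1/\text{spacing}$ and, by symmetry of the band about $E_0$, the principal-part shift vanishes. All parameter values and names are hypothetical.

```python
import numpy as np

# Hypothetical model (not from the text): one discrete state |0> at E_0 = 0 coupled with
# constant strength v to a band of equally spaced levels standing in for a continuum.
N, spacing, v = 300, 0.02, 0.02
band = spacing * np.arange(-N, N + 1)      # levels E_s, symmetric about E_0
dim = band.size + 1

H = np.zeros((dim, dim))
H[1:, 1:] = np.diag(band)                  # H_0 on the band (E_0 = 0 sits at index 0)
H[0, 1:] = v                               # <0|V|s>
H[1:, 0] = v                               # <s|V|0>

rho = 1.0 / spacing                        # density of states
gamma = 2 * np.pi * rho * v**2             # golden-rule decay constant, (8.176)

energies, vecs = np.linalg.eigh(H)
weights = vecs[0, :] ** 2                  # |<0|exact eigenstate>|^2

def survival(t):
    """P_00(t) = |<0|exp(-iHt)|0>|^2 from exact diagonalization."""
    amplitude = np.sum(weights * np.exp(-1j * energies * t))
    return abs(amplitude) ** 2

for t in (2.0, 5.0, 10.0):
    print(t, survival(t), np.exp(-gamma * t))
```

For times short compared with the recurrence time $2\pi/\text{spacing}$ of the discretized band, the two columns should agree closely; the small residual comes from the finite width of the band.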


14. Transition to a Definite 
Final State 


Using the same approximation as in the previous section, we may
evaluate $\langle s|U(t,0)|0\rangle$ from (8.168) and so evaluate the probability of
transition from state $|0\rangle$ to any arbitrary state in time t. For in this
approximation

$$\langle s|U(t,0)|0\rangle = \frac{i}{2\pi}\lim_{\eta\to 0}\int_{-\infty}^{\infty}\frac{1}{\omega + i\eta - E_s}\,\frac{1}{\omega + i\eta - E_0' + i\Gamma_0/2}\,\langle s|K(E_0)|0\rangle\,e^{-i\omega t}\,d\omega \tag{8.180}$$

where

$$E_0' = E_0 + \delta E_0 \tag{8.181}$$
Figure 8.1 


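The integral may be evaluated by deforming the contour around the
lower half-plane (Fig. 8.1), to give

$$\langle s|U(t,0)|0\rangle = \langle s|K(E_0)|0\rangle\,\frac{1}{(E_0' - i\Gamma_0/2) - E_s}\left[e^{-iE_0't}\,e^{-\Gamma_0 t/2} - e^{-iE_s t}\right] \tag{8.182}$$

Thus, the probability that the system is in state $|s\rangle$ at time t is

$$P_{0s}(t) = |\langle s|K(E_0)|0\rangle|^2\,\frac{1}{(E_0' - E_s)^2 + \Gamma_0^2/4}\left[1 + e^{-\Gamma_0 t} - 2e^{-\Gamma_0 t/2}\cos(E_0' - E_s)t\right] \tag{8.183}$$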


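From this we may find an expression for the total transition probability,
provided we may neglect the variation of the matrix element
$\langle s|K(E_0)|0\rangle$ and the density of states $\rho(E_s)$ with energy over a range
$\sim\Gamma_0$ about $E_s = E_0$. For if we replace $|\langle s|K(E_0)|0\rangle|^2$ by its average
$|\langle s|K(E_0)|0\rangle|^2_{av}$ over the region in question, the probability that some
transition has taken place by time t is

$$\begin{aligned}
\sum_s P_{0s}(t) &= \int \rho(E_s)\,P_{0s}(t)\,dE_s = \rho(E_0)\int P_{0s}(t)\,dE_s \\
&= \rho(E_0)\,|\langle s|K(E_0)|0\rangle|^2_{av}\int_{-\infty}^{\infty}\frac{1}{(E_s - E_0')^2 + \Gamma_0^2/4}\left[1 + e^{-\Gamma_0 t} - 2e^{-\Gamma_0 t/2}\cos(E_s - E_0')t\right]dE_s
\end{aligned} \tag{8.184}$$

But

$$\int_{-\infty}^{\infty}\frac{dE_s}{(E_s - E_0')^2 + \Gamma_0^2/4} = \frac{2\pi}{\Gamma_0} \tag{8.185}$$

and

$$\int_{-\infty}^{\infty}\frac{\cos(E_s - E_0')t}{(E_s - E_0')^2 + \Gamma_0^2/4}\,dE_s = \frac{2\pi}{\Gamma_0}\,e^{-\Gamma_0 t/2} \tag{8.186}$$

Therefore,

$$\sum_s P_{0s}(t) = \frac{2\pi}{\Gamma_0}\,(1 - e^{-\Gamma_0 t})\,\rho(E_0)\,|\langle s|K(E_0)|0\rangle|^2_{av} \tag{8.187}$$

Finally, we see that

$$\frac{dP_{00}(t)}{dt} = -\frac{d}{dt}\sum_s P_{0s}(t) = -e^{-\Gamma_0 t}\,2\pi\,\rho(E_0)\,|\langle s|K(E_0)|0\rangle|^2_{av} \tag{8.188}$$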


This implies a $\Gamma_0$ which is an improvement over that given by (8.176)
in that $\langle 0|V|s\rangle$ is replaced by

$$\langle 0|K(E_0)|s\rangle = \langle 0|V|s\rangle + \sum_l \frac{\langle 0|V|l\rangle\langle l|V|s\rangle}{E_0 - E_l} + \sum_{l,m}\frac{\langle 0|V|l\rangle\langle l|V|m\rangle\langle m|V|s\rangle}{(E_0 - E_l)(E_0 - E_m)} + \cdots \tag{8.189}$$

This is not unreasonable, since the approximation (8.169) to $\Gamma_0$ was
based on considering only direct transitions from the state $|0\rangle$, whereas
(8.189) allows for indirect ones. Better approximations to $\langle 0|K(\omega)|0\rangle$
than that of (8.169) are of course possible. What is important to note is
that the calculation of $P_{0s}(t)$ which we have given does not depend on the
approximate form of $\Gamma_0$ which we calculated. Thus, while (8.187) is not
self-consistent as it stands, it can be made so by calculating $\Gamma_0$ more
accurately. However, we do not pursue the matter further here.


Problem 8-12: A quantum-mechanical system in the
absence of perturbations can exist in one of two states $|1\rangle$ and $|2\rangle$. In
the presence of a perturbation which has no diagonal matrix elements for
these states, but only the element $\langle 1|V|2\rangle = \langle 2|V|1\rangle^*$, find the probability
that the system, which at time t = 0 is known to be in state $|1\rangle$, will
be in $|2\rangle$ at time t.


Problem 8-13: In the preceding problem, suppose
that state $|2\rangle$ is at energy $\Delta E$ higher than that of state $|1\rangle$. Taking a
perturbation

$$V\,(e^{i\omega t} + e^{-i\omega t})$$

find the probability of transition (i) from $|1\rangle$ to $|2\rangle$ and (ii) from $|2\rangle$ to
$|1\rangle$, in time t.


Problem 8-14: Solve the above problems if there is a
third state $|3\rangle$ higher in energy than both $|1\rangle$ and $|2\rangle$, and such that the
only nonzero matrix elements of the perturbation are those between $|1\rangle$
and $|3\rangle$, and between $|2\rangle$ and $|3\rangle$.


Problem 8-15: Solve the same problems where the 
final state in each case is part of a continuum. Neglect transitions between 
states of the continuum. 


Problem 8-16: Consider an electron in a uniform 
magnetic field in the z direction. Suppose that its spin is in the x direction 
at t = 0. Find the probability that it is in each of the following states at
time t:


@) S; = 
ii) S,=-4 
(iii) S: ai 
(iv) S,=4 


15. Case of a Time-Dependent 
Perturbation 


Let the perturbing Hamiltonian be of the form 
$$H_1 = V(\mathbf{r}, t) \tag{8.190}$$


Taking the initial time $t_0 = 0$, Eq. (8.120) becomes

$$U(t) = U_0(t) - i\int_0^t dt_1\,U_0(t - t_1)\,V(\mathbf{r}, t_1)\,U(t_1) \tag{8.191}$$

where, as earlier, we have put $\hbar = 1$. This equation is difficult to solve,
even approximately, in general. A particular case of interest which can
be treated is that of a periodic potential:

$$V(\mathbf{r}, t) = F(\mathbf{r})\,e^{i\omega_0 t} + F^\dagger(\mathbf{r})\,e^{-i\omega_0 t} \tag{8.192}$$


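where F is a stationary operator and $F^\dagger$ its Hermitian conjugate.
To solve (8.191), we first take the Fourier transform of the equation.
The transform of the integral is

$$\int_{-\infty}^{\infty}dt\,e^{i\omega t}\int_{-\infty}^{\infty}dt_1\,U_0(t - t_1)\left[F\,e^{i\omega_0 t_1} + F^\dagger e^{-i\omega_0 t_1}\right]U(t_1) \tag{8.193}$$

The limits of the inner integral have been extended from $-\infty$ to $\infty$, since
the evolution operators $U_0$, U may be taken to be zero for negative
argument (this is the expression of the condition of causality).

The variable t may be replaced by a new variable

$$t' = t - t_1 \tag{8.194}$$

so that (8.193) takes the form

$$\int_{-\infty}^{\infty}dt'\,e^{i\omega t'}\,U_0(t')\int_{-\infty}^{\infty}e^{i\omega t_1}\left[F\,e^{i\omega_0 t_1} + F^\dagger e^{-i\omega_0 t_1}\right]U(t_1)\,dt_1 = u_0(\omega)\left[F\,u(\omega + \omega_0) + F^\dagger u(\omega - \omega_0)\right] \tag{8.195}$$

Therefore

$$u(\omega) = u_0(\omega)\left\{1 - i\left[F\,u(\omega + \omega_0) + F^\dagger u(\omega - \omega_0)\right]\right\} \tag{8.196}$$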


This may be solved by an iterative (perturbation) procedure. Two 
features of this procedure should be noted: 

(i) In successive approximations, we get terms involving u with
arguments $\omega \pm n\omega_0$, n being an integer. This implies the possibility of
successive emission or absorption of amounts $\hbar\omega_0$ of energy. To the
lowest order (obtained by replacing u by $u_0$ on the right), only transitions
between the initial energy state $E_0$ and states with energies $E_0 \pm \hbar\omega_0$
will be accounted for, as we see below.

(ii) In higher approximations of even order, transitions will be
possible back to the initial state (absorption followed by emission and
vice versa). Such terms again have the character of a self-energy.




16. First-Order Perturbation 
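In the lowest order of perturbation, in which $u_0$ replaces u on the
right-hand side of (8.196), u is given by

$$u(\omega) = u_0(\omega) - i\,u_0(\omega)\left\{F\,u_0(\omega + \omega_0) + F^\dagger u_0(\omega - \omega_0)\right\} \tag{8.197}$$

In this approximation, the time evolution operator is

$$U(t) = \frac{1}{2\pi}\int u_0(\omega)\left\{1 - i\left[F\,u_0(\omega + \omega_0) + F^\dagger u_0(\omega - \omega_0)\right]\right\}e^{-i\omega t}\,d\omega \tag{8.198}$$

The amplitude for transition from state $|0\rangle$ to state $|s\rangle$ in time t is then

$$\langle s|U(t)|0\rangle = \frac{i}{2\pi}\lim_{\eta\to 0}\int_{-\infty}^{\infty}\frac{1}{\omega - E_s + i\eta}\left[\langle s|F|0\rangle\frac{1}{\omega + \omega_0 - E_0 + i\eta} + \langle s|F^\dagger|0\rangle\frac{1}{\omega - \omega_0 - E_0 + i\eta}\right]e^{-i\omega t}\,d\omega \tag{8.199}$$

The integrals may be evaluated by completing the contour with a semicircle
of radius $\to\infty$ around the negative half-plane. In this way we obtain

$$\langle s|U(t)|0\rangle = \langle s|F|0\rangle\,\frac{1}{E_s - E_0 + \omega_0}\left[e^{-iE_s t} - e^{-i(E_0 - \omega_0)t}\right] + \langle s|F^\dagger|0\rangle\,\frac{1}{E_s - E_0 - \omega_0}\left[e^{-iE_s t} - e^{-i(E_0 + \omega_0)t}\right] \tag{8.200}$$

The probability that the system is in state $|s\rangle$ at time t is

$$P_{0s}(t) = |\langle s|U(t)|0\rangle|^2 = |\langle s|F|0\rangle|^2\,\frac{\sin^2\tfrac{1}{2}(E_s - E_0 + \omega_0)t}{\left[\tfrac{1}{2}(E_s - E_0 + \omega_0)\right]^2} + |\langle s|F^\dagger|0\rangle|^2\,\frac{\sin^2\tfrac{1}{2}(E_s - E_0 - \omega_0)t}{\left[\tfrac{1}{2}(E_s - E_0 - \omega_0)\right]^2} + R \tag{8.201}$$

where the remaining terms R, which are cross terms in the magnitude
squared of the right-hand side of (8.200), may be written

$$\begin{aligned}
R = \frac{|\langle s|F|0\rangle|^2}{(E_s - E_0)^2 - \omega_0^2}\Big\{&4\cos 2\delta\left[\sin^2\tfrac{1}{2}(E_s - E_0 - \omega_0)t + \sin^2\tfrac{1}{2}(E_s - E_0 + \omega_0)t\right] \\
&- 2\left[\cos 2\delta - \cos(2\omega_0 t + 2\delta)\right] \\
&- 2\sin 2\delta\left[\sin(E_s - E_0 - \omega_0)t - \sin(E_s - E_0 + \omega_0)t\right]\Big\}
\end{aligned} \tag{8.202}$$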


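$\delta$ being the phase of the matrix element $\langle s|F|0\rangle$. Thus the probability
per unit time of transition from state $|0\rangle$ to state $|s\rangle$ at time t is

$$\frac{\partial P_{0s}(t)}{\partial t} = 2\,|\langle s|F|0\rangle|^2\left[\frac{\sin(E_s - E_0 - \omega_0)t}{E_s - E_0 - \omega_0} + \frac{\sin(E_s - E_0 + \omega_0)t}{E_s - E_0 + \omega_0}\right] + \frac{\partial R}{\partial t} \tag{8.203}$$

where

$$\frac{\partial R}{\partial t} = 2\,|\langle s|F|0\rangle|^2\left\{\frac{\sin[(E_s - E_0 - \omega_0)t - 2\delta]}{E_s - E_0 + \omega_0} + \frac{\sin[(E_s - E_0 + \omega_0)t + 2\delta]}{E_s - E_0 - \omega_0} - \frac{2\omega_0}{(E_s - E_0)^2 - \omega_0^2}\sin(2\omega_0 t + 2\delta)\right\} \tag{8.204}$$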
Considering first (8.203), we observe that, for large t, the function
(sin tx)/x approaches the $\delta$ function $\pi\delta(x)$. Therefore, ignoring $\partial R/\partial t$, the
probability of transition per unit time for large t approaches

$$\frac{\partial P_{0s}}{\partial t} = 2\pi\,|\langle s|F|0\rangle|^2\left\{\delta(E_s - E_0 - \omega_0) + \delta(E_s - E_0 + \omega_0)\right\} \tag{8.205}$$
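The statement that (sin tx)/x acts like $\pi\delta(x)$ for large t can be illustrated numerically by integrating it against a smooth test function. In the sketch below the Gaussian test function, the integration range, and the grid are assumptions chosen for the example; for this particular test function the integral is known in closed form to be $\pi\,\mathrm{erf}(t/2)$ when the width is 1, which tends to $\pi f(0) = \pi$.

```python
import numpy as np

def smeared(t, width=1.0, half_range=10.0, points=400_001):
    """Riemann-sum estimate of the integral of sin(t x)/x * f(x), f(x) = exp(-x^2/width^2)."""
    x = np.linspace(-half_range, half_range, points)
    dx = x[1] - x[0]
    kernel = t * np.sinc(t * x / np.pi)     # equals sin(t x)/x and is finite at x = 0
    f = np.exp(-(x / width) ** 2)
    return np.sum(kernel * f) * dx

for t in (1.0, 5.0, 40.0):
    print(t, smeared(t), np.pi)             # f(0) = 1, so the large-t limit is pi
```

For small t the kernel is broad and the integral falls short of $\pi$; as t grows the kernel narrows and the result approaches $\pi f(0)$, which is the sense in which the transition rate in (8.205) becomes a sum of $\delta$ functions.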


If we make a better approximation in which the level width is taken into
account, the $\delta$ function is “broadened.”

As for the $\partial R/\partial t$ term, it is oscillating and is never large; it decreases
rapidly as the energy of the final state becomes substantially different
from $E_s = E_0 \pm \hbar\omega_0$.

The relative importance of the two terms is illustrated by considering
the case in which there is a continuum of final states $|s\rangle$. If we
sum (integrate) over these final states, we obtain from (8.205)

$$\sum_s \frac{\partial P_{0s}}{\partial t} = 2\pi\,|\langle s|F|0\rangle|^2_{av}\left[\rho(E_0 + \omega_0) + \rho(E_0 - \omega_0)\right] \tag{8.206}$$

where $\rho(E)$ is the density of states. As for the first two terms in (8.204),
for large t the sine functions will oscillate rapidly over a range of energy
for which the matrix element and density of states generally vary very
little; their contribution is therefore very small. The last term oscillates
in time and thus does not give rise to “permanent” transitions. For
sufficiently large $\omega_0$, and for periods of observation $\gg \pi/\omega_0$, it has no
measurable effect. In fact, over periods $\sim \pi/\omega_0$, the energy of the final
state can only be determined to within an uncertainty $\sim 2\hbar\omega_0$, and it is
for this reason that the term appears.


17. Perturbation Theory for 
Stationary States 


In the majority of quantum physical problems, it is not possible to
find the eigenfunctions or eigenvalues exactly. The steady-state problem
may, however, often be put in the form

$$(H_0 + \lambda V)\,|\psi\rangle = E\,|\psi\rangle \tag{8.207}$$

where the eigenstates of $H_0$ may be determined exactly; $\lambda V$ is designated
the “perturbation.” Solutions for the states $|\psi\rangle$ may then be expressed in
power series in $\lambda$. The solution will, of course, only be valid if the exact
solution is an analytic function of $\lambda$; where perturbation theory is applied,
it will be assumed that this is the case. Also, the successive terms in the
expansion become more and more complicated, so that the practical use
of perturbation theory is confined to cases in which all but the first few
terms of the series are negligible.

There are two distinct versions of perturbation theory. The more
widely used is associated with the names of Rayleigh and Schrödinger. In
this case both $|\psi\rangle$ and E are expressed in series in $\lambda$. In the other approach,
due to Brillouin and Wigner, $|\psi\rangle$ is expressed as a series but the
eigenvalues E are determined implicitly.


18. Rayleigh-Schrödinger
Perturbation Theory


When the perturbation has zero strength, i.e., when $\lambda = 0$, the states
$|n\rangle$ are determined by

$$H_0\,|n\rangle = E_n\,|n\rangle \tag{8.208}$$

As $\lambda$ is gradually increased, each “unperturbed” state evolves into a
“perturbed” one; there is a one-to-one correspondence between the
unperturbed and perturbed states. For convenience, we designate the
unperturbed state to be considered as $|\psi_0\rangle$, and its energy $E_0$. Thus, the
wave equation for the original state is

$$H_0\,|\psi_0\rangle = E_0\,|\psi_0\rangle \tag{8.209}$$

and that for the corresponding exact state is

$$(H_0 + \lambda V)\,|\psi\rangle = E\,|\psi\rangle \tag{8.210}$$

We designate the energy shift as

$$\Delta E = E - E_0 \tag{8.211}$$

Then subtracting (8.209) from (8.210), we obtain the equation

$$(E_0 - H_0)\,(|\psi\rangle - |\psi_0\rangle) = (\lambda V - \Delta E)\,|\psi\rangle \tag{8.212}$$

We take $|\psi_0\rangle$ to be normalized, but choose $|\psi\rangle$ so that its component on
$|\psi_0\rangle$ has unit amplitude; consequently, $|\psi\rangle$ is not normalized to unity. In
this case

$$P\,|\psi\rangle = |\psi_0\rangle \tag{8.213}$$

where P is the projection operator on the initial state, and (8.212) may be
written

$$(E_0 - H_0)\,Q\,|\psi\rangle = (\lambda V - \Delta E)\,|\psi\rangle \tag{8.214}$$


Q being the operator $(1 - P)$, which consequently projects out all components
in the direction of $|\psi_0\rangle$.

Since the left-hand side has no component in the direction of $|\psi_0\rangle$,
neither has the right; we may therefore place the operator Q in front of
it. Then, multiplying across by the operator $(E_0 - H_0)^{-1} = 1/(E_0 - H_0)$ we
find that

$$Q\,|\psi\rangle = \frac{1}{E_0 - H_0}\,Q\,(\lambda V - \Delta E)\,|\psi\rangle$$

or

$$|\psi\rangle = |\psi_0\rangle + \frac{1}{E_0 - H_0}\,Q\,(\lambda V - \Delta E)\,|\psi\rangle \tag{8.215}$$


The second term on the right represents the perturbation; the equation
may be solved by iteration.

$\Delta E$ may be obtained in any approximation once $|\psi\rangle$ is known, for
on multiplying both sides of (8.214) by the bra vector $\langle\psi_0|$, the left-hand
side vanishes and the equation

$$\Delta E = \lambda\,\frac{\langle\psi_0|V|\psi\rangle}{\langle\psi_0|\psi\rangle} \tag{8.216}$$

is obtained. With the normalization referred to above, the denominator is
unity, so

$$\Delta E = \lambda\,\langle\psi_0|V|\psi\rangle \tag{8.217}$$

Thus, if $|\psi\rangle$ is determined up to terms of order $\lambda^{n-1}$, $\Delta E$ is obtained to
order $\lambda^{n}$.

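The successive approximations to $|\psi\rangle$ may be expressed explicitly
by writing (8.215) in the form

$$\left[1 - \frac{1}{E_0 - H_0}\,Q\,(\lambda V - \Delta E)\right]|\psi\rangle = |\psi_0\rangle$$

or

$$|\psi\rangle = \left[1 - \frac{1}{E_0 - H_0}\,Q\,(\lambda V - \Delta E)\right]^{-1}|\psi_0\rangle \tag{8.218}$$

Therefore,

$$|\psi\rangle = |\psi_0\rangle + \frac{Q}{E_0 - H_0}\,\lambda V\,|\psi_0\rangle + \frac{Q}{E_0 - H_0}\,(\lambda V - \Delta E)\,\frac{Q}{E_0 - H_0}\,\lambda V\,|\psi_0\rangle + \cdots \tag{8.219}$$

We see from (8.217) that $\Delta E$ itself is a power series in $\lambda$, i.e.,

$$\Delta E = \sum_n \lambda^{n}\,\Delta E^{(n)}$$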




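Equation (8.218) can be expanded as a power series in $\lambda$ in terms of the
eigenstates $|n\rangle$ of $H_0$, after which $\lambda$ may be put equal to unity. To the second
order in the wave function and the third in the energy (the primes indicate
that the state $|\psi_0\rangle$ is excluded from the sums, because of the projection Q)

$$|\psi\rangle = |\psi_0\rangle + \sum_n{}'\,|n\rangle\,\frac{\langle n|V|\psi_0\rangle}{E_0 - E_n} + \sum_{n,n'}{}'\,|n\rangle\,\frac{\langle n|V|n'\rangle\langle n'|V|\psi_0\rangle}{(E_0 - E_n)(E_0 - E_{n'})} - \sum_n{}'\,|n\rangle\,\frac{\langle n|V|\psi_0\rangle\,\langle\psi_0|V|\psi_0\rangle}{(E_0 - E_n)^2} + \cdots \tag{8.220}$$

From (8.217) the energy may be expressed in the series:

$$E = E_0 + \langle\psi_0|V|\psi_0\rangle + \sum_n{}'\,\frac{|\langle\psi_0|V|n\rangle|^2}{E_0 - E_n} + \sum_{n,n'}{}'\,\frac{\langle\psi_0|V|n\rangle\langle n|V|n'\rangle\langle n'|V|\psi_0\rangle}{(E_0 - E_n)(E_0 - E_{n'})} - \langle\psi_0|V|\psi_0\rangle\sum_n{}'\,\frac{|\langle\psi_0|V|n\rangle|^2}{(E_0 - E_n)^2} + \cdots \tag{8.221}$$

At each step in the expansion the nth term in the energy is calculated
from the $(n-1)$th term in the wave function.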


19. Degenerate Case 


The above development runs into difficulty if the state $|\psi_0\rangle$ is degenerate;
that is, if any of the states $|n\rangle$ have the energy $E_0$. The treatment
of this case requires a modification of the above derivations. Let us
suppose that there are a number of states $|\beta\rangle$ with the same energy $E_0$,
and that we wish to calculate their perturbation. If P is now the projection
on the subspace of the states $|\beta\rangle$, following exactly the lines of the
previous calculation, we obtain an equation formally identical with
(8.214),

$$(E_0 - H_0)(1 - P)\,|\psi\rangle = (1 - P)(V - \Delta E)\,|\psi\rangle \tag{8.222}$$

Before proceeding further, we note that if the states $|\beta\rangle$ are so chosen
that there are no matrix elements of V between them, there will not be any
zero denominators. Since any linear combination of the $|\beta\rangle$'s is an eigenstate
of $H_0$ with eigenvalue $E_0$, we first solve within the subspace of the
$|\beta\rangle$'s the eigenvalue problem

$$V\,|\beta\rangle = \epsilon_\beta\,|\beta\rangle \tag{8.223}$$

or

$$\sum_{\beta'}|\beta'\rangle\langle\beta'|V|\beta\rangle = \epsilon_\beta\,|\beta\rangle \tag{8.224}$$




The $\epsilon_\beta$'s are determined by the secular equation

$$\mathrm{Det}\left[\langle\beta'|V|\beta\rangle - \epsilon\,\delta_{\beta\beta'}\right] = 0 \tag{8.225}$$


Since V is a Hermitian operator, the eigenvalues are real and the eigen- 
states orthogonal within the degenerate subspace. 
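The first step of the degenerate treatment, diagonalizing V inside the degenerate subspace as in (8.223) through (8.225), can be sketched as follows. The three-level $H_0$ with a doubly degenerate level and the particular matrix V are hypothetical numbers chosen for the illustration; the exact eigenvalues are included only for comparison.

```python
import numpy as np

# Hypothetical example: levels 1 and 2 of H_0 are degenerate (E_0 = 1); level 3 lies at 3.
H0 = np.diag([1.0, 1.0, 3.0])
V = 0.01 * np.array([[0.2, 0.5, 0.3],
                     [0.5, -0.1, 0.4],
                     [0.3, 0.4, 0.6]])

# Secular problem (8.223)-(8.225): diagonalize V inside the degenerate subspace.
block = V[:2, :2]
eps, beta = np.linalg.eigh(block)

first_order = 1.0 + eps                          # E_0 + epsilon_beta
exact = np.linalg.eigvalsh(H0 + V)[:2]           # two lowest exact eigenvalues

rotated = beta.T @ block @ beta                  # V in the basis of the new states
off_diagonal = abs(rotated[0, 1])

print(first_order, exact, off_diagonal)
```

In the rotated basis V has no matrix elements between the degenerate states, so no zero denominators arise in the higher-order terms; the remaining difference between `first_order` and `exact` is of second order in V.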


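Reinterpreting $P = P_\beta$ in (8.222) as the operator of the projection on
one of the states just determined, we have

$$|\psi\rangle = |\psi_\beta\rangle + \frac{1}{E_0 - H_0}(1 - P_\beta)(V - \Delta E)\,|\psi\rangle$$

By iteration

$$|\psi\rangle = |\psi_\beta\rangle + \frac{1}{E_0 - H_0}(1 - P_\beta)(V - \Delta E)\,|\psi_\beta\rangle + \frac{1}{E_0 - H_0}(1 - P_\beta)(V - \Delta E)\frac{1}{E_0 - H_0}(1 - P_\beta)(V - \Delta E)\,|\psi_\beta\rangle + \cdots$$

As before, the last $\Delta E$ terms go out because $(1 - P_\beta)|\psi_\beta\rangle = 0$. Also,
because $V|\psi_\beta\rangle$ has no components on the other states of the degenerate set,
$(1 - P_\beta)$ in the above may be replaced by the operator $(1 - P)$ which projects
out of the whole degenerate subspace. Thus, we obtain a formula
similar to (8.220):

$$|\psi\rangle = |\beta\rangle + \sum_n{}'\,|n\rangle\,\frac{\langle n|V|\beta\rangle}{E_0 - E_n} + \sum_{n,n'}{}'\,|n\rangle\,\frac{\langle n|V|n'\rangle\langle n'|V|\beta\rangle}{(E_0 - E_n)(E_0 - E_{n'})} + \cdots \tag{8.226}$$

where the prime on the summation sign $\sum'$ signifies that the intermediate
states $|n\rangle$, $|n'\rangle$, and so forth have no components in the subspace of the
$|\beta\rangle$'s.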
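Taking the scalar product of (8.214) with $|\beta\rangle$, we find that

$$\Delta E = \langle\beta|V|\psi\rangle \tag{8.227}$$

Using (8.226), we obtain an expression for $\Delta E$; $E = E_0 + \Delta E$ becomes

$$E = E_0 + \langle\beta|V|\beta\rangle + \sum_n{}'\,\frac{\langle\beta|V|n\rangle\langle n|V|\beta\rangle}{E_0 - E_n} + \sum_{n,n'}{}'\,\frac{\langle\beta|V|n'\rangle\langle n'|V|n\rangle\langle n|V|\beta\rangle}{(E_0 - E_{n'})(E_0 - E_n)} - \langle\beta|V|\beta\rangle\sum_n{}'\,\frac{\langle\beta|V|n\rangle\langle n|V|\beta\rangle}{(E_0 - E_n)^2} + \cdots \tag{8.228}$$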




20. Brillouin-Wigner
Perturbation Theory


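We proceed in a manner very similar to that of the Rayleigh-Schrödinger
theory, except that, when we subtract (8.209) from (8.210),
we write the result in the form

$$(E - H_0)\,(|\psi\rangle - |\psi_0\rangle) = V\,|\psi\rangle - \Delta E\,|\psi_0\rangle \tag{8.229}$$

Again introducing the operator P which projects on the initial state $|\psi_0\rangle$
and $Q = 1 - P$, and assuming that $|\psi\rangle$ is normalized so that

$$\langle\psi_0|\psi\rangle = 1 \tag{8.230}$$

(8.229) becomes

$$(E - H_0)\,Q\,|\psi\rangle = V\,|\psi\rangle - \Delta E\,|\psi_0\rangle \tag{8.231}$$

Taking the product of each side with the bra vector $\langle\psi_0|$, we find that

$$\Delta E = \langle\psi_0|V|\psi\rangle \tag{8.232}$$

Since, from (8.231), P operating on the right-hand side gives zero,

$$(E - H_0)\,Q\,|\psi\rangle = QV\,|\psi\rangle$$

or

$$(1 - P)\,|\psi\rangle = \frac{1}{E - H_0}\,QV\,|\psi\rangle$$

or, finally,

$$|\psi\rangle = |\psi_0\rangle + \frac{1}{E - H_0}\,QV\,|\psi\rangle \tag{8.233}$$

This leads, exactly as before, to the perturbation series

$$|\psi\rangle = |\psi_0\rangle + \frac{1}{E - H_0}\,QV\,|\psi_0\rangle + \frac{1}{E - H_0}\,QV\,\frac{1}{E - H_0}\,QV\,|\psi_0\rangle + \cdots \tag{8.234}$$

Substituting this in (8.232) gives

$$\Delta E = \langle\psi_0|V|\psi_0\rangle + \langle\psi_0|V\frac{1}{E - H_0}QV|\psi_0\rangle + \langle\psi_0|V\frac{1}{E - H_0}QV\frac{1}{E - H_0}QV|\psi_0\rangle + \cdots \tag{8.235}$$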


It is interesting to compare the results of second-order perturbation theory in the Rayleigh-Schrödinger and Brillouin-Wigner schemes. Writing

$$V_{0n} = \langle\psi_0|V|n\rangle \tag{8.236}$$

we obtain from (8.221) the Rayleigh-Schrödinger result

$$E = E_0 + V_{00} + {\sum_{n}}' \frac{|V_{0n}|^2}{E_0 - E_n} \tag{8.237}$$

while from (8.235) the Brillouin-Wigner method gives

$$E = E_0 + V_{00} + {\sum_{n}}' \frac{|V_{0n}|^2}{E - E_n} \tag{8.238}$$
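The difference between (8.237) and (8.238) is that the unknown energy $E$ appears in the Brillouin-Wigner denominator, so the equation has to be solved for $E$. The following is a minimal sketch, not part of the original text, for a hypothetical two-level system with $E_0 = 0$, $E_1 = \delta$, $V_{00} = V_{11} = 0$ and $V_{01} = v$ (all names and numbers are illustrative assumptions). For this special case the second-order Brillouin-Wigner equation coincides with the exact secular equation, which the fixed-point iteration below should reproduce.

```python
import math

def exact_ground(delta, v):
    """Lowest eigenvalue of H = [[0, v], [v, delta]] (E_0 = 0, E_1 = delta)."""
    return 0.5 * (delta - math.sqrt(delta * delta + 4.0 * v * v))

def rayleigh_schrodinger(delta, v):
    """Second-order result (8.237) with E_0 = 0 and V_00 = 0."""
    return v * v / (0.0 - delta)

def brillouin_wigner(delta, v, iterations=50):
    """Second-order result (8.238), solved for E by fixed-point iteration."""
    energy = 0.0  # start from the unperturbed energy E_0
    for _ in range(iterations):
        energy = v * v / (energy - delta)
    return energy

delta, v = 1.0, 0.2
print(exact_ground(delta, v), rayleigh_schrodinger(delta, v), brillouin_wigner(delta, v))
```

The iteration is only one simple way of solving the implicit equation; it is assumed here that the coupling is weak enough for it to converge.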


An interesting feature of the Rayleigh-Schrödinger method is the following: the second-order correction to the energy of the ground state is negative, even though the perturbation admixes components of higher energy states. This is due to the fact that the modification of the wave function by the perturbation is such that the mean value of $V$ in the perturbed state is less than in the unperturbed one.


Problem 8-17: (i) Calculate, to second order in $V$, the mean value of $H_0$ in the perturbed state, and the amount by which it is greater than the unperturbed ground state energy $E_0$.

(ii) Calculate, also to second order in $V$, the change in the mean value of $V$ due to the perturbation. Show that it is negative and twice as big as the energy calculated in (i). [Caution: Remember that normalization of the perturbed wave function must be taken into account.]


Problem 8-18: Two plane rotators, each with Hamiltonians of the form

$$-\frac{\hbar^2}{2ma^2}\frac{\partial^2}{\partial\theta^2}$$

are coupled by the potential

$$-B\cos(\theta_1 - \theta_2)$$

$\theta_1$ and $\theta_2$ being the angles of the two rotators. Use perturbation theory to estimate the energies of the coupled system.


PRELUDE TO CHAPTER 9 


This chapter is devoted to the solution of some of the 
standard problems of quantum mechanics. The first is 
the one-dimensional harmonic oscillator, which is 
analyzed in terms of “ladder operators,” one of which
increases the quantum number by 1 (creation operator) 
and one of which decreases it by 1 (annihilation 
operator). This problem provides the foundation both 
for the quantum theory of fields and for the “second
quantization” formalism which is the most logical
basis for many-body problems. 

The problem is solved both by operator algebra 
and by use of the Schrödinger wave equation. The
wave functions are expressible in terms of Hermite 
polynomials, introduced in Chapters 2 and 4. 

The second topic of the chapter is the theory of 
angular momentum, which plays a role in all areas of 
modern physics. The theory is developed on the basis 
of operator algebra, using only the commutation rules 
for the operators. Again, a creation-annihilation
formalism can be developed. Integer and half-odd-integer
($j$) states emerge from the same approach.

Orbital angular momentum, which arises from the 
separation of the Schrödinger equation in problems of
spherically symmetric potentials, is treated in spherical 
polar coordinates. The eigenfunctions are the spherical 
harmonics introduced and discussed in Chapter 3. 

The states $j = s = \tfrac12$ represent spin; the usual
Pauli spin operators are derived. It is then shown,
however, that the matrix operators of higher order
$(2j + 1)$ may be found for all $j$, including integral ones.

An alternative derivation of angular momentum 
theory due to Schwinger is then developed. Schwinger’s
approach makes use of pairs of creation and annihilation
operators, combined into so-called “spinor
operators.” The theory is therefore linked to that of
the harmonic oscillator. It is shown that angular 
momentum states may be expressed in a very simple 
way in terms of the two creation operators. 




We turn next to a very important theory —that of 
the addition of angular momenta. The addition may be 
that of several particles of a composite system, or of 
intrinsic and orbital angular momentum of a single 
particle. The composite angular momentum states 
are related to those of the components through the 
Clebsch-Gordan coefficients. Our purpose is not to 
give a general formula for such coefficients, which is 
very complicated and in any case can be looked up in 
handbooks, but rather to demonstrate the principles 
underlying the problem, and to give a straightforward 
method of calculation which can be used in those 
common problems in which $j$ is not too large.

We then turn to the problems of harmonic 
oscillators in two and three dimensions. Various 
approaches are considered. It is possible, using 
Cartesian coordinates, to decouple these problems 
into that of two or three one-dimensional ones. The 
states are then seen to be highly degenerate, and one 
must seek suitable quantum numbers to define the 
degenerate states. 

In two dimensions the problem is also solved in 
terms of polar coordinates; the wave function may be 
expressed as a confluent hypergeometric function. 

Finally, an operator method is introduced. Oper- 
ators are found which independently raise and lower 
the energy and the angular momentum; all states are 
then directly derivable from the ground state (quantum 
numbers zero) by successive use of creation operators.
These operators are given a Schrödinger
representation, so that the wave functions may be 
calculated directly by repeated operation on the 
ground state. 

The most important application of the two- 
dimensional oscillator theory is that of the quantum 
states of a charged particle in a constant magnetic 
field. 

The three-dimensional oscillator is first treated by 
a generalization of the method just discussed; and a 
new third creation and annihilation operator must be 
introduced. Simultaneous eigenstates characterized 
by three quantum numbers (number of quanta, two 
angular momentum quantum numbers) are derived by 
creation out of the ground state. 




The solution is also obtained from the Schrödinger
equation, by separation of variables. The angular 
functions are angular momentum eigenstates; the 
radial function is a confluent hypergeometric. 

The hydrogen atom problem is then studied, using 
the factorization method discussed first in Chapter 3. 
This method bypasses the usual complications of 
solution in series. The eigenvalues of energy come out 
in a very straightforward way, and excited states are 
again obtained by using appropriate creation operators. 

Finally, we consider the problem of scattering of 
a plane wave by a central potential of short range. The 
problem is treated in a manner quite parallel to that 
used for scattering of sound waves in Chapter 5, by 
using a “partial wave expansion” (expansion in
spherical harmonics, i.e., angular momentum eigenstates).
It is possible not only to treat pure (“potential”)
scattering, but also absorption inside a “scattering
center.” The quantum-mechanical problem is not
solved inside this center. The solution outside is 
shown to be completely determined by the logarithmic 
derivatives of the radial functions of the partial waves 
at its surface (that is, by the prescription of a sort of 
“surface impedance”).

Once “scattering”, “absorption”, and “total
(= scattering + absorption)” cross sections are deter-
mined, it follows that the inequalities developed in 
Chapter 5 hold once again. 

Finally, we consider very low-energy (angular 
momentum zero) scattering. We see that absorption 
may be described by giving the potential inside the 
scattering center an imaginary part. In this case we 
discuss “resonance scattering,” deriving the well- 
known Breit-Wigner formula of nuclear reaction 
theory. The conditions which define the location and 
width of the resonance are determined, and the inter- 
ference between “resonance” and “potential” 
scattering demonstrated. 


REFERENCES 


Gottfried, K., Quantum Mechanics. New York: Benjamin, 1965, Vol. 1.
Infeld, L. and T. E. Hull, Rev. Mod. Phys. 23, No. 1 (1951). 




Landau, L. D. and E. M. Lifshitz, Quantum Mechanics. Reading, Mass.: 
Addison-Wesley, 1958. 


Loeb, A. L. and L. Harris, Introduction to Wave Mechanics. New York: 
McGraw-Hill, 1963. 


Mattis, D., The Theory of Magnetism. New York: Harper and Row, 1965, 
Chapter 3. 


Messiah, A., Mécanique Quantique. Paris: Dunod, 1958. 


9


SOME SOLUBLE 
PROBLEMS OF 
QUANTUM 
MECHANICS 


“You people speak in terms of circles and 
ellipses and regular velocities —simple movements 
that the mind can grasp—very convenient—but 
Suppose almighty God had taken it into his head 
to make the stars move like that... (He describes 
an irregular motion with his finger through the air) 
... then where would you be?”


Bertold Brecht, Galileo 


1. Introduction 


In the previous chapter we have discussed some of the general 
principles of quantum mechanics. In this chapter, we develop the solu- 
tions of several of the commonest quantum systems. We begin with the 
harmonic oscillator, which plays a fundamental role in quantum field 
theory and provides an introduction to the concept of creation and anni- 
hilation operators. The problem is discussed in one, two, and three dimen- 
sions. The case of two dimensions is important in connection with the 
states of electrons in constant magnetic fields. The three-dimensional 
case arises in the theory of nuclear shell structure. 

The discussion of three-dimensional central force problems leads us 
to simultaneous states of energy and angular momentum. We therefore 
develop the elements of the theory of angular momentum operators. 

We then consider the bound states of the Coulomb potential, and 
solve the so-called “hydrogenic atom” problem.

Finally, using the methods introduced in Chapter 5 for the problem of 
the scattering of sound waves, we develop the “partial wave” theory of 
scattering of particles by a fixed, bounded potential. 




2. Harmonic Oscillator in One 
Dimension 


If a particle of mass m is acted upon by a restoring force proportional 
to its displacement from a fixed point of equilibrium, it is known as a 
“harmonic oscillator.”

Its Hamiltonian is

$$H = \frac{p^2}{2m} + \tfrac12 kx^2 \tag{9.1}$$


if the displacement is in one dimension only. In the theory of the vibrat- 
ing string (Chapter 1) we also met the classical harmonic oscillator prob- 
lem; the “displacement” in that problem was the amplitude of a normal 
mode of vibration. It turns up, in fact, in all linear vibration problems in- 
cluding, for example, the vibrations of an electromagnetic field. Such a 
field, in an enclosure, can also be resolved into normal modes, and the 
Hamiltonian may be expressed in terms of coordinates which are the 
amplitudes of these normal modes, and conjugate momenta. 

Consequently, the problem of the harmonic oscillator plays an im- 
portant role in a number of quantum-mechanical problems of physical 
interest. 

The solution of this problem is straightforward, but contains a num- 
ber of interesting features. We, therefore, explore it thoroughly. 

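The Schrödinger wave equation of the problem is

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \tfrac12 kx^2\psi = E\psi \tag{9.2}$$

If we introduce a dimensionless coordinate

$$q = \left(\frac{mk}{\hbar^2}\right)^{1/4} x \tag{9.3}$$

and a dimensionless energy

$$\epsilon = \frac{E}{\hbar\sqrt{k/m}} = \frac{E}{\hbar\omega} \tag{9.4}$$

this equation takes the simpler form

$$-\frac12\frac{d^2\psi}{dq^2} + \frac12 q^2\psi = \epsilon\psi \tag{9.5}$$

Putting

$$p = -i\frac{d}{dq} \tag{9.6}$$

it may be written

$$\tfrac12(p^2 + q^2)\psi = \epsilon\psi \tag{9.7}$$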




The commutation relation of $p$ and $q$ is

$$[p, q] = -i \tag{9.8}$$

It is, therefore, as though we had chosen units for which $\hbar = 1$.

The operator on the left-hand side of (9.7) may be written in the following two ways:

$$\tfrac12(p^2 + q^2) = \tfrac12(q + ip)(q - ip) - \tfrac12 \tag{9.9}$$

or

$$\tfrac12(p^2 + q^2) = \tfrac12(q - ip)(q + ip) + \tfrac12 \tag{9.10}$$

The eigenvalue equation for $\epsilon$ then has the alternative forms

$$\tfrac12(q + ip)(q - ip)\psi = (\epsilon + \tfrac12)\psi \tag{9.11}$$

or

$$\tfrac12(q - ip)(q + ip)\psi = (\epsilon - \tfrac12)\psi \tag{9.12}$$

Let us now operate on (9.11) with $q - ip$, to obtain

$$\tfrac12(q - ip)(q + ip)\,[(q - ip)\psi] = (\epsilon + \tfrac12)\,[(q - ip)\psi] \tag{9.13}$$


Comparing this with (9.12), we see that if $\psi$ is an eigenfunction with eigenvalue $\epsilon$, $(q - ip)\psi$ is an eigenfunction with eigenvalue $(\epsilon + 1)$. We write, for brevity

$$\frac{1}{\sqrt2}(q - ip) = a^\dagger \tag{9.14}$$


Thus, $a^\dagger$ is an operator which produces, from one eigenfunction, another for which the eigenvalue is greater by 1. We say, then, that $a^\dagger$ has produced one quantum; we may call it a “creation operator.”

Let us return now to Eq. (9.12) and operate on it with $(q + ip)$ to obtain

$$\tfrac12(q + ip)(q - ip)\,[(q + ip)\psi] = (\epsilon - \tfrac12)\,[(q + ip)\psi] \tag{9.15}$$

Comparing this with (9.11), we see that if $\psi$ is an eigenfunction corresponding to the eigenvalue $\epsilon$, $(q + ip)\psi$ is an eigenfunction corresponding to the eigenvalue $(\epsilon - 1)$. Thus,

$$a = \frac{1}{\sqrt2}(q + ip) \tag{9.16}$$

lowers the quantum number by 1, and is designated an “annihilation operator” or “destruction operator.”

Note that the commutator

$$[a, a^\dagger] = \tfrac12[q + ip,\, q - ip] = 1 \tag{9.17}$$




The Hamiltonian may be written either

$$H = aa^\dagger - \tfrac12 \tag{9.18}$$

or

$$H = a^\dagger a + \tfrac12 \tag{9.19}$$

or, finally

$$H = \tfrac12(aa^\dagger + a^\dagger a) \tag{9.20}$$

All the above could, of course, have been done in terms of the bra and ket vectors, without using the position representation. If the original coordinate is $x$ and its conjugate momentum $p_x$, we make the transformation (9.3) and put

$$p = \left(\frac{1}{mk\hbar^2}\right)^{1/4} p_x \tag{9.21}$$

and

$$H = \hbar\sqrt{\frac{k}{m}}\,H_0 \tag{9.22}$$

Then

$$H_0 = \tfrac12(p^2 + q^2) \tag{9.23}$$

and we may follow through the whole derivation starting from (9.9), using the vector $|\psi\rangle$ in place of the amplitude $\psi(q) = \langle q|\psi\rangle$ in the coordinate representation.

We have seen that by means of the operator $a$ we can in general create from an eigenstate $|\psi\rangle$ with eigenvalue $\epsilon$ another eigenstate $a|\psi\rangle$ with eigenvalue $\epsilon - 1$. However, it is easy to show that this reduction of the eigenvalue cannot continue indefinitely, but that there is a state of minimum (positive) eigenvalue. For consider the magnitude squared of $a|\psi\rangle$, namely,

$$\langle\psi|a^\dagger a|\psi\rangle$$

where $|\psi\rangle$ is the eigenstate of $H_0$ with eigenvalue $\epsilon$. Using (9.19)

$$\langle\psi|a^\dagger a|\psi\rangle = (\epsilon - \tfrac12)\langle\psi|\psi\rangle \ge 0 \tag{9.24}$$

Thus,

$$\epsilon \ge \tfrac12 \tag{9.25}$$

But if $|\psi\rangle$ is the state with the lowest eigenvalue, then $a|\psi\rangle$ must be the zero vector, since otherwise it would be an eigenstate with a still lower eigenvalue. Consequently, by (9.24), the lowest eigenvalue must be

$$\epsilon_0 = \tfrac12$$


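and the other eigenstates obtained by operating successively with the “creation” operator $a^\dagger$ have eigenvalues

$$\epsilon_n = n + \tfrac12 \tag{9.26}$$

The corresponding eigenvectors are designated $|\psi_n\rangle$. If we choose these eigenvectors to be normalized, and put

$$|\psi_{n-1}\rangle = \lambda_n a|\psi_n\rangle \tag{9.27}$$

we see from (9.24) that

$$1 = |\lambda_n|^2 n \tag{9.28}$$

We may take $\lambda_n$ to be real; then

$$a|\psi_n\rangle = \sqrt{n}\,|\psi_{n-1}\rangle \tag{9.29}$$

Multiplying by $a^\dagger$

$$a^\dagger a|\psi_n\rangle = n|\psi_n\rangle = \sqrt{n}\,a^\dagger|\psi_{n-1}\rangle$$

so that

$$a^\dagger|\psi_{n-1}\rangle = \sqrt{n}\,|\psi_n\rangle \tag{9.30}$$

From (9.29) and (9.30) we can write the matrix elements of $q$ and $p$:

$$\langle\psi_m|q|\psi_n\rangle = \langle\psi_m|\frac{a + a^\dagger}{\sqrt2}|\psi_n\rangle = \sqrt{\frac{n}{2}}\,\delta_{m,n-1} + \sqrt{\frac{n+1}{2}}\,\delta_{m,n+1} \tag{9.31}$$

and

$$\langle\psi_m|p|\psi_n\rangle = \frac{1}{i\sqrt2}\langle\psi_m|a - a^\dagger|\psi_n\rangle = -i\left[\sqrt{\frac{n}{2}}\,\delta_{m,n-1} - \sqrt{\frac{n+1}{2}}\,\delta_{m,n+1}\right] \tag{9.32}$$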
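The matrix elements (9.29) to (9.32) can be checked numerically by writing $a$ as a finite matrix. The sketch below is an illustration added here, not the author's method; it truncates the basis at a hypothetical `size` states, so the last row and column are distorted by the cutoff and only the upper-left block is meaningful.

```python
import math

def ladder_matrices(size):
    """Return (a, a_dagger) as nested lists in the basis |psi_0>, ..., |psi_{size-1}>."""
    a = [[0.0] * size for _ in range(size)]
    for n in range(1, size):
        a[n - 1][n] = math.sqrt(n)  # <psi_{n-1}| a |psi_n> = sqrt(n), Eq. (9.29)
    a_dagger = [[a[j][i] for j in range(size)] for i in range(size)]  # transpose (entries are real)
    return a, a_dagger

def matmul(x, y):
    """Product of two square matrices stored as nested lists."""
    size = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(size)) for j in range(size)]
            for i in range(size)]

def q_and_p(size):
    """Matrices of q = (a + a_dagger)/sqrt(2) and p = -i (a - a_dagger)/sqrt(2)."""
    a, a_dagger = ladder_matrices(size)
    root2 = math.sqrt(2.0)
    q = [[(a[i][j] + a_dagger[i][j]) / root2 for j in range(size)] for i in range(size)]
    p = [[-1j * (a[i][j] - a_dagger[i][j]) / root2 for j in range(size)] for i in range(size)]
    return q, p

size = 8
q, p = q_and_p(size)
q2, p2 = matmul(q, q), matmul(p, p)
# H_0 = (p^2 + q^2)/2; its diagonal should be n + 1/2 except in the last (truncated) row
h0 = [[0.5 * (p2[i][j] + q2[i][j]) for j in range(size)] for i in range(size)]
print([round(h0[n][n].real, 10) for n in range(size - 1)])
```

Away from the cutoff the diagonal of `h0` should reproduce (9.26), and the commutator of `q` and `p` should equal $i$, in agreement with (9.8).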


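Problem 9-1: Use the fact that $q = (1/\sqrt2)(a + a^\dagger)$ to show that the diagonal matrix elements of even powers of $x$ in the ground state are

$$\langle 0|x^{2n}|0\rangle = \frac{(2n)!}{2^{2n}\,n!}\left(\frac{\hbar}{m\omega}\right)^{n}$$

Show also that

$$\langle s|x^4|s\rangle = \tfrac34(2s^2 + 2s + 1)\left(\frac{\hbar}{m\omega}\right)^{2}, \qquad s \ge 1$$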




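Let us rewrite the energy and the position matrix elements in terms of the original coordinates. The energy levels are

$$E_n = (n + \tfrac12)\hbar\sqrt{\frac{k}{m}} = (n + \tfrac12)\hbar\omega \tag{9.33}$$

Since

$$q = \left(\frac{mk}{\hbar^2}\right)^{1/4} x = \left(\frac{m\omega}{\hbar}\right)^{1/2} x \tag{9.34}$$

the matrix elements of $x$ are

$$\langle\psi_m|x|\psi_n\rangle = \sqrt{\frac{\hbar}{2m\omega}}\left\{\sqrt{n}\,\delta_{m,n-1} + \sqrt{n+1}\,\delta_{m,n+1}\right\} \tag{9.35}$$

while those of $p_x$ are

$$\langle\psi_m|p_x|\psi_n\rangle = -i\sqrt{\frac{m\hbar\omega}{2}}\left\{\sqrt{n}\,\delta_{m,n-1} - \sqrt{n+1}\,\delta_{m,n+1}\right\} \tag{9.36}$$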


3. Schrödinger Wave Functions
for the Harmonic Oscillator 


The problem of the wave functions of the one-dimensional harmonic oscillator may be approached in several ways. If we are interested in generating the functions of low quantum numbers, the simplest method is to solve first the equation for the lowest-energy state

$$a|\psi_0\rangle = 0 \tag{9.37}$$

in the Schrödinger form

$$\left(\frac{d}{dq} + q\right)\psi_0 = 0 \tag{9.38}$$

which has the normalized solution

$$\psi_0 = \left(\frac{1}{\pi}\right)^{1/4} e^{-q^2/2} \tag{9.39}$$

Then, successive application of the operator

$$a^\dagger = \frac{1}{\sqrt2}(q - ip) = \frac{1}{\sqrt2}\left(q - \frac{d}{dq}\right) \tag{9.40}$$

gives the higher eigenfunctions.
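The repeated application of (9.40) can be carried out mechanically on polynomial coefficients. The sketch below is illustrative and rests on one observation: acting with $(q - d/dq)$ on $P(q)e^{-q^2/2}$ gives $(2qP - P')e^{-q^2/2}$. The function names are hypothetical, and the normalization $1/\sqrt{\sqrt\pi\,2^n n!}$ follows from $a^\dagger|\psi_{n-1}\rangle = \sqrt{n}\,|\psi_n\rangle$ together with (9.39).

```python
import math

def raise_polynomial(coeffs):
    """Apply (q - d/dq) to P(q) exp(-q^2/2); return the coefficients of 2 q P - P'.

    coeffs[k] is the coefficient of q**k.
    """
    result = [0] * (len(coeffs) + 1)
    for power, c in enumerate(coeffs):
        result[power + 1] += 2 * c          # contribution of 2 q P
        if power > 0:
            result[power - 1] -= power * c  # contribution of -P'
    return result

def oscillator_polynomial(n):
    """Polynomial factor of (q - d/dq)^n acting on exp(-q^2/2)."""
    coeffs = [1]
    for _ in range(n):
        coeffs = raise_polynomial(coeffs)
    return coeffs

def psi(n, q):
    """Normalized oscillator function psi_n(q) in the dimensionless coordinate q."""
    coeffs = oscillator_polynomial(n)
    norm = 1.0 / math.sqrt(math.sqrt(math.pi) * 2 ** n * math.factorial(n))
    poly = sum(c * q ** k for k, c in enumerate(coeffs))
    return norm * poly * math.exp(-0.5 * q * q)

print(oscillator_polynomial(4))
```

The integer coefficients returned for each `n` can be compared, after removing a common factor, with the polynomials quoted in Problem 9-2.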


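Problem 9-2: Obtain the formulas for the normalized oscillator functions up to $n = 4$:

$$\psi_1 = \left(\frac{4}{\pi}\right)^{1/4} q\,e^{-q^2/2}$$

$$\psi_2 = \left(\frac{1}{4\pi}\right)^{1/4} (2q^2 - 1)\,e^{-q^2/2}$$

$$\psi_3 = \left(\frac{1}{9\pi}\right)^{1/4} (2q^3 - 3q)\,e^{-q^2/2}$$

$$\psi_4 = \left(\frac{1}{576\pi}\right)^{1/4} (4q^4 - 12q^2 + 3)\,e^{-q^2/2}$$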

Problem 9-3: Find the momentum eigenfunctions of 
the one-dimensional harmonic oscillator. [Rather than trying to do it 
by Fourier transformation, calculate them from first principles. ] 


A more direct approach relates the wave functions to Hermite polynomials and confluent hypergeometric functions. Let us substitute

$$\psi = e^{-q^2/2}\,u \tag{9.41}$$

in the Schrödinger equation (9.5)

$$\frac12\frac{d^2\psi}{dq^2} + \left(\epsilon - \tfrac12 q^2\right)\psi = 0 \tag{9.42}$$

We obtain

$$u'' - 2qu' + 2nu = 0 \tag{9.43}$$

where the prime indicates differentiation with respect to $q$, and we have substituted the known values of $\epsilon$. This is the Hermite equation (4.303) for variable $\sqrt2\,q$, so that

$$\psi_n = C_n\,e^{-q^2/2}\,H_n(\sqrt2\,q) \tag{9.44}$$

These may be written in terms of confluent hypergeometric functions by means of Eqs. (4.318) and (4.319).

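There is a simple way to find the normalization factors $C_n$, by using the generating function for Hermite polynomials given by Eq. (4.309)

$$e^{\sqrt2\,qs - (1/2)s^2} = \sum_{n=0}^{\infty} H_n(\sqrt2\,q)\,\frac{s^n}{n!} \tag{9.45}$$

Putting

$$\sqrt2\,s = \sigma$$

and multiplying by $e^{-q^2/2}$ this becomes

$$e^{-q^2/2 + \sigma q - \sigma^2/4} = \sum_{n=0}^{\infty} \frac{\psi_n}{C_n}\,\frac{\sigma^n}{2^{n/2}\,n!} \tag{9.46}$$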




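Similarly we may write

$$e^{-q^2/2 + \tau q - \tau^2/4} = \sum_{m=0}^{\infty} \frac{\psi_m}{C_m}\,\frac{\tau^m}{2^{m/2}\,m!} \tag{9.47}$$

Let us now multiply together (9.46) and (9.47), and integrate the result with respect to $q$ from $-\infty$ to $\infty$. On the left-hand side we get

$$e^{-(\sigma^2 + \tau^2)/4}\int_{-\infty}^{\infty} e^{-q^2 + (\sigma + \tau)q}\,dq = e^{\sigma\tau/2}\int_{-\infty}^{\infty} e^{-[q - (\sigma + \tau)/2]^2}\,dq = \sqrt{\pi}\,e^{\sigma\tau/2}$$

Therefore,

$$\sqrt{\pi}\,e^{\sigma\tau/2} = \sum_{n,m=0}^{\infty} \int_{-\infty}^{\infty}\psi_n\psi_m\,dq\;\frac{1}{C_nC_m}\,\frac{\sigma^n\tau^m}{2^{(m+n)/2}\,n!\,m!} \tag{9.48}$$

It follows that all the terms on the right with $m \ne n$ are zero (verifying the known orthogonality of the oscillator functions); on the other hand, equating on the two sides the coefficients of $(\sigma\tau)^n$ yields

$$\sqrt{\pi} = \int_{-\infty}^{\infty}\psi_n^2\,dq\;\frac{1}{C_n^2\,n!} \tag{9.49}$$

and therefore

$$C_n = \left[\frac{1}{\sqrt{\pi}\,n!}\right]^{1/2} \tag{9.50}$$

Thus, the normalized oscillator functions are

$$\psi_n = \left[\frac{1}{\sqrt{\pi}\,n!}\right]^{1/2} e^{-q^2/2}\,H_n(\sqrt2\,q) \tag{9.51}$$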


4. Elements of the Theory of 
Angular Momentum 


The Poisson brackets of arbitrary components $L_i$, $L_j$ of the classical angular momentum vector may be calculated to be

$$\{L_i, L_j\} = \epsilon_{ijk}L_k \tag{9.52}$$

where $\epsilon_{ijk}$ is the permutation symbol. Using the rule for relating the commutators of quantum operators to Poisson brackets of the corresponding classical variables, we are led to the commutation relations

$$[L_i, L_j] = i\hbar\,\epsilon_{ijk}L_k \tag{9.53}$$




Written out in full, these relations are

$$[L_x, L_y] = i\hbar L_z \tag{9.54}$$

$$[L_y, L_z] = i\hbar L_x \tag{9.55}$$

and

$$[L_z, L_x] = i\hbar L_y \tag{9.56}$$


As we will see, the eigenstates and eigenvalues of angular momentum 
may be determined from these commutation relations alone. However, 
the states determined cannot all be associated with orbital angular 
momentum about the origin of coordinates, but rather require the intro- 
duction into the physics of the problem of an intrinsic angular momentum 
or “spin.” It is convenient to reserve the symbol L for orbital angular 
momentum, and to use J to denote the general operator satisfying the 
commutation relations (9.54) to (9.56). In using these commutation rules 
alone, we derive a theory of all angular-momentum-like quantities; 
whether the states deduced exist in physics is a matter of observation 
rather than of theory. 

Let us, then, proceed to the determination of angular momentum states.

We note first, of course, that since the components of angular 
momentum do not commute with each other, simultaneous eigenstates do 
not in general exist for any two components. 

On the other hand, each component commutes with $J^2 = J_x^2 + J_y^2 + J_z^2$. For

$$[J^2, J_x] = [J_x^2 + J_y^2 + J_z^2,\, J_x] = [J_y^2, J_x] + [J_z^2, J_x]$$

But in general

$$[A^2, B] = A^2B - BA^2 = A(AB - BA) + (AB - BA)A = A[A, B] + [A, B]A \tag{9.57}$$

Therefore,

$$[J^2, J_x] = J_y(-i\hbar J_z) - i\hbar J_zJ_y + J_z(i\hbar J_y) + i\hbar J_yJ_z = 0 \tag{9.58}$$

It follows similarly that $J_y$ and $J_z$ commute with $J^2$.

We arbitrarily choose to work with eigenstates of $J^2$ and $J_z$; we designate the eigenvalues of $J^2$ as $j(j+1)\hbar^2$ and those of $J_z$ as $m\hbar$. No a priori assumptions are made about the numbers $j$ and $m$. The simultaneous eigenstates are designated $|jm\rangle$.




In a manner suggestive of the calculation of the energy levels of the harmonic oscillator, we may introduce operators which raise or lower the quantum number $m$ for a fixed $j$. Let us introduce

$$J_\pm = J_x \pm iJ_y \tag{9.59}$$

The commutation rules between $J_+$, $J_-$ and $J_z$ are easily verified to be

$$[J_\pm, J_z] = \mp\hbar J_\pm \tag{9.60}$$

and

$$[J_+, J_-] = 2\hbar J_z \tag{9.61}$$

$J^2$ may also be written quite easily in terms of $J_+$, $J_-$, and $J_z$:

$$J^2 = J_+J_- + J_z^2 - \hbar J_z \tag{9.62}$$

or

$$J^2 = J_-J_+ + J_z^2 + \hbar J_z \tag{9.63}$$


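Consider now the equation, which follows from (9.60),

$$[J_+, J_z]\,|jm\rangle = -\hbar J_+|jm\rangle$$

which, since $|jm\rangle$ is an eigenstate of $J_z$, becomes

$$m\hbar\,(J_+|jm\rangle) - J_z(J_+|jm\rangle) = -\hbar\,(J_+|jm\rangle)$$

or

$$J_z(J_+|jm\rangle) = (m + 1)\hbar\,(J_+|jm\rangle) \tag{9.64}$$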


Thus the operator $J_+$ has raised the value of $m$ by 1 without affecting $J^2$. In fact, from the relation

$$[J^2, J_+] = 0 \tag{9.65}$$

operating on $|jm\rangle$ we see that

$$J^2(J_+|jm\rangle) = j(j+1)\hbar^2\,(J_+|jm\rangle) \tag{9.66}$$

From (9.64) we may write

$$J_+|jm\rangle = A_{jm}\,|j, m+1\rangle \tag{9.67}$$

where $A_{jm}$ is a constant which may be taken to be real. Assuming the states $|jm\rangle$ to be normalized, $A_{jm}$ may be calculated by taking the magnitude squared of both sides of (9.67). For this purpose we note that the conjugate of $J_+|\;\rangle$ is $\langle\;|J_-$. Therefore,

$$\langle jm|J_-J_+|jm\rangle = A_{jm}^2 \tag{9.68}$$

From (9.63) this may be written

$$\langle jm|J^2 - J_z^2 - \hbar J_z|jm\rangle = A_{jm}^2$$

or

$$A_{jm}^2 = [j(j+1) - m(m+1)]\hbar^2 \tag{9.69}$$

Thus

$$J_+|jm\rangle = \hbar\sqrt{j(j+1) - m(m+1)}\;|j, m+1\rangle \tag{9.70}$$

We may proceed in a precisely similar manner starting from the commutation relation between $J_-$ and $J_z$, to show that

$$\text{(a)}\quad J_-|jm\rangle = B_{jm}\,|j, m-1\rangle \tag{9.71}$$

$$\text{(b)}\quad B_{jm} = \hbar\sqrt{j(j+1) - m(m-1)} \tag{9.72}$$
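The coefficients (9.70) and (9.72) are all that is needed to write the operators as matrices for any given $j$. The following sketch, added for illustration with $\hbar = 1$ and hypothetical function names, builds $J_z$, $J_+$ and $J_-$ in the basis $m = j, j-1, \ldots, -j$ and evaluates the right-hand side of (9.63), which should come out as $j(j+1)$ times the unit matrix.

```python
import math
from fractions import Fraction

def angular_momentum_matrices(j):
    """Matrices of J_z, J_+, J_- (units of hbar) in the basis m = j, j-1, ..., -j."""
    j = Fraction(j)
    dim = int(2 * j) + 1
    ms = [j - k for k in range(dim)]
    jz = [[float(ms[r]) if r == c else 0.0 for c in range(dim)] for r in range(dim)]
    jp = [[0.0] * dim for _ in range(dim)]
    for c in range(1, dim):
        m = ms[c]
        # <j, m+1| J_+ |j, m> = sqrt(j(j+1) - m(m+1)), Eq. (9.70)
        jp[c - 1][c] = math.sqrt(float(j * (j + 1) - m * (m + 1)))
    # J_- is the transpose of J_+ because the coefficients are real
    jm = [[jp[c][r] for c in range(dim)] for r in range(dim)]
    return jz, jp, jm

def matmul(x, y):
    """Product of two square matrices stored as nested lists."""
    dim = len(x)
    return [[sum(x[r][k] * y[k][c] for k in range(dim)) for c in range(dim)]
            for r in range(dim)]

def casimir(j):
    """Evaluate J_- J_+ + J_z^2 + J_z, the right-hand side of Eq. (9.63) with hbar = 1."""
    jz, jp, jm = angular_momentum_matrices(j)
    dim = len(jz)
    jmjp, jz2 = matmul(jm, jp), matmul(jz, jz)
    return [[jmjp[r][c] + jz2[r][c] + jz[r][c] for c in range(dim)] for r in range(dim)]

print(casimir(Fraction(3, 2)))
```

Passing `j` as a `Fraction` (or an integer) avoids rounding problems for half-integral values.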


Suppose, then, that we have an arbitrary state $|jm\rangle$. By continued application of the operator $J_+$, we may obtain states with $m$ increasing by steps of 1, $j$ remaining fixed. However, (9.69) implies that $m$ may never exceed $j$. The contradiction can only be resolved if at some point $A_{jm} = 0$, or $m = j$; the chain is then broken.

But now let us operate successively on this state of maximum $m$, viz., $|jj\rangle$, with the operator $J_-$. $m$ may then be continually decreased by 1, giving values $j - 1$, $j - 2$, .... This time, it would seem that $m$ could be reduced indefinitely. But that is also not true, since from (9.72) $m$ cannot be less than $-j$. For if it were, $B_{jm}$ would be imaginary, whereas we assume it originally to be real. This time, then, the only way to break the chain at the lower end is to assume that the minimum value of $m$ is $-j$, so that no further states may be produced from $|j, -j\rangle$.

The situation, then, is this: that starting from $m = j$, and reducing it by steps of 1, we reach a state $m = -j$. It follows that $2j$ must be a nonnegative integer, so that $j$ is an integer or half an odd integer:

$$j = 0, \tfrac12, 1, \tfrac32, 2, \ldots \tag{9.73}$$

while, for a given $j$, $m$ runs by intervals of unity from $-j$ to $j$.

Starting from the state $|jj\rangle$ (“stretched state”) successive application of the $J_-$ operator produces all the other states $|jm\rangle$.

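For $j$ equal to integral values $l$ it is straightforward to determine Schrödinger wave functions for the states.

The Schrödinger operator for $L_z$ is

$$L_z = -i\hbar\left(x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}\right) \tag{9.74}$$

Using the relations between Cartesian and polar coordinates

$$x = r\sin\theta\cos\phi, \qquad y = r\sin\theta\sin\phi, \qquad z = r\cos\theta \tag{9.75}$$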




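we see that

$$\frac{\partial}{\partial\phi} = -r\sin\theta\sin\phi\,\frac{\partial}{\partial x} + r\sin\theta\cos\phi\,\frac{\partial}{\partial y} = x\frac{\partial}{\partial y} - y\frac{\partial}{\partial x}$$

so that

$$L_z = -i\hbar\frac{\partial}{\partial\phi} \tag{9.76}$$

But now

$$L_x + iL_y = \hbar\left[z\left(\frac{\partial}{\partial x} + i\frac{\partial}{\partial y}\right) - (x + iy)\frac{\partial}{\partial z}\right] \tag{9.77}$$

and

$$L_x - iL_y = \hbar\left[(x - iy)\frac{\partial}{\partial z} - z\left(\frac{\partial}{\partial x} - i\frac{\partial}{\partial y}\right)\right] \tag{9.78}$$

Putting

$$\xi = x + iy = r\sin\theta\,e^{i\phi}, \qquad \xi^* = x - iy = r\sin\theta\,e^{-i\phi} \tag{9.79}$$

we have

$$\frac{\partial}{\partial x} + i\frac{\partial}{\partial y} = 2\frac{\partial}{\partial\xi^*}, \qquad \frac{\partial}{\partial x} - i\frac{\partial}{\partial y} = 2\frac{\partial}{\partial\xi} \tag{9.80}$$

Therefore,

$$L_x + iL_y = \hbar\left[2z\frac{\partial}{\partial\xi^*} - \xi\frac{\partial}{\partial z}\right] \tag{9.81}$$

$$L_x - iL_y = \hbar\left[\xi^*\frac{\partial}{\partial z} - 2z\frac{\partial}{\partial\xi}\right] \tag{9.82}$$

We may now write $\partial/\partial\theta$ and $\partial/\partial\phi$ in terms of $\xi$, $\xi^*$, and $z$:

$$\frac{\partial}{\partial\theta} = r\cos\theta\,e^{i\phi}\frac{\partial}{\partial\xi} + r\cos\theta\,e^{-i\phi}\frac{\partial}{\partial\xi^*} - r\sin\theta\,\frac{\partial}{\partial z} \tag{9.83}$$

$$-i\frac{\partial}{\partial\phi} = r\sin\theta\,e^{i\phi}\frac{\partial}{\partial\xi} - r\sin\theta\,e^{-i\phi}\frac{\partial}{\partial\xi^*} \tag{9.84}$$

If the second of these equations is multiplied by $\cot\theta$ and added to the first, and the result is multiplied by $e^{-i\phi}$, it is found that

$$e^{-i\phi}\left(\frac{\partial}{\partial\theta} - i\cot\theta\,\frac{\partial}{\partial\phi}\right) = 2z\frac{\partial}{\partial\xi} - \xi^*\frac{\partial}{\partial z} \tag{9.85}$$

It follows from (9.82) that

$$L_x - iL_y = -\hbar\,e^{-i\phi}\left(\frac{\partial}{\partial\theta} - i\cot\theta\,\frac{\partial}{\partial\phi}\right) \tag{9.86}$$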



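If, on the other hand, (9.84) is multiplied by $\cot\theta$ and subtracted from (9.83), and the result is multiplied by $e^{i\phi}$, we obtain instead

$$e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\phi}\right) = 2z\frac{\partial}{\partial\xi^*} - \xi\frac{\partial}{\partial z} \tag{9.87}$$

It follows from (9.81) that

$$L_x + iL_y = \hbar\,e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\phi}\right) \tag{9.88}$$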


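From (9.62) it is possible to calculate the operator for $L^2$:

$$L^2 = L_+L_- + L_z^2 - \hbar L_z = -\hbar^2 e^{i\phi}\left(\frac{\partial}{\partial\theta} + i\cot\theta\,\frac{\partial}{\partial\phi}\right) e^{-i\phi}\left(\frac{\partial}{\partial\theta} - i\cot\theta\,\frac{\partial}{\partial\phi}\right) - \hbar^2\frac{\partial^2}{\partial\phi^2} + i\hbar^2\frac{\partial}{\partial\phi}$$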


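Since there are simultaneous eigenstates of $L^2$ and $L_z$, these states may be written

$$\psi_{lm} = S_{lm}(\theta)\,e^{im\phi} \tag{9.89}$$

Substituting this into

$$L^2\psi_{lm} = l(l+1)\hbar^2\,\psi_{lm}$$

we obtain for $S_{lm}$ the equation

$$-\left[\frac{d}{d\theta} - (m - 1)\cot\theta\right]\left[\frac{d}{d\theta} + m\cot\theta\right]S_{lm} = [l(l+1) - m(m-1)]\,S_{lm}$$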


Referring back to Eqs. (3.42), (3.43), and (3.44), we see that the wave functions

$$\psi_{lm} = S_{lm}\,e^{im\phi} \tag{9.90}$$

are spherical harmonics; their properties were very thoroughly investigated in Chapter 3. In the notation of that chapter

$$\psi_{lm} = \Theta_{lm}(\theta)\,e^{im\phi} = Y_{lm}(\theta, \phi) \tag{9.91}$$


5. Matrix Representation of 
Angular Momentum Operators 


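Let us calculate the matrix elements for the simplest cases: $j = \tfrac12$ and $j = 1$. In the first case we put $\mathbf{J} = \mathbf{s}$, designating it as the spin operator. Describing the state $|\tfrac12, \tfrac12\rangle$ by the column matrix $\binom{1}{0}$ and $|\tfrac12, -\tfrac12\rangle$ by $\binom{0}{1}$, the matrices for $s_z$, $s_+$, and $s_-$ become

$$s_z = \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}\frac{\hbar}{2} = \tfrac12\hbar\sigma_z \tag{9.92}$$

$$s_+ = \begin{pmatrix} 0 & 1 \\ 0 & 0 \end{pmatrix}\hbar = \tfrac12\hbar\sigma_+ \tag{9.93}$$

$$s_- = \begin{pmatrix} 0 & 0 \\ 1 & 0 \end{pmatrix}\hbar = \tfrac12\hbar\sigma_- \tag{9.94}$$

From the latter two it follows that

$$s_x = \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}\frac{\hbar}{2} = \tfrac12\hbar\sigma_x \tag{9.95}$$

and

$$s_y = \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}\frac{\hbar}{2} = \tfrac12\hbar\sigma_y \tag{9.96}$$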

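The matrices for the case $j = 1$ are constructed from matrix elements as follows:

$$\langle 1|J_z|1\rangle = \hbar, \qquad \langle -1|J_z|-1\rangle = -\hbar$$

where $|m\rangle$ is a shorthand for $|jm\rangle$. For $J_+$ we have as the only nonzero matrix elements

$$\langle 1|J_+|0\rangle = \sqrt2\,\hbar, \qquad \langle 0|J_+|-1\rangle = \sqrt2\,\hbar$$

Finally for $J_-$ the only nonzero elements are

$$\langle 0|J_-|1\rangle = \sqrt2\,\hbar$$

and

$$\langle -1|J_-|0\rangle = \sqrt2\,\hbar$$

The matrix operators are then easily constructed as follows:

$$J_z = \hbar\begin{pmatrix} 1 & 0 & 0 \\ 0 & 0 & 0 \\ 0 & 0 & -1 \end{pmatrix}, \qquad J_+ = \sqrt2\,\hbar\begin{pmatrix} 0 & 1 & 0 \\ 0 & 0 & 1 \\ 0 & 0 & 0 \end{pmatrix}, \qquad J_- = \sqrt2\,\hbar\begin{pmatrix} 0 & 0 & 0 \\ 1 & 0 & 0 \\ 0 & 1 & 0 \end{pmatrix} \tag{9.97}$$

from which we obtain

$$J_x = \frac{\hbar}{\sqrt2}\begin{pmatrix} 0 & 1 & 0 \\ 1 & 0 & 1 \\ 0 & 1 & 0 \end{pmatrix}$$

and

$$J_y = \frac{\hbar}{\sqrt2}\begin{pmatrix} 0 & -i & 0 \\ i & 0 & -i \\ 0 & i & 0 \end{pmatrix} \tag{9.98}$$


Problem 9-4: The dipole-dipole interaction energy of two particles with magnetic moments $\boldsymbol\mu_1 = g_1\beta\boldsymbol\sigma_1$ and $\boldsymbol\mu_2 = g_2\beta\boldsymbol\sigma_2$ is

$$\frac{1}{r^3}\left[\boldsymbol\mu_1\cdot\boldsymbol\mu_2 - \frac{3(\boldsymbol\mu_1\cdot\mathbf{r})(\boldsymbol\mu_2\cdot\mathbf{r})}{r^2}\right]$$

Find the eigenvalues of this operator when the individual spins are $\tfrac12$ and 1, respectively. $\beta$ is the Bohr magneton $e\hbar/2mc$ and the $g$'s are the “g factors” of the particles.


Problem 9-5: Two “localized” spins $\boldsymbol\sigma_1$ and $\boldsymbol\sigma_2$ interact through the “exchange energy” $J\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2$, and are placed in an external magnetic field $H_0$. The $g$ factors of the spins are $g_1$ and $g_2$. Determine the eigenstates of the system if each spin has quantum number $\tfrac12$.

Show that, at zero field, the state of lowest energy is the singlet state (antiparallel spins). At what field does the state of aligned spins become the lowest state?


Problem 9-6: Three spins ($s = \tfrac12$) interact with the Hamiltonian

$$H = J(\boldsymbol\sigma_1\cdot\boldsymbol\sigma_2 + \boldsymbol\sigma_2\cdot\boldsymbol\sigma_3 + \boldsymbol\sigma_3\cdot\boldsymbol\sigma_1)$$

Determine the eigenstates of the system and their degeneracies. [Consider the operator $s^2 = (\boldsymbol\sigma_1 + \boldsymbol\sigma_2 + \boldsymbol\sigma_3)^2$.]


6. Schwinger Formulation
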


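Schwinger has given another representation of angular momentum operators which is very versatile. It starts from creation and annihilation operators $a^\dagger$ and $a$ of the sort introduced in (9.14) and (9.16) for the harmonic oscillator problem; these satisfy the commutation rule

$$[a, a^\dagger] = aa^\dagger - a^\dagger a = 1$$

as already given in (9.17).

If we consider two oscillators, we may associate such operators with each; we call them $a_1$, $a_1^\dagger$, $a_2$ and $a_2^\dagger$. The operators $a_1$, $a_1^\dagger$ each commute with $a_2$ and $a_2^\dagger$. Let us form the row matrix

$$(a_1^\dagger \;\; a_2^\dagger) = a^\dagger$$

and the column matrix

$$\begin{pmatrix} a_1 \\ a_2 \end{pmatrix} = a$$

which we call “spinor operators.”

Let us now construct the operators

$$\mathcal{J}_x = a^\dagger s_x a, \qquad \mathcal{J}_y = a^\dagger s_y a, \qquad \mathcal{J}_z = a^\dagger s_z a \tag{9.99}$$

The operators for $\mathcal{J}_z$, $\mathcal{J}_+$, and $\mathcal{J}_-$ are seen immediately to be

$$\mathcal{J}_z = \tfrac12\hbar\,(a_1^\dagger a_1 - a_2^\dagger a_2) = \tfrac12\hbar\,(n_1 - n_2) \tag{9.100}$$

where we have put $n_i = a_i^\dagger a_i$,

$$\mathcal{J}_+ = \hbar\,a_1^\dagger a_2 \tag{9.101}$$

$$\mathcal{J}_- = \hbar\,a_2^\dagger a_1 \tag{9.102}$$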
£-= hata, (9.102) 


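Calculating the commutation rules, we find that

$$\mathcal{J}_z\mathcal{J}_+ - \mathcal{J}_+\mathcal{J}_z = \tfrac12\hbar^2\left[(n_1 - n_2)a_1^\dagger a_2 - a_1^\dagger a_2(n_1 - n_2)\right] = \hbar^2 a_1^\dagger a_2 = \hbar\,\mathcal{J}_+ \tag{9.103}$$

since

$$n_2a_2 = a_2n_2 - a_2 \tag{9.104}$$

and

$$n_1a_1^\dagger = a_1^\dagger n_1 + a_1^\dagger \tag{9.105}$$

Similarly,

$$\mathcal{J}_z\mathcal{J}_- - \mathcal{J}_-\mathcal{J}_z = \tfrac12\hbar^2\left[(n_1 - n_2)a_2^\dagger a_1 - a_2^\dagger a_1(n_1 - n_2)\right] = -\hbar^2 a_2^\dagger a_1 = -\hbar\,\mathcal{J}_- \tag{9.106}$$

Finally,

$$\mathcal{J}_+\mathcal{J}_- - \mathcal{J}_-\mathcal{J}_+ = \hbar^2\left[a_1^\dagger a_2a_2^\dagger a_1 - a_2^\dagger a_1a_1^\dagger a_2\right] = \hbar^2\left[(1 + n_2)n_1 - n_2(1 + n_1)\right] = (n_1 - n_2)\hbar^2 = 2\hbar\,\mathcal{J}_z \tag{9.107}$$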


These commutation relations are exactly those of (9.60) and (9.61); the $\mathcal{J}$'s are then representations of angular momentum operators and may be used in the development of angular momentum theory. We henceforth do not distinguish the $\mathcal{J}$'s and the angular momentum operators $\mathbf{J}$.

Since the eigenstates are determined exclusively by the commutation relations and are designated by the quantum numbers $j$ and $m$, the eigenvalues are exactly as already shown; that is, those of $J^2$ are $j(j+1)\hbar^2$ and those of $J_z$ are $m\hbar$, where $|m| \le j$.

From (9.107) we see that the quantum number

$$m = \tfrac12(n_1 - n_2) \tag{9.108}$$

Following (9.63)

$$J^2 = \hbar^2a_2^\dagger a_1a_1^\dagger a_2 + \tfrac14\hbar^2(n_1 - n_2)^2 + \tfrac12\hbar^2(n_1 - n_2)$$

$$= \hbar^2\left[n_2(1 + n_1) + \tfrac14n_1^2 - \tfrac12n_1n_2 + \tfrac14n_2^2 + \tfrac12n_1 - \tfrac12n_2\right]$$

$$= \hbar^2\left[\tfrac14(n_1 + n_2)^2 + \tfrac12(n_1 + n_2)\right]$$

which is equal to $j(j+1)\hbar^2$ where

$$j = \tfrac12(n_1 + n_2) \tag{9.109}$$


From (9.108) and (9.109) we see that $a_1^\dagger$ increases and $a_1$ decreases both $m$ and $j$ by $\tfrac12$; on the other hand $a_2^\dagger$ increases $j$ by $\tfrac12$ while decreasing $m$ by $\tfrac12$, while $a_2$ increases $m$ and decreases $j$ by the same amount.

With these prescriptions, we can generate all angular momentum states from the ground state of the two operators, which we designate $|0\rangle$. Recalling that

$$a^\dagger|n\rangle = \sqrt{n+1}\,|n+1\rangle$$

and

$$a|n\rangle = \sqrt{n}\,|n-1\rangle$$

we see that

$$|\tfrac12, \tfrac12\rangle = a_1^\dagger|0\rangle \tag{9.110}$$

and

$$|\tfrac12, -\tfrac12\rangle = a_2^\dagger|0\rangle \tag{9.111}$$




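Similarly,

$$|1, -1\rangle = \frac{1}{\sqrt2}(a_2^\dagger)^2|0\rangle \tag{9.112}$$

$$|1, 0\rangle = a_1^\dagger a_2^\dagger|0\rangle \tag{9.113}$$

$$|1, +1\rangle = \frac{1}{\sqrt2}(a_1^\dagger)^2|0\rangle \tag{9.114}$$

By the method of induction it is easily seen that

$$|jm\rangle = \frac{(a_1^\dagger)^{j+m}(a_2^\dagger)^{j-m}}{\sqrt{(j+m)!\,(j-m)!}}\,|0\rangle \tag{9.115}$$

Problem 9-7: Prove (9.115). [Hint: Consider the effect of each of the operators $a_1^\dagger$, $a_1$, $a_2^\dagger$, $a_2$.]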
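In the Schwinger picture the state $|jm\rangle$ of (9.115) is simply the normalized two-oscillator state with occupation numbers $n_1 = j + m$ and $n_2 = j - m$, by (9.108) and (9.109). The sketch below is an illustration with hypothetical helper names and $\hbar = 1$; it applies $J_+ = a_1^\dagger a_2$ of (9.101) to such a state, and the resulting amplitude should agree with the coefficient in (9.70).

```python
import math
from fractions import Fraction

def schwinger_occupations(j, m):
    """Occupation numbers (n1, n2) of the state |j m>: n1 = j + m, n2 = j - m."""
    j, m = Fraction(j), Fraction(m)
    n1, n2 = j + m, j - m
    if n1.denominator != 1 or n2.denominator != 1 or n1 < 0 or n2 < 0:
        raise ValueError("j + m and j - m must be nonnegative integers")
    return int(n1), int(n2)

def apply_j_plus(n1, n2):
    """Act with J_+ = a1^dagger a2 (hbar = 1) on the normalized state |n1, n2>.

    Returns (amplitude, (n1 + 1, n2 - 1)); the state is annihilated when n2 == 0.
    """
    if n2 == 0:
        return 0.0, None
    # a2 gives sqrt(n2), then a1^dagger gives sqrt(n1 + 1)
    return math.sqrt(n2) * math.sqrt(n1 + 1), (n1 + 1, n2 - 1)

def quantum_numbers(n1, n2):
    """Recover (j, m) from the occupation numbers, Eqs. (9.109) and (9.108)."""
    return Fraction(n1 + n2, 2), Fraction(n1 - n2, 2)

n1, n2 = schwinger_occupations(Fraction(3, 2), Fraction(-1, 2))
print(apply_j_plus(n1, n2))
```

The stretched state corresponds to $n_2 = 0$, which is why `apply_j_plus` returns a zero amplitude there.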


7. Addition of Angular Momenta: 
Clebsch—Gordan Coefficients 


In this section we consider the problem of the addition of two independent angular momenta, $\mathbf{J}_1$ and $\mathbf{J}_2$. We designate $|j_1m_1\rangle$ as eigenstates of $J_1^2$ and $J_{1z}$, $|j_2m_2\rangle$ as those of $J_2^2$ and $J_{2z}$. The hypothesis of independence means that all components of $\mathbf{J}_1$ commute with all components of $\mathbf{J}_2$.

Direct products of the eigenstates of the two angular momenta constitute a complete set of eigenstates of the system, and may be designated $|j_1j_2m_1m_2\rangle$.

However, the components of

$$\mathbf{J} = \mathbf{J}_1 + \mathbf{J}_2 \tag{9.116}$$

satisfy the general angular momentum commutation rules, so that there exist common eigenstates of $J^2$ and $J_z$. We designate these as $|jm\rangle$ where

$$J^2|jm\rangle = j(j+1)\hbar^2\,|jm\rangle \tag{9.117}$$

and

$$J_z|jm\rangle = m\hbar\,|jm\rangle \tag{9.118}$$

Strictly speaking, we should indicate that these states are the result of adding angular momenta $\mathbf{J}_1$ and $\mathbf{J}_2$, and write them $|j_1j_2jm\rangle$. We suppress the $j_1$ and $j_2$ for brevity in writing; we might then just as well do the same for

$$|j_1j_2m_1m_2\rangle$$

and designate them merely as $|m_1m_2\rangle$.




In summary, $|m_1m_2\rangle$ are eigenstates of the commuting operators $J_1^2$, $J_2^2$, $J_{1z}$, $J_{2z}$, while $|jm\rangle$ are eigenstates of $J_1^2$, $J_2^2$, $(\mathbf{J}_1 + \mathbf{J}_2)^2$, and $(J_{1z} + J_{2z})$, all pairs of which also commute with each other.

Since each set of states constitutes a complete set or basis for the states of the two angular momenta, each vector of one set may be expressed as a linear combination of those of the other.

Thus we may write

$$|j_1j_2jm\rangle = \sum_{m_1 + m_2 = m} |j_1j_2m_1m_2\rangle\,\langle j_1j_2m_1m_2|j_1j_2jm\rangle \tag{9.119}$$

The coefficients $\langle j_1j_2m_1m_2|j_1j_2jm\rangle$ are known as the “Clebsch-Gordan” coefficients.

The relations may be expressed the other way about, viz,

$$|j_1j_2m_1m_2\rangle = \sum_{j = |j_1 - j_2|}^{j_1 + j_2} |j_1j_2jm\rangle\,\langle j_1j_2jm|j_1j_2m_1m_2\rangle \tag{9.120}$$

where, in all terms on the right, $m = m_1 + m_2$. The coefficients here are the conjugates of those in (9.119); however, since all phases may be chosen $= 0$, they are actually the same quantities.

If we consider the state $|m_1 m_2\rangle' = |j_1 j_2\rangle'$,¹ it must be the same as $|jm\rangle = |j_1+j_2,\, j_1+j_2\rangle$. For in general

$$m_1 + m_2 = m \tag{9.121}$$

But $j$ cannot be larger than $j_1+j_2$, for if it were, $m$ could also be. But we know that its maximum value is $j_1+j_2$.
Suppose next we consider the states

$$|j_1,\, j_2-1\rangle', \qquad |j_1-1,\, j_2\rangle'$$

They correspond to the same

$$m = j_1+j_2-1$$

We know that there is a state

$$|jm\rangle = |j_1+j_2,\, j_1+j_2-1\rangle$$

There must, however, be another state with the same $m$. It must therefore correspond to

$$j = j_1+j_2-1$$


Proceeding in the same way, we see that there are $(l+1)$ states $|m_1 m_2\rangle'$ with $m_1+m_2 = j_1+j_2-l$. These must be expressible in terms of states $|j,\, m = j_1+j_2-l\rangle$ where

$$j = j_1+j_2,\; j_1+j_2-1,\; \ldots,\; j_1+j_2-l \tag{9.122}$$

¹Because there is a risk of confusion between the vectors $|jm\rangle$ and $|m_1 m_2\rangle$, we shall distinguish the latter by a prime in what follows.


This is, however, only true if $l$ is not too large. To see what the largest value of $l$ is, we note that the number of independent states $|m_1 m_2\rangle'$ is

$$N(m_1 m_2) = (2j_1+1)(2j_2+1) \tag{9.123}$$

The number of states $|jm\rangle$ is

$$N(jm) = \sum_{s=0}^{l}\,[2(j_1+j_2-s)+1] = [2(j_1+j_2)+1](l+1) - l(l+1) \tag{9.124}$$


But we must have $N(jm) = N(m_1 m_2)$, since both sets span the same vector space. Thus

$$4j_1 j_2 + 2(j_1+j_2) + 1 = 2(j_1+j_2) + 1 + 2l(j_1+j_2) - l^2$$

or

$$l^2 - 2l(j_1+j_2) + 4j_1 j_2 = 0$$

or, finally,

$$(l-2j_1)(l-2j_2) = 0 \tag{9.125}$$
However, $l$ cannot be equal to $2j_>$, since this would lead to

$$j = j_< - j_>$$

where $j_<$ is the lesser and $j_>$ the larger of $j_1$, $j_2$; however, $j$ cannot be negative. Therefore,

$$l = 2j_<$$

and the lowest value of $j$ appearing in (9.124) is

$$j = j_> + j_< - 2j_< = j_> - j_< = |j_1-j_2| \tag{9.126}$$

We have thus established the limits

$$|j_1-j_2| \le j \le j_1+j_2 \tag{9.127}$$

This is known as the "triangular inequality."
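The counting argument behind (9.123) to (9.127) can be checked by brute force. The following is a minimal sketch, not part of the original text; the function names are illustrative, and exact fractions are used so that half-integer values of $j$ are handled without rounding.

```python
from fractions import Fraction

def allowed_j(j1, j2):
    """Total-j values |j1 - j2|, ..., j1 + j2 in unit steps (triangular inequality)."""
    j1, j2 = Fraction(j1), Fraction(j2)
    j = abs(j1 - j2)
    values = []
    while j <= j1 + j2:
        values.append(j)
        j += 1
    return values

def count_coupled(j1, j2):
    """Number of |jm> states: the sum of (2j + 1) over the allowed j."""
    return sum(int(2 * j + 1) for j in allowed_j(j1, j2))

def count_product(j1, j2):
    """Number of product states |m1 m2>': (2 j1 + 1)(2 j2 + 1), as in (9.123)."""
    return int((2 * Fraction(j1) + 1) * (2 * Fraction(j2) + 1))

if __name__ == "__main__":
    for j1, j2 in [("1/2", "1/2"), (1, "1/2"), (1, 1), (2, 1)]:
        print(j1, j2, allowed_j(j1, j2), count_coupled(j1, j2), count_product(j1, j2))
```

Both counts agree for every pair tried, which is the statement that the two bases span the same space.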


8. Techniques of Calculation 
The case in which

$$j = j_1+j_2$$

("stretched case") is easily handled, since we merely start from the relation

$$|j = j_1+j_2,\; m = j_1+j_2\rangle = |m_1 = j_1,\; m_2 = j_2\rangle'$$

or

$$|j_1+j_2,\; j_1+j_2\rangle = |j_1\, j_2\rangle' \tag{9.128}$$


and operate successively with $J_-$. Note that on the right, $|j_1 j_2\rangle'$ is really a direct product state; we operate on it with $J_{1-}$, $J_{2-}$, the first acting only on the $j_1$ state and the second on the $j_2$ state. Using (9.71) and (9.72) on (9.128) yields

$$\sqrt{(j_1+j_2)(j_1+j_2+1) - (j_1+j_2)(j_1+j_2-1)}\;|j_1+j_2,\; j_1+j_2-1\rangle$$
$$= \sqrt{j_1(j_1+1) - j_1(j_1-1)}\;|j_1-1,\; j_2\rangle' + \sqrt{j_2(j_2+1) - j_2(j_2-1)}\;|j_1,\; j_2-1\rangle'$$

which may be simplified to

$$|j_1+j_2,\; j_1+j_2-1\rangle = \sqrt{\frac{j_1}{j_1+j_2}}\;|j_1-1,\; j_2\rangle' + \sqrt{\frac{j_2}{j_1+j_2}}\;|j_1,\; j_2-1\rangle' \tag{9.129}$$

One may continue to operate with $J_- = J_{1-}+J_{2-}$ until we reach the state $|j_1+j_2,\; -(j_1+j_2)\rangle$.
How, then, do we deal with $j = j_1+j_2-l$, where $l > 0$? The simplest procedure is to write, for instance,

$$|j_1-l,\; j_2\rangle' = C_0\,|j_1+j_2,\; j_1+j_2-l\rangle + C_1\,|j_1+j_2-1,\; j_1+j_2-l\rangle + \cdots + C_l\,|j_1+j_2-l,\; j_1+j_2-l\rangle \tag{9.130}$$

and operate successively on both sides with $J_+ = J_{1+}+J_{2+}$. Operating once we get an equation without $C_l$; the second time, $C_{l-1}$ also drops out, etc. After $l$ operations, we have only $C_0$. Working back through the equations, the second to the last permits us to determine $C_1$, and so on. At the second stage, we are able to write $|j_1+j_2-1,\; j_1+j_2-1\rangle$ in terms of the states $|m_1 m_2\rangle'$ because $|j_1+j_2,\; j_1+j_2-1\rangle$, being a state of the "stretched case," may be so written. Thus, at each stage we calculate the state $|jj\rangle$ in terms of states which have previously been expressed in terms of the product states $|m_1 m_2\rangle'$. Then, by successive application of the $J_-$ operator, we obtain all the states $|jm\rangle$.
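The lowering procedure for the stretched case lends itself to a short numerical illustration. The sketch below is an assumption-laden aid rather than the author's method: it stores a state as a dictionary from $(m_1, m_2)$ to amplitude, sets $\hbar = 1$, uses the matrix element $\sqrt{j(j+1)-m(m-1)}$ that appears in the step leading to (9.129), and renormalizes after each lowering instead of dividing by the known factor. All function names are hypothetical.

```python
from fractions import Fraction
from math import sqrt

def lower(state, j1, j2):
    """Apply J_- = J_1- + J_2- to a product-basis state {(m1, m2): amplitude}."""
    out = {}
    for (m1, m2), amp in state.items():
        # J_- |j m> = sqrt(j(j+1) - m(m-1)) |j, m-1>, applied to each factor in turn
        for j, m, key in ((j1, m1, (m1 - 1, m2)), (j2, m2, (m1, m2 - 1))):
            factor = j * (j + 1) - m * (m - 1)
            if factor > 0:
                out[key] = out.get(key, 0.0) + amp * sqrt(factor)
    return out

def normalize(state):
    """Rescale the amplitudes to unit norm."""
    norm = sqrt(sum(amp * amp for amp in state.values()))
    return {key: amp / norm for key, amp in state.items()}

def stretched_multiplet(j1, j2):
    """States |j1+j2, m> for m = j, j-1, ..., -j, each expressed in the product basis."""
    state = {(j1, j2): 1.0}
    states = [state]
    for _ in range(int(2 * (j1 + j2))):
        state = normalize(lower(state, j1, j2))
        states.append(state)
    return states

if __name__ == "__main__":
    for m, st in zip((2, 1, 0, -1, -2), stretched_multiplet(1, 1)):
        print(m, st)
```

For $j_1 = j_2 = 1$ the third entry of the list reproduces the $|2\,0\rangle$ coefficients $1/\sqrt6$, $2/\sqrt6$, $1/\sqrt6$ of Case C.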

The preceding ideas are best illustrated by carrying out the addition of angular momenta of low quantum number, viz.,

A. $j_1 = \tfrac12$, $j_2 = \tfrac12$
B. $j_1 = 1$, $j_2 = \tfrac12$
C. $j_1 = 1$, $j_2 = 1$


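CASE A

We start from

$$|1\,1\rangle = |\tfrac12\,\tfrac12\rangle' \tag{9.131}$$

where the state on the left is of the type $|jm\rangle$ and that on the right is of the type $|m_1 m_2\rangle'$. Operating with $J_-$ gives

$$\sqrt2\,|1\,0\rangle = |\tfrac12,\,-\tfrac12\rangle' + |-\tfrac12,\,\tfrac12\rangle' \tag{9.132}$$

Recognizing the states on the right as product states, and writing $|j_1 = \tfrac12,\, m_1 = \tfrac12\rangle$ as $\alpha_1$ and $|j_1 = \tfrac12,\, m_1 = -\tfrac12\rangle$ as $\beta_1$, (9.131) becomes

$$|1\,1\rangle = \alpha_1\alpha_2$$

and

$$|1\,0\rangle = \frac{1}{\sqrt2}\,(\alpha_1\beta_2 + \beta_1\alpha_2)$$

Operating once more with $J_-$ gives

$$|1,\,-1\rangle = \beta_1\beta_2$$

The $j = 0$ state may be obtained by putting

$$\alpha_1\beta_2 = C_0\,|1\,0\rangle + C_1\,|0\,0\rangle$$

and operating with $J_+$. This gives

$$\alpha_1\alpha_2 = C_0\sqrt2\,|1\,1\rangle$$

so that $C_0 = 1/\sqrt2$. But then

$$C_1\,|0\,0\rangle = \alpha_1\beta_2 - \frac{1}{\sqrt2}\,|1\,0\rangle = \tfrac12\,(\alpha_1\beta_2 - \beta_1\alpha_2)$$

The normalization clearly requires that $C_1 = 1/\sqrt2$, so that

$$|0\,0\rangle = \frac{1}{\sqrt2}\,(\alpha_1\beta_2 - \beta_1\alpha_2) = \frac{1}{\sqrt2}\,\bigl\{|\tfrac12,\,-\tfrac12\rangle' - |-\tfrac12,\,\tfrac12\rangle'\bigr\}$$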

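CASE B

This time the starting point is the equation

$$|jm\rangle = |\tfrac32\,\tfrac32\rangle = |1\,\tfrac12\rangle' = |m_1 m_2\rangle'$$

Successive operations with $J_-$ yield

$$|\tfrac32\,\tfrac12\rangle = \sqrt{\tfrac23}\,|0\,\tfrac12\rangle' + \sqrt{\tfrac13}\,|1,\,-\tfrac12\rangle' \tag{9.137}$$

$$|\tfrac32,\,-\tfrac12\rangle = \sqrt{\tfrac13}\,|-1,\,\tfrac12\rangle' + \sqrt{\tfrac23}\,|0,\,-\tfrac12\rangle'$$

and

$$|\tfrac32,\,-\tfrac32\rangle = |-1,\,-\tfrac12\rangle'$$

The states of $j = \tfrac12$ are obtained by starting with

$$|m_1 m_2\rangle' = |0\,\tfrac12\rangle' = C_0\,|\tfrac32\,\tfrac12\rangle + C_1\,|\tfrac12\,\tfrac12\rangle$$

Operating with $J_+$ leads to

$$\sqrt2\,|1\,\tfrac12\rangle' = \sqrt3\,C_0\,|\tfrac32\,\tfrac32\rangle$$

whence

$$C_0 = \sqrt{\tfrac23}$$

It follows that

$$C_1\,|\tfrac12\,\tfrac12\rangle = |0\,\tfrac12\rangle' - \sqrt{\tfrac23}\,|\tfrac32\,\tfrac12\rangle$$

Substituting for the second term from (9.137), we find that

$$C_1\,|\tfrac12\,\tfrac12\rangle = \tfrac13\,|0\,\tfrac12\rangle' - \tfrac{\sqrt2}{3}\,|1,\,-\tfrac12\rangle'$$

The normalization condition gives $C_1 = 1/\sqrt3$, so that

$$|\tfrac12\,\tfrac12\rangle = \sqrt{\tfrac13}\,|0\,\tfrac12\rangle' - \sqrt{\tfrac23}\,|1,\,-\tfrac12\rangle'$$

Operating on this with $J_-$ gives finally

$$|\tfrac12,\,-\tfrac12\rangle = \frac{1}{\sqrt3}\,\bigl(|0,\,-\tfrac12\rangle' + \sqrt2\,|-1,\,\tfrac12\rangle'\bigr) - \sqrt{\tfrac23}\,\sqrt2\,|0,\,-\tfrac12\rangle'$$

or

$$|\tfrac12,\,-\tfrac12\rangle = \sqrt{\tfrac23}\,|-1,\,\tfrac12\rangle' - \sqrt{\tfrac13}\,|0,\,-\tfrac12\rangle'$$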


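CASE C

The $m = j$ state for the stretched case is

$$|jm\rangle = |2\,2\rangle = |1\,1\rangle' = |m_1 m_2\rangle' \tag{9.143}$$

where, once again, we use a prime to distinguish an $|m_1 m_2\rangle$ state from a $|jm\rangle$ one. Operating once with $J_-$ gives

$$|2\,1\rangle = \frac{1}{\sqrt2}\,|1\,0\rangle' + \frac{1}{\sqrt2}\,|0\,1\rangle' \tag{9.144}$$

Operating again with $J_-$ yields

$$|2\,0\rangle = \frac{1}{\sqrt6}\,|1,\,-1\rangle' + \frac{2}{\sqrt6}\,|0\,0\rangle' + \frac{1}{\sqrt6}\,|-1,\,1\rangle' \tag{9.145}$$

Continuing, we get the remaining states of the "stretched case":

$$|2,\,-1\rangle = \frac{1}{\sqrt2}\,|0,\,-1\rangle' + \frac{1}{\sqrt2}\,|-1,\,0\rangle' \tag{9.146}$$

and

$$|2,\,-2\rangle = |-1,\,-1\rangle' \tag{9.147}$$

The starting point for the $j = 1$ states is the equation

$$|1\,0\rangle' = C_0\,|2\,1\rangle + C_1\,|1\,1\rangle$$

Operating with $J_+$ gives

$$\sqrt2\,|1\,1\rangle' = 2C_0\,|2\,2\rangle$$

so that $C_0 = 1/\sqrt2$. Therefore,

$$C_1\,|1\,1\rangle = |1\,0\rangle' - \tfrac12\,\bigl(|1\,0\rangle' + |0\,1\rangle'\bigr) = \tfrac12\,|1\,0\rangle' - \tfrac12\,|0\,1\rangle'$$

Since $C_1 = 1/\sqrt2$ because of the normalization requirement,

$$|1\,1\rangle = \frac{1}{\sqrt2}\,|1\,0\rangle' - \frac{1}{\sqrt2}\,|0\,1\rangle' \tag{9.148}$$

Operating with $J_-$ we obtain

$$|1\,0\rangle = \frac{1}{\sqrt2}\,|1,\,-1\rangle' - \frac{1}{\sqrt2}\,|-1,\,1\rangle' \tag{9.149}$$

Finally, to obtain the state $|0\,0\rangle$, we put

$$|0\,0\rangle' = C_0\,|2\,0\rangle + C_1\,|1\,0\rangle + C_2\,|0\,0\rangle$$

Operating once with $J_+$ produces

$$\sqrt2\,\bigl(|1\,0\rangle' + |0\,1\rangle'\bigr) = C_0\sqrt6\,|2\,1\rangle + C_1\sqrt2\,|1\,1\rangle$$

A second operation gives

$$4\,|1\,1\rangle' = 2C_0\sqrt6\,|2\,2\rangle$$

so that

$$C_0 = \frac{2}{\sqrt6}$$

Therefore,

$$C_1\,|1\,1\rangle = |1\,0\rangle' + |0\,1\rangle' - \sqrt2\,|2\,1\rangle = 0$$

and

$$C_1 = 0$$

Consequently,

$$C_2\,|0\,0\rangle = |0\,0\rangle' - \frac{2}{\sqrt6}\,|2\,0\rangle = \tfrac13\,|0\,0\rangle' - \tfrac13\,|1,\,-1\rangle' - \tfrac13\,|-1,\,1\rangle'$$

The normalization condition gives $C_2 = 1/\sqrt3$ and therefore

$$|0\,0\rangle = \frac{1}{\sqrt3}\,|0\,0\rangle' - \frac{1}{\sqrt3}\,|-1,\,1\rangle' - \frac{1}{\sqrt3}\,|1,\,-1\rangle'$$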
Problem 9-8: Determine the states obtained by adding $j_1 = 2$ and $j_2 = 1$. The results are given in a table whose columns are the coupled states $|3\,3\rangle, |3\,2\rangle, \ldots, |3,-3\rangle, |2\,2\rangle, \ldots, |2,-2\rangle, |1\,1\rangle, |1\,0\rangle, |1,-1\rangle$ and whose rows are the product states $|2\,1\rangle', \ldots$

[Table of coefficients not recoverable from the source.]


9. Two-Dimensional Harmonic Oscillator


If $\xi$, $\eta$ are rescaled variables,

$$\xi = \left(\frac{mk}{\hbar^2}\right)^{1/4} x \tag{9.150}$$

$$\eta = \left(\frac{mk}{\hbar^2}\right)^{1/4} y \tag{9.151}$$

the oscillator equation

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2}\right) + \tfrac12 k(x^2+y^2)\,\psi = E\psi \tag{9.152}$$

may be simplified, as in the one-dimensional case, to

$$-\tfrac12\left(\frac{\partial^2\psi}{\partial\xi^2} + \frac{\partial^2\psi}{\partial\eta^2}\right) + \tfrac12(\xi^2+\eta^2)\,\psi = \epsilon\psi \tag{9.153}$$

where again, as in (9.4),

$$\epsilon = \frac{E}{\hbar\omega} \tag{9.154}$$

Equation (9.153) may be solved in Cartesian coordinates by separation of variables, that is, by putting

$$\psi = \phi(\xi)\,\chi(\eta) \tag{9.155}$$

where the factors individually solve one-dimensional oscillator equations with energy eigenvalues $\epsilon_1$ and $\epsilon_2$. The total energy is

$$\epsilon = \epsilon_1 + \epsilon_2 \tag{9.156}$$

where

$$\epsilon_1 = (n_1+\tfrac12)\,\hbar\omega \tag{9.157}$$

$$\epsilon_2 = (n_2+\tfrac12)\,\hbar\omega \tag{9.158}$$

so that

$$\epsilon = (n_1+n_2+1)\,\hbar\omega \tag{9.159}$$

The energy states are degenerate, all states for which $n_1+n_2$ has a definite value $n$ having the same energy, although they have different wave functions. For a given $n$, the degeneracy is $(n+1)$-fold.
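The $(n+1)$-fold degeneracy can be made concrete by listing the Cartesian quantum numbers level by level. This is a small illustrative sketch with hypothetical function names; energies are quoted in units of $\hbar\omega$.

```python
def cartesian_states(n):
    """All (n1, n2) with n1 + n2 = n for the two-dimensional oscillator."""
    return [(n1, n - n1) for n1 in range(n + 1)]

def energy(n1, n2):
    """Energy in units of hbar*omega, following (9.159)."""
    return n1 + n2 + 1

if __name__ == "__main__":
    for n in range(4):
        states = cartesian_states(n)
        print(n, energy(*states[0]), len(states), states)
```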

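If we write the wave equation in cylindrical coordinates,

$$\xi = \rho\cos\varphi, \qquad \eta = \rho\sin\varphi \tag{9.160}$$

it becomes

$$-\tfrac12\left(\frac{\partial^2\psi}{\partial\rho^2} + \frac{1}{\rho}\frac{\partial\psi}{\partial\rho} + \frac{1}{\rho^2}\frac{\partial^2\psi}{\partial\varphi^2}\right) + \tfrac12\rho^2\psi = \epsilon\psi \tag{9.161}$$

Recalling, from Eq. (9.76), that the operator for the angular momentum about an axis perpendicular to the plane of motion is

$$L_z = -i\hbar\,\frac{\partial}{\partial\varphi}$$

we see that the equation for a state with angular momentum $m\hbar$ is

$$-\tfrac12\left(\frac{d^2R}{d\rho^2} + \frac{1}{\rho}\frac{dR}{d\rho} - \frac{m^2}{\rho^2}R\right) + \tfrac12\rho^2 R = \epsilon R \tag{9.162}$$

where the wave function is

$$\psi = e^{im\varphi}R(\rho) \tag{9.163}$$

Putting

$$\frac{\rho^2}{2} = z \tag{9.164}$$

Eq. (9.162) becomes

$$zR'' + R' + \left(\epsilon - \frac{m^2}{4z} - z\right)R = 0 \tag{9.165}$$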

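Comparing this with (4.327), we see that the solution may be written in terms of the confluent hypergeometric function. The coefficients in that equation, in order that it may reduce to (9.165), are

$$2\alpha - k = 0, \qquad \gamma - 2s = 1$$
$$\alpha(\alpha-k) = -1, \qquad (\gamma-2s)\alpha + k(s-\beta) = \epsilon$$
$$s(\gamma-s-1) = \frac{m^2}{4}$$

These have the solutions

$$s = \pm\frac{m}{2}, \qquad \gamma = \pm m + 1, \qquad \alpha = 1$$
$$k = 2, \qquad \beta = \tfrac12(1 \pm m - \epsilon)$$

The solution $s = m/2$ is finite at the origin. Thus

$$R = C\,e^{-z} z^{m/2}\,F\bigl(\tfrac12(1+m-\epsilon),\; m+1,\; 2z\bigr) \tag{9.166}$$

or, expressed in terms of $\rho$,

$$R = C\,e^{-\rho^2/2}\rho^{m}\,F\bigl(\tfrac12(1+m-\epsilon),\; m+1,\; \rho^2\bigr) \tag{9.167}$$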


The confluent hypergeometric series, if it does not terminate, behaves for large $\rho$ like $e^{\rho^2}$. It follows that, to behave properly at infinity, the series must terminate. That is to say, we must have

$$\tfrac12(1+m-\epsilon) = -n \tag{9.168}$$

where $n$ is a non-negative integer. The energy levels are therefore given by

$$\epsilon = 2n+m+1$$

or

$$E = (2n+m+1)\,\hbar\omega \tag{9.169}$$

The radial wave function $R_{nm}$ is then

$$R_{nm} = C_{nm}\,e^{-\rho^2/2}\rho^{m}\,F(-n,\; m+1,\; \rho^2) \tag{9.170}$$

Comparing with (4.295) we see that this may be expressed in terms of associated Laguerre polynomials:

$$R_{nm} = D_{nm}\,e^{-\rho^2/2}\rho^{m}\,L_n^{m}(\rho^2) \tag{9.171}$$

$D_{nm}$ is a normalizing factor.

Finally, the wave function is

$$\psi_{nm} = \frac{D_{nm}}{\sqrt{2\pi}}\,e^{-\rho^2/2}\rho^{m}\,L_n^{m}(\rho^2)\,e^{im\varphi} \tag{9.172}$$


There is also a very convenient solution of the two-dimensional problem in terms of operators. If $a_1$, $a_1^\dagger$ are the annihilation and creation operators of the $\xi$-oscillator and $a_2$, $a_2^\dagger$ are those of the $\eta$-oscillator, we can introduce operators associated with $\xi \pm i\eta$, that is,

$$b_\pm = \frac{1}{\sqrt2}\,(a_1 \mp i a_2) \tag{9.173}$$

$$b_\pm^\dagger = \frac{1}{\sqrt2}\,(a_1^\dagger \pm i a_2^\dagger) \tag{9.174}$$

It follows from the commutation relations for the $a$'s that

$$[b_i, b_j] = 0 = [b_i^\dagger, b_j^\dagger] \tag{9.175}$$

$$[b_i, b_j^\dagger] = \delta_{ij} \tag{9.176}$$

where $i$, $j$ may be $+$ or $-$.

We may also calculate

$$N_+ = b_+^\dagger b_+ = \tfrac12\,(a_1^\dagger a_1 + a_2^\dagger a_2) + \tfrac{i}{2}\,(a_1 a_2^\dagger - a_2 a_1^\dagger) \tag{9.177}$$

and

$$N_- = b_-^\dagger b_- = \tfrac12\,(a_1^\dagger a_1 + a_2^\dagger a_2) - \tfrac{i}{2}\,(a_1 a_2^\dagger - a_2 a_1^\dagger) \tag{9.178}$$


The second operator in the above equations may be expressed in terms of coordinate and momentum operators as

$$\tfrac{i}{2}\,(a_1 a_2^\dagger - a_2 a_1^\dagger) = \tfrac{i}{4}\,\bigl[(\xi + ip_\xi)(\eta - ip_\eta) - (\eta + ip_\eta)(\xi - ip_\xi)\bigr] = \tfrac12\,(\xi p_\eta - \eta p_\xi) = \tfrac12 L_z \tag{9.179}$$

$L_z$ being the operator for the component of angular momentum perpendicular to the plane of motion.

In view of the fact that

$$H = a_1^\dagger a_1 + a_2^\dagger a_2 + 1 \tag{9.180}$$

we see that

$$H = N_+ + N_- + 1 \tag{9.181}$$

while

$$L_z = N_+ - N_- \tag{9.182}$$


Thus, simultaneous eigenstates of $N_+$ and $N_-$ are eigenstates of the Hamiltonian. Also, because the $b$'s and $b^\dagger$'s obey the same algebraic relations as the $a$'s, and appear in the same way in the Hamiltonian,

$$b_\pm^\dagger\,|N_\pm\rangle = \sqrt{N_\pm+1}\;|N_\pm+1\rangle \tag{9.183}$$

and

$$b_\pm\,|N_\pm\rangle = \sqrt{N_\pm}\;|N_\pm-1\rangle \tag{9.184}$$

The ground state is

$$|N_+ N_-\rangle = |0\,0\rangle \tag{9.185}$$

being given by $N_+ = N_- = 0$. The general state is

$$|N_+ N_-\rangle = \frac{1}{\sqrt{N_+!\,N_-!}}\,(b_+^\dagger)^{N_+}(b_-^\dagger)^{N_-}\,|0\,0\rangle \tag{9.186}$$

In this state

$$\epsilon = N_+ + N_- + 1$$

or

$$E = (N_+ + N_- + 1)\,\hbar\omega \tag{9.187}$$

and

$$L_z = (N_+ - N_-)\,\hbar \tag{9.188}$$

in ordinary units.
By virtue of the easily verified commutation relations

$$[L_z, b_\pm^\dagger] = \pm b_\pm^\dagger \tag{9.189}$$

$$[L_z, b_\pm] = \mp b_\pm \tag{9.190}$$

we see that $b_+^\dagger$, $b_-$ increase $L_z$ by one unit while $b_-^\dagger$, $b_+$ decrease it by the same amount.

We may therefore independently raise and lower the energy and the angular momentum; for example, $b_+ b_-$ keeps $L_z$ the same but decreases the energy by two quanta, while $b_+^\dagger b_-$ increases $L_z$ by two units while leaving the energy unchanged.

Note, however, that states of odd angular momentum must have odd 
numbers of quanta, while those with even angular momenta have even 
numbers of quanta. 
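The parity rule just stated follows from $n = N_+ + N_-$ and $m = N_+ - N_-$, which always differ by the even number $2N_-$. A minimal sketch that tabulates the allowed pairs is given below; the function names are illustrative only, with $n$ counting quanta and $m$ measured in units of $\hbar$.

```python
def quantum_numbers(n_plus, n_minus):
    """Return (n, m): n = N+ + N- quanta and m = N+ - N- units of L_z."""
    return n_plus + n_minus, n_plus - n_minus

def states_with_n(n):
    """All (n, m) pairs reachable with exactly n quanta, ordered by increasing m."""
    return [quantum_numbers(k, n - k) for k in range(n + 1)]

if __name__ == "__main__":
    for n in range(4):
        print(n, [m for _, m in states_with_n(n)])
```

For each $n$ the list contains $n+1$ values of $m$, running from $-n$ to $n$ in steps of two, in agreement with the degeneracy found in Cartesian coordinates.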

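Schrödinger wave functions for the problem may be obtained explicitly by operator methods. Using the notation of (9.185), the ground state is defined by the equations

$$b_+\,|0\,0\rangle = b_-\,|0\,0\rangle = 0 \tag{9.191}$$

The Schrödinger operators may be written in terms of $\xi$ and $\eta$ as

$$b_\pm = \frac{1}{\sqrt2}\,(a_1 \mp i a_2) = \tfrac12\left[\xi + \frac{\partial}{\partial\xi} \mp i\left(\eta + \frac{\partial}{\partial\eta}\right)\right] \tag{9.192}$$

$$b_\pm^\dagger = \frac{1}{\sqrt2}\,(a_1^\dagger \pm i a_2^\dagger) = \tfrac12\left[\xi - \frac{\partial}{\partial\xi} \pm i\left(\eta - \frac{\partial}{\partial\eta}\right)\right] \tag{9.193}$$

In cylindrical coordinates these become

$$b_\pm = \frac{e^{\mp i\varphi}}{2}\left(\rho + \frac{\partial}{\partial\rho} \mp \frac{i}{\rho}\frac{\partial}{\partial\varphi}\right) \tag{9.194}$$

$$b_\pm^\dagger = \frac{e^{\pm i\varphi}}{2}\left(\rho - \frac{\partial}{\partial\rho} \mp \frac{i}{\rho}\frac{\partial}{\partial\varphi}\right) \tag{9.195}$$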


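Let us designate by $\chi_{nm}$ the wave function of the state with $n = N_+ + N_-$ quanta [energy $(n+1)\hbar\omega$] and angular momentum $m\hbar$. Then

$$b_+\chi_{nm} = \sqrt{\tfrac12(n+m)}\;\chi_{n-1,\,m-1} \tag{9.196}$$

$$b_-\chi_{nm} = \sqrt{\tfrac12(n-m)}\;\chi_{n-1,\,m+1} \tag{9.197}$$

$$b_+^\dagger\chi_{nm} = \sqrt{\tfrac12(n+m)+1}\;\chi_{n+1,\,m+1} \tag{9.198}$$

$$b_-^\dagger\chi_{nm} = \sqrt{\tfrac12(n-m)+1}\;\chi_{n+1,\,m-1} \tag{9.199}$$

if the $\chi$'s are normalized. The ground state wave function satisfies the equation

$$\left(\rho + \frac{\partial}{\partial\rho}\right)\chi_{00} = 0 \tag{9.200}$$

for which the normalized solution is

$$\chi_{00} = \frac{1}{\sqrt\pi}\,e^{-\rho^2/2} \tag{9.201}$$

The wave functions for the states $n = 1$, $m = \pm1$ and $n = 2$, $m = 0, \pm2$ are easily calculated to be

$$\chi_{1,\pm1} = \frac{1}{\sqrt\pi}\,\rho\,e^{-\rho^2/2}\,e^{\pm i\varphi} \tag{9.202}$$

$$\chi_{2,\pm2} = \frac{1}{\sqrt{2\pi}}\,\rho^2\,e^{-\rho^2/2}\,e^{\pm 2i\varphi} \tag{9.203}$$

$$\chi_{2,0} = \frac{1}{\sqrt\pi}\,(\rho^2-1)\,e^{-\rho^2/2} \tag{9.204}$$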
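The normalization of these low-lying wave functions can be checked with the elementary integral $\int_0^\infty \rho^k e^{-\rho^2}\,d\rho = \tfrac12\Gamma\bigl(\tfrac{k+1}{2}\bigr)$. The sketch below is illustrative only; it assumes each function has the form (prefactor) × (polynomial in $\rho$) × $e^{-\rho^2/2}e^{im\varphi}$, and the helper names are hypothetical.

```python
from math import gamma, pi, sqrt

def radial_moment(k):
    """Integral of rho**k * exp(-rho**2) from 0 to infinity."""
    return gamma((k + 1) / 2) / 2

def norm_squared(prefactor, poly):
    """Squared norm over the plane; poly maps a power of rho to its coefficient."""
    total = 0.0
    for p, a in poly.items():
        for q, b in poly.items():
            # the extra power of rho comes from the area element rho d(rho) d(phi)
            total += a * b * radial_moment(p + q + 1)
    return 2 * pi * prefactor ** 2 * total

if __name__ == "__main__":
    print(norm_squared(1 / sqrt(pi), {0: 1}))             # chi_00
    print(norm_squared(1 / sqrt(pi), {1: 1}))             # chi_1,+-1
    print(norm_squared(1 / sqrt(2 * pi), {2: 1}))         # chi_2,+-2
    print(norm_squared(1 / sqrt(pi), {2: 1, 0: -1}))      # chi_2,0
```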


Problem 9-9: Calculate the various states corresponding to $n = 3$, viz., $\chi_{3,\pm3}$ and $\chi_{3,\pm1}$.


10. Three-Dimensional Oscillator 


Once again, we may make use of the fact that the equation is separable, and find states which are products of three one-dimensional oscillator states. The operators $a_1$, $a_2$ for the $x$ and $y$ directions may be supplemented by the operator $a_3$ for the $z$ direction. Designating the ground state by

$$|n_1 n_2 n_3\rangle = |0\,0\,0\rangle \tag{9.205}$$

we may write the general state as

$$|n_1 n_2 n_3\rangle = \frac{1}{\sqrt{n_1!\,n_2!\,n_3!}}\,(a_1^\dagger)^{n_1}(a_2^\dagger)^{n_2}(a_3^\dagger)^{n_3}\,|0\,0\,0\rangle \tag{9.206}$$

The energy is

$$\epsilon = n_1+n_2+n_3+\tfrac32 \tag{9.207}$$

The various states for which $n_1+n_2+n_3 = n$ has the same value are degenerate; the number of them is

$$D_n = \frac{(n+2)!}{2!\,n!} = \frac{(n+1)(n+2)}{2} \tag{9.208}$$
The individual states (9.206) are not eigenstates of angular momentum, but linear combinations may be found which are. States of odd $n$ have odd angular momentum, while those of even $n$ have even angular momentum. This follows from the fact that under a parity transformation

$$x, y, z \to -x, -y, -z$$

the wave functions of states of odd angular momentum change sign, while those of even angular momentum do not. But the $a$ operators change sign under this transformation; therefore, an even number of them must lead to even angular momentum states and vice versa.

We can easily see that, for a given $n$, all the angular momentum states $l = n, n-2, \ldots, 0$ or $1$ must appear. For the sum

$$(2n+1) + [2(n-2)+1] + \cdots + 3 \text{ or } 1$$

is

$$\frac{(n+1)(n+2)}{2}$$
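The equality of the two counts is easy to confirm numerically. The following sketch, with hypothetical function names, counts the Cartesian states directly and compares the result with the sum of $(2l+1)$ over $l = n, n-2, \ldots$

```python
def cartesian_degeneracy(n):
    """Number of (n1, n2, n3) with n1 + n2 + n3 = n."""
    return sum(1 for n1 in range(n + 1) for n2 in range(n + 1 - n1))

def angular_degeneracy(n):
    """Sum of (2l + 1) over l = n, n-2, ..., 0 or 1."""
    return sum(2 * l + 1 for l in range(n, -1, -2))

if __name__ == "__main__":
    for n in range(6):
        print(n, cartesian_degeneracy(n), angular_degeneracy(n), (n + 1) * (n + 2) // 2)
```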


Angular momentum eigenstates may be constructed out of $a_1$, $a_2$, and $a_3$ as follows: Introduce

$$b_\pm = \frac{1}{\sqrt2}\,(a_1 \mp i a_2)$$

as in (9.192) and also

$$b_0 = a_3 \tag{9.209}$$

Commutation rules of the form (9.175) and (9.176) again apply, where now $i$, $j$ may be $0$, $\pm$.

Just as in the two-dimensional case,

$$H = N_+ + N_- + N_0 + \tfrac32 \tag{9.210}$$

and

$$L_z = N_+ - N_-$$

In terms of the $b$ operators we may construct simultaneous eigenstates of $H$ and $L_z$:

$$|N_+ N_- N_0\rangle = \frac{1}{\sqrt{N_+!\,N_-!\,N_0!}}\,(b_+^\dagger)^{N_+}(b_-^\dagger)^{N_-}(b_0^\dagger)^{N_0}\,|0\,0\,0\rangle \tag{9.211}$$

There remains the problem of finding eigenstates of $L^2$.

Let us designate by $\phi_{nlm}$ the Schrödinger wave function for the state with principal quantum number

$$n = N_0 + N_+ + N_- \tag{9.212}$$

and angular momentum quantum numbers $l$ and $m$.

The lowest state $\phi_{000}$ is determined by the conditions

$$b_+\phi_{000} = b_-\phi_{000} = b_0\phi_{000} = 0$$

Using the fact that $b_\pm$ are given by (9.194) and $b_0 = a_3$ by

$$b_0 = \frac{1}{\sqrt2}\left(z + \frac{\partial}{\partial z}\right)$$

we see immediately that

$$\phi_{000} = \frac{1}{\pi^{3/4}}\,e^{-(\rho^2+z^2)/2} = \frac{1}{\pi^{3/4}}\,e^{-r^2/2} \tag{9.213}$$

[This also follows from (9.39), since the wave function must be the product of three $n = 0$ one-dimensional ones.]
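The wave functions for $n = l = 1$ are

$$\phi_{111} = b_+^\dagger\phi_{000}, \qquad \phi_{110} = b_0^\dagger\phi_{000}, \qquad \phi_{1,1,-1} = b_-^\dagger\phi_{000} \tag{9.214}$$

Applying the operators to (9.213) it is easily verified that

$$\phi_{111} = \frac{1}{\pi^{3/4}}\,\sin\theta\,e^{i\varphi}\,r\,e^{-r^2/2} \tag{9.215}$$

$$\phi_{110} = \frac{\sqrt2}{\pi^{3/4}}\,\cos\theta\;r\,e^{-r^2/2} \tag{9.216}$$

$$\phi_{1,1,-1} = \frac{1}{\pi^{3/4}}\,\sin\theta\,e^{-i\varphi}\,r\,e^{-r^2/2} \tag{9.217}$$

It is always possible to get the states for which $n = l$. For

$$\phi_{nnn} = \frac{1}{\sqrt{n!}}\,(b_+^\dagger)^n\,\phi_{000} \tag{9.218}$$

This follows from the fact that $m = n$ and therefore $l = n$. The other states $\phi_{nnm}$ may then be obtained by successive application of the $L_-$ operator.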


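Since the angular momentum states are already known, (9.218) is in fact only needed to determine the radial function. However, all states can be expressed in terms of the $b$ operators once the angular momentum operators $L_\pm$ are so expressed. Consider, then,

$$L_+ = L_x + iL_y = (yp_z - zp_y) + i(zp_x - xp_z)$$

We first express the coordinates and momenta in terms of the $a$ operators:

$$x_j = \frac{1}{\sqrt2}\,(a_j + a_j^\dagger) \tag{9.219}$$

$$p_j = \frac{1}{\sqrt2\,i}\,(a_j - a_j^\dagger) \tag{9.220}$$

Thus

$$L_+ = \frac{1}{2i}\,\bigl[(a_2 + a_2^\dagger)(a_3 - a_3^\dagger) - (a_3 + a_3^\dagger)(a_2 - a_2^\dagger)\bigr] + \tfrac12\,\bigl[(a_3 + a_3^\dagger)(a_1 - a_1^\dagger) - (a_1 + a_1^\dagger)(a_3 - a_3^\dagger)\bigr]$$

$$= \frac{1}{i}\,(a_2^\dagger a_3 - a_3^\dagger a_2) + (a_3^\dagger a_1 - a_1^\dagger a_3)$$

$$= -a_3\,(a_1^\dagger + i a_2^\dagger) + a_3^\dagger\,(a_1 + i a_2)$$

$$= \sqrt2\,(b_0^\dagger b_- - b_0 b_+^\dagger) \tag{9.221}$$

by virtue of (9.173) and (9.174). Similarly it follows that

$$L_- = \sqrt2\,(b_0^\dagger b_+ - b_0 b_-^\dagger) \tag{9.222}$$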
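We may therefore write

$$\phi_{n,n,n-1} = \frac{1}{\sqrt{n(n+1) - n(n-1)}}\,L_-\,\phi_{nnn}$$

[by virtue of (9.71) and (9.72)] or

$$\phi_{n,n,n-1} = \frac{1}{\sqrt{2n}}\,\frac{1}{\sqrt{n!}}\,\sqrt2\,(b_0^\dagger b_+ - b_0 b_-^\dagger)\,(b_+^\dagger)^n\,\phi_{000}$$

The $b_0$ operator on the ground state gives zero, so that the second term goes out. In the first,

$$b_+\,(b_+^\dagger)^n\,\phi_{000} = n\,(b_+^\dagger)^{n-1}\,\phi_{000}$$

and therefore

$$\phi_{n,n,n-1} = \frac{1}{\sqrt{(n-1)!}}\,b_0^\dagger\,(b_+^\dagger)^{n-1}\,\phi_{000} \tag{9.223}$$

Let us carry this one step further.

$$\phi_{n,n,n-2} = \frac{1}{\sqrt{n(n+1) - (n-1)(n-2)}}\,L_-\,\phi_{n,n,n-1}$$

$$= \frac{1}{\sqrt{2n-1}}\,\frac{1}{\sqrt{(n-1)!}}\,(b_0^\dagger b_+ - b_0 b_-^\dagger)\,b_0^\dagger\,(b_+^\dagger)^{n-1}\,\phi_{000}$$

$$= \frac{\sqrt{n-1}}{\sqrt{2n-1}\,\sqrt{(n-2)!}}\,\bigl[(b_0^\dagger)^2 - (n-1)^{-1}\,b_-^\dagger b_+^\dagger\bigr]\,(b_+^\dagger)^{n-2}\,\phi_{000} \tag{9.224}$$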

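Problem 9-10: Show that the three-dimensional oscillator states with $n = 2$ and $l = 2$ are

$$\phi_{222} = \frac{1}{\sqrt2}\left(\frac1\pi\right)^{3/4}\sin^2\theta\;e^{2i\varphi}\,r^2 e^{-r^2/2}$$

$$\phi_{221} = \sqrt2\left(\frac1\pi\right)^{3/4}\sin\theta\cos\theta\;e^{i\varphi}\,r^2 e^{-r^2/2}$$

$$\phi_{220} = \frac{1}{\sqrt3}\left(\frac1\pi\right)^{3/4}(3\cos^2\theta - 1)\,r^2 e^{-r^2/2}$$

$$\phi_{2,2,-1} = \phi_{221}^{*}$$

$$\phi_{2,2,-2} = \phi_{222}^{*}$$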

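The question then arises, how do we obtain the states for which $l = n-2s$, where $s$ is an integer?

Consider first the case $s = 1$. To obtain the state $\phi_{n,n-2,n-2}$, we see that we must combine the following states:

$$N_+ = n-2, \qquad N_- = 0, \qquad N_0 = 2$$
$$N_+ = n-1, \qquad N_- = 1, \qquad N_0 = 0$$

in such a way that $L_+$ operating on the combined state gives zero. That is to say, we must choose $\alpha$ and $\beta$ so that

$$(b_0^\dagger b_- - b_0 b_+^\dagger)\,\bigl[\alpha\,(b_+^\dagger)^{n-2}(b_0^\dagger)^2 + \beta\,(b_+^\dagger)^{n-1} b_-^\dagger\bigr]\,\phi_{000} = 0$$

This leads immediately to the condition

$$\beta = 2\alpha$$

so that

$$\phi_{n,n-2,n-2} = \alpha\,\bigl[(b_+^\dagger)^{n-2}(b_0^\dagger)^2 + 2\,(b_+^\dagger)^{n-1} b_-^\dagger\bigr]\,\phi_{000} \tag{9.225}$$

Since

$$\frac{1}{\sqrt{(n-2)!\,2!}}\,(b_+^\dagger)^{n-2}(b_0^\dagger)^2\,\phi_{000}$$

and

$$\frac{1}{\sqrt{(n-1)!}}\,(b_+^\dagger)^{n-1} b_-^\dagger\,\phi_{000}$$

are normalized, the normalization condition on (9.225) is

$$\alpha^2\,[2(n-2)! + 4(n-1)!] = 1$$

so that

$$\alpha = \frac{1}{\sqrt{(n-2)!\,2(2n-1)}} \tag{9.226}$$

Therefore, finally,

$$\phi_{n,n-2,n-2} = \frac{1}{\sqrt{(n-2)!\,2(2n-1)}}\,\bigl[(b_+^\dagger)^{n-2}(b_0^\dagger)^2 + 2\,(b_+^\dagger)^{n-1} b_-^\dagger\bigr]\,\phi_{000} \tag{9.227}$$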


Problem 9-11: Show that

$$\phi_{200} = \sqrt{\tfrac23}\,\frac{1}{\pi^{3/4}}\,\bigl(r^2 - \tfrac32\bigr)\,e^{-r^2/2}$$

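Problem 9-12: Calculate all of the three-dimensional oscillator functions for $n = 3$.

Answers

$$\phi_{333} = \frac{1}{\sqrt6}\,\frac{1}{\pi^{3/4}}\,\sin^3\theta\;e^{3i\varphi}\,r^3 e^{-r^2/2}$$

$$\phi_{332} = \frac{1}{\pi^{3/4}}\,\sin^2\theta\cos\theta\;e^{2i\varphi}\,r^3 e^{-r^2/2}$$

$$\phi_{331} = \frac{1}{\sqrt{10}}\,\frac{1}{\pi^{3/4}}\,\sin\theta\,(5\cos^2\theta - 1)\;e^{i\varphi}\,r^3 e^{-r^2/2}$$

$$\phi_{330} = \sqrt{\tfrac{2}{15}}\,\frac{1}{\pi^{3/4}}\,(5\cos^3\theta - 3\cos\theta)\,r^3 e^{-r^2/2}$$

$$\phi_{3,3,-m} = \phi_{33m}^{*}$$

$$\phi_{311} = \frac{1}{\sqrt{10}}\,\frac{1}{\pi^{3/4}}\,\sin\theta\;e^{i\varphi}\,(2r^3 - 5r)\,e^{-r^2/2}$$

$$\phi_{310} = \frac{1}{\sqrt5}\,\frac{1}{\pi^{3/4}}\,\cos\theta\,(2r^3 - 5r)\,e^{-r^2/2}$$

$$\phi_{3,1,-1} = \phi_{311}^{*}$$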


Problem 9-13: Find the momentum eigenfunctions 
for the 1s, 2s, and 2p states of the three-dimensional oscillator. 


Since, knowing $\phi_{nll}$, we may always obtain the $\phi_{nlm}$ $(m < l)$ by using the $L_-$ operator, there remains the problem of finding $\phi_{nll}$. We know that $l = n-2s$, $s$ taking all integer values for which $l \ge 0$.

The state in question must be constructed of a linear combination of products of $n$ of the $b_+^\dagger$, $b_-^\dagger$, and $b_0^\dagger$ operators operating on $\phi_{000}$. If the numbers of these operators are $N_+$, $N_-$, and $N_0$, respectively, we must have

$$N_+ + N_- + N_0 = n$$

and

$$N_+ - N_- = l = n-2s$$

for the state in question. These may be satisfied by taking

$$N_0 = 2r$$
$$N_- = s-r$$
$$N_+ = n-s-r$$

where

$$r = 0, 1, 2, \ldots, s$$


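We therefore take

$$\phi_{nll} = \left[\sum_{r=0}^{s}\lambda_r\,(b_0^\dagger)^{2r}(b_-^\dagger)^{s-r}(b_+^\dagger)^{n-s-r}\right]\phi_{000} \tag{9.228}$$

The $\lambda_r$'s are to be determined from the condition that

$$L_+\,\phi_{nll} = 0 \tag{9.229}$$

or

$$(b_0^\dagger b_- - b_0 b_+^\dagger)\left[\sum_{r=0}^{s}\lambda_r\,(b_0^\dagger)^{2r}(b_-^\dagger)^{s-r}(b_+^\dagger)^{n-s-r}\right]\phi_{000} = 0$$

Using the commutation properties of the $b$ operators, this condition takes the form

$$\sum_{r=0}^{s}\lambda_r\,(s-r)\,(b_0^\dagger)^{2r+1}(b_-^\dagger)^{s-r-1}(b_+^\dagger)^{n-s-r}\,\phi_{000} - \sum_{r=0}^{s}\lambda_r\,2r\,(b_0^\dagger)^{2r-1}(b_-^\dagger)^{s-r}(b_+^\dagger)^{n-s-r+1}\,\phi_{000} = 0 \tag{9.230}$$

We obtain in this way a recurrence relation for the $\lambda_r$'s:

$$\lambda_{r+1} = \frac{s-r}{2(r+1)}\,\lambda_r \tag{9.231}$$

from which we obtain

$$\lambda_r = \lambda_0\,\frac{s!}{2^r\,r!\,(s-r)!} \tag{9.232}$$

It follows that, to within a normalization constant $\lambda_0$,

$$\phi_{nll} = \lambda_0\sum_{r=0}^{s}\frac{s!}{2^r\,r!\,(s-r)!}\,(b_0^\dagger)^{2r}(b_-^\dagger)^{s-r}(b_+^\dagger)^{n-s-r}\,\phi_{000} \tag{9.233}$$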
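A quick check of the coefficients can be made by iterating the recurrence that follows from matching equal operator products in (9.230), namely $\lambda_{r+1} = \frac{s-r}{2(r+1)}\lambda_r$, and comparing with the closed form $\lambda_r/\lambda_0 = s!/[2^r r!(s-r)!]$. The sketch below assumes exactly these two expressions; the function names are illustrative.

```python
from fractions import Fraction
from math import factorial

def lambda_ratios(s):
    """lambda_r / lambda_0 for r = 0..s, built up from the recurrence."""
    ratios = [Fraction(1)]
    for r in range(s):
        ratios.append(ratios[-1] * Fraction(s - r, 2 * (r + 1)))
    return ratios

def closed_form(s, r):
    """lambda_r / lambda_0 = s! / (2^r r! (s-r)!)."""
    return Fraction(factorial(s), 2 ** r * factorial(r) * factorial(s - r))

if __name__ == "__main__":
    for s in range(4):
        print(s, lambda_ratios(s))
```

For $s = 1$ the ratios are $1$ and $\tfrac12$, which is the condition $\beta = 2\alpha$ found in the special case (9.225).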


In all of the preceding discussion of the oscillator problem, lengths are expressed in units of $(\hbar^2/mk)^{1/4}$. It follows that, if normalization is carried out in terms of dimensional coordinates, normalization factors must all be multiplied by

$$\left(\frac{mk}{\hbar^2}\right)^{3/8} = \left(\frac{m\omega}{\hbar}\right)^{3/4} \tag{9.234}$$
11. Three-Dimensional Oscillator; 
Direct Solution 


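Again in terms of dimensionless coordinates, the Schrödinger wave equation is

$$-\tfrac12\nabla^2\psi + \tfrac12 r^2\psi = \epsilon\psi \tag{9.235}$$

If we put

$$\psi = Y_l^m(\theta,\varphi)\,\chi(r) \tag{9.236}$$

and proceed as in the treatment of the potential problem in Chapter 3, we obtain for the radial function $\chi$ the equation

$$-\frac1r\frac{d^2}{dr^2}(r\chi) + \frac{l(l+1)}{r^2}\,\chi + r^2\chi = 2\epsilon\chi \tag{9.237}$$

Putting $r\chi = f$, this becomes

$$-f'' + \frac{l(l+1)}{r^2}\,f + r^2 f = 2\epsilon f \tag{9.238}$$

If we now make the substitution

$$u = r^2 \tag{9.239}$$

we obtain the equation

$$\frac{d^2f}{du^2} + \frac{1}{2u}\frac{df}{du} - \left(\frac14 - \frac{\epsilon}{2u} + \frac{l(l+1)}{4u^2}\right)f = 0 \tag{9.240}$$

It follows, on comparing this with Eq. (4.327), that

$$f = e^{-\alpha u}\,u^{s}\,F(\beta, \gamma, ku) \tag{9.241}$$

where $\alpha$, $\beta$, $\gamma$, $k$, and $s$ are determined by

$$2\alpha - k = 0$$
$$\gamma - 2s = \tfrac12$$
$$\alpha(\alpha-k) = -\tfrac14$$
$$\gamma\alpha - k\beta - 2s\alpha + ks = \epsilon/2$$
$$s(\gamma-s-1) = l(l+1)/4$$

and $F(\beta, \gamma, u)$ is the confluent hypergeometric function. The solution of these equations is

$$k = 1, \qquad \alpha = \tfrac12$$
$$s = \tfrac12(l+1), \qquad \gamma = l+\tfrac32$$
$$\beta = \tfrac12\bigl(l+\tfrac32\bigr) - (\epsilon/2)$$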

Therefore,

$$\chi = e^{-r^2/2}\,r^{l}\,F\Bigl(\tfrac12\bigl(l+\tfrac32\bigr) - \tfrac{\epsilon}{2},\; l+\tfrac32,\; r^2\Bigr) \tag{9.242}$$

In general the confluent hypergeometric function behaves for large $r^2$ like $e^{r^2}$, so that $\chi$ diverges at infinity. The boundary condition (that $\chi$ must vanish at $r \to \infty$) may be satisfied provided

$$\tfrac12\bigl(l+\tfrac32\bigr) - \frac{\epsilon}{2} = -\nu \tag{9.243}$$

where $\nu$ is a non-negative integer, so that

$$\epsilon = 2\nu + l + \tfrac32 \tag{9.244}$$

It follows that the (unnormalized) radial wave function is

$$\chi = r^{l}\,e^{-r^2/2}\,F\bigl(-\nu,\; l+\tfrac32,\; r^2\bigr) = r^{l}\,e^{-r^2/2}\,F\Bigl(-\frac{n-l}{2},\; l+\tfrac32,\; r^2\Bigr) \tag{9.245}$$

where

$$n = l + 2\nu \tag{9.246}$$

Comparing this with Eq. (4.295) of Chapter 4, we see that

$$\chi = A\,r^{l}\,e^{-r^2/2}\,L_{(n-l)/2}^{\,l+1/2}(r^2) \tag{9.247}$$

where $A$ is an arbitrary constant and $L$ is the Laguerre polynomial.
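When the first argument is a negative integer, the confluent hypergeometric series $F(-\nu, \gamma, x) = \sum_k \frac{(-\nu)_k}{(\gamma)_k}\frac{x^k}{k!}$ terminates after $\nu+1$ terms. The sketch below, which is only an illustration with a hypothetical function name, generates the polynomial coefficients exactly.

```python
from fractions import Fraction

def kummer_polynomial(nu, c):
    """Coefficients [a_0, ..., a_nu] of F(-nu, c, x), lowest power first."""
    coeffs = [Fraction(1)]
    for k in range(nu):
        # ratio of successive terms: (-nu + k) / ((c + k)(k + 1))
        coeffs.append(coeffs[-1] * Fraction(-nu + k) / (Fraction(c) + k) / (k + 1))
    return coeffs

if __name__ == "__main__":
    # n = 2, l = 0 corresponds to nu = 1 and gamma = 3/2
    print(kummer_polynomial(1, Fraction(3, 2)))
    # n = 4, l = 0 corresponds to nu = 2 and gamma = 3/2
    print(kummer_polynomial(2, Fraction(3, 2)))
```

For $n = 2$, $l = 0$ this gives $1 - \tfrac23 r^2$, so the radial function is proportional to $(r^2 - \tfrac32)\,e^{-r^2/2}$.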


Problem 9-14: Calculate the normalized radial wave function for $n = 2$, $l = 0$, and $l = 2$; and $n = 3$, $l = 1$, and $l = 3$.


12. Hydrogen Atom 
The wave equation for a particle in a central field with potential

$$V = -\frac{Ze^2}{r} \tag{9.248}$$

is

$$-\frac{\hbar^2}{2m}\nabla^2\psi - \frac{Ze^2}{r}\,\psi = E\psi \tag{9.249}$$

If $-e$ is the electron charge, the case in which $Z = 1$ is that of the hydrogen atom. $Z = 2$ corresponds to the singly charged helium ion He$^+$, $Z = 3$ to the ion Li$^{++}$, etc. All such single-electron ions are generally designated "hydrogenic."

The problem of these systems is not really that of a one-particle system; the kinetic energy is

$$T = \tfrac12 m\,\dot{\mathbf r}_1^{\,2} + \tfrac12 M\,\dot{\mathbf r}_2^{\,2} \tag{9.250}$$

where $m$, $M$ are, respectively, the mass of electron and nucleus. If we introduce the position vector $\mathbf R$ of the center of mass and the relative position vector $\mathbf r$,

$$\mathbf R = \frac{1}{m+M}\,(m\mathbf r_1 + M\mathbf r_2) \tag{9.251}$$

and

$$\mathbf r = \mathbf r_1 - \mathbf r_2 \tag{9.252}$$

we find that

$$T = \tfrac12 (m+M)\,\dot{\mathbf R}^2 + \tfrac12\,\frac{mM}{m+M}\,\dot{\mathbf r}^2 \tag{9.253}$$

If we take a system in which the mass center is at rest, we may ignore the first term. The problem of relative motion is then formally identical with the problem of a single particle with mass $mM/(m+M)$ (the "reduced mass"). For $M \gg m$, this is very nearly $m$. In general, the $m$ in (9.249) should be replaced by the reduced mass.
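To get a feeling for the size of the reduced-mass correction, the following sketch evaluates $mM/(m+M)$ in units of the electron mass. The proton-to-electron mass ratio of about 1836.15 used here is an approximate value supplied for illustration and is not taken from the text; the function name is hypothetical.

```python
def reduced_mass(m, M):
    """Reduced mass m*M/(m + M) of a two-body system."""
    return m * M / (m + M)

if __name__ == "__main__":
    proton_to_electron = 1836.15  # approximate mass ratio, assumed for illustration
    print(reduced_mass(1.0, proton_to_electron))  # hydrogen, in electron masses
    print(reduced_mass(1.0, 1.0))                 # two equal masses, e.g. electron + positron
```

For hydrogen the correction is about five parts in ten thousand, while for two equal masses the reduced mass is exactly half the mass of either particle.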

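If we make the substitution

$$\psi = \Phi_l(r)\,Y_l^m(\theta,\varphi) \tag{9.254}$$

the equation for the radial wave function $\Phi_l$ is

$$-\frac{\hbar^2}{2m}\left[\frac1r\frac{d^2}{dr^2}(r\Phi_l) - \frac{l(l+1)}{r^2}\,\Phi_l\right] - \frac{Ze^2}{r}\,\Phi_l = E\Phi_l \tag{9.255}$$

in the state in which the angular momentum is given by the quantum numbers $(l, m)$.

The equation may be simplified by multiplying through by $2\hbar^2/mZ^2e^4$ and introducing the following dimensionless units:

$$\xi = \frac{mZe^2}{\hbar^2}\,r \tag{9.256}$$

and

$$\epsilon = \frac{E}{E_0} \tag{9.257}$$

where

$$E_0 = \frac{mZ^2e^4}{2\hbar^2} \tag{9.258}$$

If we now put

$$\xi\Phi_l = f_l$$

the Schrödinger equation takes the form

$$\frac{d^2f_l}{d\xi^2} - \frac{l(l+1)}{\xi^2}\,f_l + \frac{2}{\xi}\,f_l + \epsilon f_l = 0 \tag{9.259}$$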

The "factorization method" can be used again in the problem, though in a somewhat different form than in the case of the oscillator problem. Introduce the operators

$$A_l^+ = \frac{d}{d\xi} - \frac{l+1}{\xi} + \frac{1}{l+1} \tag{9.260}$$

and

$$A_l^- = \frac{d}{d\xi} + \frac{l+1}{\xi} - \frac{1}{l+1} \tag{9.261}$$

It is then easily verified that

$$A_l^- A_l^+ f_l = \frac{d^2f_l}{d\xi^2} - \frac{l(l+1)}{\xi^2}\,f_l + \frac{2}{\xi}\,f_l - \frac{1}{(l+1)^2}\,f_l$$

By virtue of the wave equation, this may be written

$$A_l^- A_l^+ f_l = -\left[\epsilon_l + \frac{1}{(l+1)^2}\right] f_l \tag{9.262}$$

where $\epsilon_l$ is the energy, in dimensionless units, for the angular momentum state $l$.

The equation for the angular momentum state $(l+1)$ can be written

$$A_l^+ A_l^- f_{l+1} = -\left[\epsilon_{l+1} + \frac{1}{(l+1)^2}\right] f_{l+1} \tag{9.263}$$

Let us now operate on (9.262) with the operator $A_l^+$, and on (9.263) with $A_l^-$, to obtain

$$A_l^+ A_l^-\,[A_l^+ f_l] = -\left[\epsilon_l + \frac{1}{(l+1)^2}\right][A_l^+ f_l] \tag{9.264}$$

and

$$A_l^- A_l^+\,[A_l^- f_{l+1}] = -\left[\epsilon_{l+1} + \frac{1}{(l+1)^2}\right][A_l^- f_{l+1}] \tag{9.265}$$

By comparing (9.265) with (9.262) we see that

$$\epsilon_{l+1} = \epsilon_l \tag{9.266}$$

and

$$A_l^- f_{l+1} = K_l\,f_l \tag{9.267}$$

where we assume the $\Phi$'s to be normalized. Thus, the energy levels do not seem to depend on $l$; this represents a rather unexpected degeneracy.
The quantity $K_l$ may be obtained by normalization. Writing

$$(g_1, g_2) = \int g_1^{*}(\xi)\,g_2(\xi)\,d\xi \tag{9.268}$$

we derive from (9.267) that

$$K_l^2 = (A_l^- f_{l+1},\; A_l^- f_{l+1}) \tag{9.269}$$

by virtue of the normalization of $f_l$, that is,

$$(f_l, f_l) = 1 \tag{9.270}$$

On integration by parts, and using the condition that the wave function must vanish faster than $1/r$ at infinity, we find that

$$K_l^2 = -(f_{l+1},\; A_l^+ A_l^- f_{l+1}) = \epsilon_{l+1} + \left(\frac{1}{l+1}\right)^2 \tag{9.271}$$

Similarly a comparison of (9.263) and (9.264) shows that

$$A_l^+ f_l = K_l'\,f_{l+1} \tag{9.272}$$

where $K_l'$ is a constant to be determined. The consequence of equating scalar products of the two sides of (9.272) is that

$$K_l'^{\,2} = (A_l^+ f_l,\; A_l^+ f_l) = -(f_l,\; A_l^- A_l^+ f_l) = \epsilon_l + \left(\frac{1}{l+1}\right)^2 \tag{9.273}$$

so that

$$K_l = K_l' = \sqrt{\epsilon + \left(\frac{1}{l+1}\right)^2} \tag{9.274}$$

We note that, if $\epsilon > 0$, there is no upper limit to $l$. Since positive energy corresponds to an unbound (scattering) state this is not a particularly surprising result.

If the energy $\epsilon$ is negative (that is, if we are concerned with bound states), since $K_l^2$ and $K_l'^{\,2}$ must be positive, $l$ must be bounded. The only way to break the chain of states of increasing $l$ due to successive application of the $A_l^+$ operator is by having

$$\epsilon = -\frac{1}{n^2} \tag{9.275}$$

where $n$ is an integer. Then the largest possible value of $l$ is $l = n-1$.
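The termination of the chain can be followed explicitly. The sketch below evaluates $K_l^2 = \epsilon + 1/(l+1)^2$ at $\epsilon = -1/n^2$ and collects the values of $l$ reached before it vanishes. It is an illustrative aid under exactly that assumption, with hypothetical function names.

```python
from fractions import Fraction

def k_squared(n, l):
    """K_l^2 = eps + 1/(l+1)^2 with eps = -1/n^2, in dimensionless units."""
    return Fraction(-1, n * n) + Fraction(1, (l + 1) ** 2)

def allowed_l(n):
    """Values of l reached from l = 0 before the raising chain terminates."""
    values, l = [0], 0
    while k_squared(n, l) > 0:
        l += 1
        values.append(l)
    return values

def degeneracy(n):
    """Number of (l, m) states in level n: the sum of (2l + 1) over the allowed l."""
    return sum(2 * l + 1 for l in allowed_l(n))

if __name__ == "__main__":
    for n in range(1, 5):
        print(n, allowed_l(n), degeneracy(n))
```

The chain stops at $l = n-1$, and counting the $m$ values gives $n^2$ states per level.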
The equation for the lowest energy state is [from (9.272)]

$$A_0^+ f_{10} = 0 \tag{9.276}$$

More generally, the $l = n-1$ state is determined by

$$A_{n-1}^+ f_{n,n-1} = 0 \tag{9.277}$$

where $f_{n,n-1}$ designates the state with energy $\epsilon = -1/n^2$ and $l = n-1$. The differential equation to be solved is

$$\frac{df}{d\xi} - \frac{n}{\xi}\,f + \frac{1}{n}\,f = 0 \tag{9.278}$$

which has the solution

$$f_{n,n-1} = C\,\xi^{n}\,e^{-\xi/n}$$

The normalization factor $C$ is determined by

$$C^2\int_0^\infty \xi^{2n}\,e^{-2\xi/n}\,d\xi = 1$$

Since the integral has the value $(2n)!\,(n/2)^{2n+1}$,

$$C = \left(\frac{2}{n}\right)^{n}\sqrt{\frac{2}{n\,(2n)!}}$$

Finally, then, we may write down the wave functions

$$\psi_{nlm} = \psi_{n,n-1,m} = \left(\frac{2}{n}\right)^{n}\sqrt{\frac{2}{n\,(2n)!}}\;\xi^{n-1}\,e^{-\xi/n}\,Y_{n-1}^{m}(\theta,\varphi) \tag{9.279}$$

The normalized radial functions for lower angular momenta are obtained as follows:

$$f_{n,l} = \frac{1}{\sqrt{1/(l+1)^2 - (1/n^2)}}\,A_l^-\,f_{n,l+1} \tag{9.280}$$

so that $\Phi_{n,l-1}$ is expressed in terms of $\Phi_{n,l}$ by the formula

$$\Phi_{n,l-1} = \frac{1}{\sqrt{(1/l^2) - (1/n^2)}}\;\frac{1}{\xi}\,A_{l-1}^-\,(\xi\Phi_{n,l}) \tag{9.281}$$
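The downward recursion can be carried out mechanically on polynomials. The sketch below is an illustration, not the author's procedure in detail: it writes $f = P(\xi)\,e^{-\xi/n}$, assumes the lowering operator has the form $A_l^- = d/d\xi + (l+1)/\xi - 1/(l+1)$, stores $P$ as a list of exact coefficients, and leaves out the normalization factors. The function names are hypothetical.

```python
from fractions import Fraction

def top_polynomial(n):
    """P(xi) = xi^n for the l = n-1 state, as coefficients indexed by power."""
    return [Fraction(0)] * n + [Fraction(1)]

def apply_lowering(poly, n, l):
    """Apply A_l^- to P(xi) exp(-xi/n); return the polynomial part of the result."""
    out = [Fraction(0)] * len(poly)
    for p, a in enumerate(poly):
        if a == 0:
            continue
        # the derivative and the (l+1)/xi term both lower the power by one
        out[p - 1] += a * (p + l + 1)
        # the exponential's derivative and the -1/(l+1) term keep the power
        out[p] -= a * (Fraction(1, n) + Fraction(1, l + 1))
    return out

if __name__ == "__main__":
    f21 = top_polynomial(2)
    print(apply_lowering(f21, 2, 0))   # unnormalized f_20
    f31 = apply_lowering(top_polynomial(3), 3, 1)
    print(f31, apply_lowering(f31, 3, 0))
```

For $n = 2$ the result is $3\xi - \tfrac32\xi^2 = \tfrac32\,\xi\,(2-\xi)$, so that $\Phi_{2,0} \propto (2-\xi)\,e^{-\xi/2}$; for $n = 3$ the two steps give polynomials proportional to $\xi^2(6-\xi)$ and $\xi\,(27 - 18\xi + 2\xi^2)$.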


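Problem 9-15: Show that the following are the radial wave functions for the $n = 1$, $n = 2$, and $n = 3$ states of the hydrogenic atom:

$$\Phi_{1,0} = 2\,e^{-\xi}$$

$$\Phi_{2,1} = \frac{1}{2\sqrt6}\,\xi\,e^{-\xi/2}$$

$$\Phi_{2,0} = \frac{1}{2\sqrt2}\,(2-\xi)\,e^{-\xi/2}$$

$$\Phi_{3,2} = \frac{2\sqrt2}{81\sqrt{15}}\,\xi^2\,e^{-\xi/3}$$

$$\Phi_{3,1} = \frac{2\sqrt2}{81\sqrt3}\,(6\xi - \xi^2)\,e^{-\xi/3}$$

$$\Phi_{3,0} = \frac{2}{81\sqrt3}\,(27 - 18\xi + 2\xi^2)\,e^{-\xi/3}$$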
Problem 9-16: By comparing (9.259) with (4.327) show that

$$\Phi_l = C_l\,\xi^{l}\,e^{-\sqrt{-\epsilon}\,\xi}\,F\Bigl(l+1-\frac{1}{\sqrt{-\epsilon}},\; 2(l+1),\; 2\sqrt{-\epsilon}\,\xi\Bigr)$$

for $\epsilon < 0$, $C_l$ being a normalization constant and $F$ the confluent hypergeometric function.

Considering the asymptotic form of the confluent hypergeometric function, show that, for $\Phi_l$ to vanish as $\xi \to \infty$, we must have

$$\epsilon = -\frac{1}{n^2}$$

where $n$ is an integer $\ge (l+1)$, so that

$$\Phi_l = C_l\,\xi^{l}\,e^{-\xi/n}\,F\Bigl(l+1-n,\; 2(l+1),\; \frac{2\xi}{n}\Bigr)$$


Problem 9-17: Using the expression for Laguerre functions in terms of confluent hypergeometric functions given in (4.295), show that

$$\Phi_l = C_l'\,\xi^{l}\,e^{-\xi/n}\,L_{n-(l+1)}^{2l+1}\Bigl(\frac{2\xi}{n}\Bigr)$$

Use this to verify the results of Problem 9-15.


Problem 9-18: Find the momentum eigenfunctions 
for the 1s, 2s, and 2p states of the hydrogen atom. 


Problem 9-19: Determine the eigenvalues and eigen- 
functions of the positronium atom (electron + positron). 
The annihilation rate depends directly on the density at coincidence. 
Comment on the value of this quantity in the various states. 


Problem 9-20: Consider an atom with one 2p electron placed in the electric field of a crystal, the field being represented by the potential

$$V = Ax^2 + By^2 - (A+B)z^2$$

Treating the field as a perturbation, determine the levels to first order.
Problem 9-21: A particle is subject to the central 


attractive potential —V,e~~. Show that the energies of its s states are 
given by the equation. 


where 


Thus, show, that, for there to be a bound state, it is necessary that 
242 
V> (2.405 )2h7a2 


2m 


Problem 9-22: The potential of a diatomic molecule 
may be represented by 


in appropriate units. Show that the radial wave function has the form 
R= rs Je-"F(s—y?"/X, 2s, 2dr) 
where F 1s the confluent hypergeometric function, 


wl E| wD 


— » Te : 


h2 
s=34 V¥+ (+Iy 


pis the reduced mass and / is the angular momentum. Show that the 
energy levels are given by 


h2 y* 


|e ee A 
2m [nt$4+ V+ EE? 


Problem 9-23: Another form of potential energy 
curve for a diatomic molecule is 


V = D(1—e774R)2_ DN) 




where R = (r—a)/a. Show that the s-state wave functions are 


— 1 .-w2s7(—n, 2s + 1,2) 
r 


ere _ [pa |E 
Z= ae~7AR s= \/ a 
pa®?D 


= _  /2uDa? 
h? By (s 1 2), Y h? B? 


Show also that the energy levels are 


2 8B°D _ 144 2B 
Eq = (nth aniae (td 


where 


h= 


13. Positive Energy States and 
the Problem of Scattering 


There are two ways of dealing with problems of scattering. One approach is the time-dependent one, in which the incident particle is described, let us say, by a wave packet, and we follow its evolution in time according to the operator $U(t, t_0)$ introduced in the previous chapter. Normally, the initial time $t_0$ is taken to be $-\infty$, and the particle is initially free (i.e., localized outside of the potential). The probability of scattering, and the scattering cross section, are then calculated from the value of $U$ at $t = \infty$, when the particle is again far from the scattering center. Thus, the process is described in terms of the "scattering operator" ("scattering matrix," if one deals with matrix elements of the operator)

$$S = U(\infty, -\infty) \tag{9.282}$$

The other approach, which we follow here, is to describe the scattering in terms of a steady-state formalism. (This is of course only possible if the scattering potential is independent of time.) In this approach, we imagine a continuous influx of incident particles, and hence also of scattered ones. Since nothing then varies with time, we can work with stationary state wave functions.


14. Partial-Wave Theory of 
Scattering by a Potential 


We consider then the problem of a particle which is free outside a
sphere of radius $r = a$ but which is scattered (and perhaps absorbed) by
a potential inside the radius. We assume that the incident particle has a
given momentum $p$ and therefore wave number $k = p/\hbar$. This may be
taken in the z direction. The particle may then be represented by an
incident wave function $\psi_{\rm inc} = e^{ikz}$. Following now the formalism of
Chapter 5 on scattering of sound waves [formulas (5.202) and following]
we expand the incident wave in angular momentum components,

$$ e^{ikz} = \sum_{l=0}^{\infty} i^l (2l+1)\, j_l(kr)\, P_l(\cos\theta) \tag{9.283} $$


This is a decomposition into angular momentum eigenfunctions,
represented by the Legendre polynomials $P_l(\cos\theta)$. The expansion
expresses the fact that a plane wave has components of all possible angular
momenta about the scattering center.
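
As a numerical illustration of (9.283), the partial-wave sum can be compared directly with the plane wave. The following is only a minimal sketch in plain Python and is not part of the original text: the helper names (`spherical_bessel_j`, `legendre_p`, `plane_wave_partial_sum`), the cutoff `lmax = 25`, and the sample values of kr and θ are assumptions chosen for illustration. The spherical Bessel functions are generated by downward recurrence and normalized to j_0(x) = sin x / x, so x should not lie near a zero of sin x.

```python
import cmath
import math

def spherical_bessel_j(lmax, x):
    """Return [j_0(x), ..., j_lmax(x)] by downward recurrence."""
    start = lmax + 20                      # start well above lmax (heuristic choice)
    j = [0.0] * (start + 2)
    j[start] = 1e-30                       # arbitrary tiny seed; j[start + 1] stays 0
    for l in range(start, 0, -1):
        # j_{l-1}(x) = (2l + 1)/x * j_l(x) - j_{l+1}(x)
        j[l - 1] = (2 * l + 1) / x * j[l] - j[l + 1]
    scale = (math.sin(x) / x) / j[0]       # fix the normalization with j_0
    return [value * scale for value in j[: lmax + 1]]

def legendre_p(lmax, mu):
    """Return [P_0(mu), ..., P_lmax(mu)] by the three-term recurrence."""
    p = [1.0, mu]
    for n in range(1, lmax):
        p.append(((2 * n + 1) * mu * p[n] - n * p[n - 1]) / (n + 1))
    return p[: lmax + 1]

def plane_wave_partial_sum(kr, theta, lmax=25):
    """Right-hand side of (9.283), truncated at l = lmax."""
    j = spherical_bessel_j(lmax, kr)
    p = legendre_p(lmax, math.cos(theta))
    i_pow = [1, 1j, -1, -1j]               # i**l for l mod 4
    return sum(i_pow[l % 4] * (2 * l + 1) * j[l] * p[l] for l in range(lmax + 1))

kr, theta = 2.0, 0.7                        # sample values (assumed)
exact = cmath.exp(1j * kr * math.cos(theta))
approx = plane_wave_partial_sum(kr, theta)
print(abs(exact - approx))                  # difference between the two sides
```

For kr of order unity only the first few terms contribute appreciably, which is the numerical counterpart of the statement that high angular momenta are important only at large distances from the scattering center.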

Using the formula for $j_l(kr)$ in terms of the Hankel functions,

$$ j_l(kr) = \tfrac{1}{2}\left[h_l^{(1)}(kr) + h_l^{(2)}(kr)\right] \tag{5.204} $$

and recognizing that $h_l^{(1)}$ describes a flow outward from the scattering
center and $h_l^{(2)}$ one inward towards it, we can describe the scattering in
terms of a modification of the outgoing wave. Thus, we can write the total
wave function, including the scattered wave, as

$$ \psi = \tfrac{1}{2} \sum_l i^l (2l+1)\left[w_l h_l^{(1)}(kr) + h_l^{(2)}(kr)\right] P_l(\cos\theta) \tag{9.284} $$

the scattered wave being specifically

$$ \psi_{\rm sc} = \tfrac{1}{2} \sum_l i^l (2l+1)(w_l - 1)\, h_l^{(1)}(kr)\, P_l(\cos\theta) \tag{9.285} $$


which is, of course, entirely outgoing. 

The constants $w_l$ are determined by solving the wave equation
inside the region $r < a$ and fitting $\psi$ and its radial derivative at $r = a$.

Since the wave function must be finite at the origin, $r\psi = \phi$ must be
zero there.

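If we expand

$$ \phi = \sum_l R_l(r) P_l(\cos\theta) \tag{9.286} $$

in the region of the potential, $R_l$ satisfies

$$ -\frac{\hbar^2}{2m}\left[\frac{d^2 R_l}{dr^2} - \frac{l(l+1)}{r^2} R_l\right] + V R_l = E R_l \tag{9.287} $$

or

$$ \frac{d^2 R_l}{dr^2} - \frac{l(l+1)}{r^2} R_l + (k^2 - U) R_l = 0 \tag{9.288} $$

where

$$ U = \frac{2m}{\hbar^2} V $$

and

$$ k^2 = \frac{2mE}{\hbar^2} \tag{9.289} $$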


Only one of the two solutions of (9.288), the one which goes as $r^{l+1}$ for
$r \to 0$, is physically acceptable. Therefore, there is only one arbitrary
multiplicative constant in $R_l$:

$$ R_l = A_l g_l(r) \tag{9.290} $$


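Since the radial wave function for each angular momentum (“partial
wave”) must be fitted to its outside value at $r = a$, the following equations
determine the matching of the wave functions at the boundary:

$$ \tfrac{1}{2} i^l (2l+1)\left[w_l h_l^{(1)}(ka) + h_l^{(2)}(ka)\right] = A_l\, \frac{1}{a}\, g_l(a) \tag{9.291} $$

$$ \tfrac{1}{2} i^l (2l+1)\left[w_l h_l^{(1)\prime}(ka) + h_l^{(2)\prime}(ka)\right] k = A_l \left(\frac{d}{dr}\frac{g_l(r)}{r}\right)_{r=a} = \frac{A_l}{a}\left(g_l'(a) - \frac{1}{a}\, g_l(a)\right) \tag{9.292} $$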


Dividing the second of these equations by the first, we obtain an equation
for $w_l$,

$$ ka\, \frac{w_l h_l^{(1)\prime}(ka) + h_l^{(2)\prime}(ka)}{w_l h_l^{(1)}(ka) + h_l^{(2)}(ka)} = a\, \frac{g_l'(a)}{g_l(a)} - 1 = \gamma_l(E) \tag{9.293} $$


We have written $\gamma_l(E)$ in recognition of the fact that it does in fact depend
on the energy of the incident particle.
Solving for $w_l$, we find that

$$ w_l = -\frac{ka\, h_l^{(2)\prime}(ka) - \gamma_l h_l^{(2)}(ka)}{ka\, h_l^{(1)\prime}(ka) - \gamma_l h_l^{(1)}(ka)} \tag{9.294} $$

Thus, by solving the wave equation inside the region of the potential it
is possible to find $w_l$. The problem of the scattering, absorption, and total
cross sections may be solved in terms of $w_l$, which may now be assumed
known. However, let us, before proceeding, make one general observation.
If $\gamma_l$ is real, the numerator of (9.294) is the complex conjugate of the
denominator and $|w_l| = 1$. In this case, the flux of particles outward from
the scatterer is the same as the flux in toward it; there is, therefore, no
absorption. If, on the other hand, $|w_l| < 1$, some absorption takes place.
It is clearly impossible, on physical grounds, to have $|w_l| > 1$.

Let us now calculate the cross sections in terms of the quantities $w_l$.
The partial cross sections (for a given $l$) are defined as follows:

Scattering cross section $\sigma_{\rm sc}^{(l)}$ = (flux of scattered wave)/(incident flux
per unit area).

Absorption cross section $\sigma_{\rm abs}^{(l)}$ = [(flux of convergent wave) - (flux of
divergent wave)]/(incident flux per unit area).

Total cross section

$$ \sigma_{\rm tot}^{(l)} = \sigma_{\rm sc}^{(l)} + \sigma_{\rm abs}^{(l)} \tag{9.295} $$


The incident flux per unit area is the mean value of the velocity
operator $(1/m)\mathbf{p}$:

$$ \int e^{-ikz}\, \frac{1}{m}\, p_z\, e^{ikz}\, dz\, dx\, dy $$

where the x and y integrals are over a unit area. This is

$$ F_{\rm inc} = \frac{\hbar k}{m} \tag{9.296} $$


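The total efflux associated with the $l$th component of the scattered
wave is

$$ F_{\rm sc}^{(l)} = \frac{\hbar k}{m} \int |\psi_{\rm sc}^{(l)}|^2\, 2\pi r^2\, d\mu \tag{9.297} $$

where $\mu = \cos\theta$. This should be calculated at large $r$, where we may use
the asymptotic form of the Hankel function. From (9.285)

$$ \psi_{\rm sc}^{(l)} = \tfrac{1}{2} i^l (2l+1)(w_l - 1)(-i)^{l+1}\, \frac{e^{ikr}}{kr}\, P_l(\mu) \tag{9.298} $$

It is an immediate consequence that

$$ F_{\rm sc}^{(l)} = \frac{\hbar k}{m}\, \frac{1}{4}(2l+1)^2\, |1 - w_l|^2\, \frac{1}{k^2 r^2}\, 2\pi r^2\, \frac{2}{2l+1} = \frac{\hbar k}{m}\, \frac{\pi}{k^2}(2l+1)\, |1 - w_l|^2 \tag{9.299} $$

Thus the scattering cross section is

$$ \sigma_{\rm sc}^{(l)} = \frac{F_{\rm sc}^{(l)}}{F_{\rm inc}} = \frac{\pi}{k^2}(2l+1)\, |1 - w_l|^2 \tag{9.300} $$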


The terms involved in calculating the absorption cross section may
be written down immediately from the above. The converging flux is like
the $F_{\rm sc}^{(l)}$ above except that the factor $(w_l - 1)$ is missing; in the diverging
flux the $(w_l - 1)$ is replaced by $w_l$. Therefore,

$$ \sigma_{\rm abs}^{(l)} = \frac{\pi}{k^2}(2l+1)\left(1 - |w_l|^2\right) \tag{9.301} $$


For large enough $l$, $w_l \to 1$ and there is no longer any absorption. If we
write

$$ \sigma_{\rm sc} = \sum_l \sigma_{\rm sc}^{(l)} \tag{9.302} $$

and

$$ \sigma_{\rm abs} = \sum_l \sigma_{\rm abs}^{(l)} \tag{9.303} $$

the series converge. This is because, as $l$ increases, less and less of the
wave function is found within the region where the potential is nonzero.
Since the formulas for the partial cross sections are the same as in
the problem of sound waves (Chapter 5), the inequalities discussed there
[Eqs. (5.233) to (5.247)] again apply. In particular, the total cross section
is

$$ \sigma_{\rm tot} = \frac{2\pi}{k^2}\, {\rm Re} \sum_l (2l+1)(1 - w_l) \tag{9.304} $$
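
The consistency of (9.300), (9.301), and (9.304) for a single partial wave rests on the identity |1 - w|^2 + 1 - |w|^2 = 2 Re(1 - w). The short sketch below checks this numerically; the function name `partial_cross_sections` and the sample value of w are illustrative assumptions, not taken from the text.

```python
import cmath
import math

def partial_cross_sections(w, l, k):
    """Return (sigma_sc, sigma_abs, sigma_tot) for one partial wave l."""
    pref = math.pi / k**2 * (2 * l + 1)
    sigma_sc = pref * abs(1 - w) ** 2        # (9.300)
    sigma_abs = pref * (1 - abs(w) ** 2)     # (9.301)
    sigma_tot = 2 * pref * (1 - w).real      # the l-th term of (9.304)
    return sigma_sc, sigma_abs, sigma_tot

w = 0.8 * cmath.exp(0.9j)                    # hypothetical amplitude with |w| < 1
sc, ab, tot = partial_cross_sections(w, l=2, k=1.3)
print(sc + ab, tot)                          # the two numbers should agree
```

When |w| = 1 the absorption term vanishes and the total cross section reduces to the scattering cross section.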


15. Low-Energy Scattering 


An interesting case which we explore more fully is that in which the
energy of the incident particles is low, so that $ka \ll 1$. We discuss the
problem of a “hard sphere,” as well as that of a constant scattering
potential, both real and complex.

First, let us look at the formula (9.294) for $ka \ll 1$. In terms of $\gamma_0$,
$w_0$ has the following simple form:

$$ w_0 = e^{-2ika}\, \frac{1 + \gamma_0 + ika}{1 + \gamma_0 - ika} \tag{9.305} $$

If we define an angle $\beta_0$ by

$$ \tan\beta_0 = \frac{ka}{1 + \gamma_0} \tag{9.306} $$

$w_0$ can be written

$$ w_0 = e^{2i\delta_0} = e^{2i(\beta_0 - ka)} \tag{9.307} $$

so that the $l = 0$ phase shift is

$$ \delta_0 = \beta_0 - ka = \tan^{-1}\frac{ka}{1 + \gamma_0} - ka \tag{9.308} $$

The scattering cross section is then

$$ \sigma_{\rm sc}^{(0)} = \frac{\pi}{k^2}\, |1 - w_0|^2 = \frac{4\pi}{k^2} \sin^2\delta_0 \tag{9.309} $$

A specific form is found on substituting from (9.308):

$$ \sigma_{\rm sc}^{(0)} = \frac{4\pi}{k^2}\, \frac{[ka\cos ka - (1 + \gamma_0)\sin ka]^2}{(1 + \gamma_0)^2 + k^2 a^2} \tag{9.310} $$

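For $ka \ll 1$ this becomes approximately

$$ \sigma_{\rm sc}^{(0)} = \frac{4\pi a^2 \gamma_0^2}{(1 + \gamma_0)^2} \tag{9.311} $$

For the case of a “hard sphere,” $\psi = 0$ at $r = a$, so $\gamma_0 \to \infty$. The cross
section then is

$$ \sigma_{\rm sc}^{(0)} = \frac{4\pi}{k^2} \sin^2 ka \approx 4\pi a^2 \tag{9.312} $$

On the other hand, for a constant potential $V = -V_0$ inside $r = a$,

$$ g_0 = \sin Kr \tag{9.313} $$

where

$$ K^2 = \frac{2m}{\hbar^2}(E + V_0) \tag{9.314} $$

Then

$$ \gamma_0(E) = Ka \cot Ka - 1 \tag{9.315} $$

It appears that there are resonant maxima in $\sigma_{\rm sc}^{(0)}$ when $\gamma_0 \approx -1$, that is,
$\cot Ka = 0$. In such a region of energy

$$ \sigma_{\rm sc}^{(0)} = \frac{4\pi}{k^2} \cos^2 ka \approx \frac{4\pi}{k^2} \tag{9.316} $$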
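
The l = 0 formulas (9.305) to (9.316) are easy to evaluate numerically. The following is a minimal sketch, assuming units in which a = 1; the function names and the sample values of k and K are hypothetical and serve only to compare the two forms of the cross section and to exhibit the hard-sphere and resonance limits.

```python
import cmath
import math

def w0(ka, gamma0):
    """Amplitude ratio for l = 0, equation (9.305)."""
    return cmath.exp(-2j * ka) * (1 + gamma0 + 1j * ka) / (1 + gamma0 - 1j * ka)

def sigma0_from_w(k, a, gamma0):
    """First form of (9.309): (pi/k^2) |1 - w0|^2."""
    return math.pi / k**2 * abs(1 - w0(k * a, gamma0)) ** 2

def sigma0_closed(k, a, gamma0):
    """Closed form (9.310)."""
    ka = k * a
    num = (ka * math.cos(ka) - (1 + gamma0) * math.sin(ka)) ** 2
    return 4 * math.pi / k**2 * num / ((1 + gamma0) ** 2 + ka**2)

def gamma0_square_well(K, a):
    """Equation (9.315) for a real constant potential, interior wave number K."""
    return K * a / math.tan(K * a) - 1

a, k = 1.0, 0.05                 # sample low-energy values (assumed), ka << 1
g0 = gamma0_square_well(2.0, a)  # K = 2.0 is a hypothetical interior wave number
print(sigma0_from_w(k, a, g0), sigma0_closed(k, a, g0))

# Hard sphere: gamma0 -> infinity, imitated here by a very large number.
print(sigma0_closed(k, a, 1e12), 4 * math.pi * a**2)

# Resonance: cot(Ka) = 0, i.e. Ka = pi/2, gives gamma0 = -1 as in (9.316).
print(sigma0_closed(k, a, gamma0_square_well(math.pi / 2, a)), 4 * math.pi / k**2)
```

The last line illustrates how much larger the resonant cross section, of order 4π/k², is than the geometrical value 4πa² when ka is small.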
Note that the selection of a complex value for the scattering potential,

$$ V = -V_1 - iV_2 \tag{9.317} $$

leads to an absorption in addition to a scattering. $K$, as defined in (9.314),
becomes complex

$$ K = K_1 + iK_2 \tag{9.318} $$

and

$$ g_0 = \sin K_1 r \cosh K_2 r + i \cos K_1 r \sinh K_2 r \tag{9.319} $$

In this case

$$ \gamma_0(E) = (K_1 a + iK_2 a)\, \frac{\cos K_1 a \cosh K_2 a - i \sin K_1 a \sinh K_2 a}{\sin K_1 a \cosh K_2 a + i \cos K_1 a \sinh K_2 a} - 1 = \gamma_1(E) - i\gamma_2(E) \tag{9.320} $$

It follows from (9.305) that

$$ w_0 = e^{-2ika}\, \frac{1 + \gamma_1 - i(\gamma_2 - ka)}{1 + \gamma_1 - i(\gamma_2 + ka)} \tag{9.321} $$


From (9.301), the absorption cross section is

$$ \sigma_{\rm abs}^{(0)} = \frac{\pi}{k^2}\left(1 - |w_0|^2\right) = \frac{\pi}{k^2}\left[1 - \frac{(1 + \gamma_1)^2 + (\gamma_2 - ka)^2}{(1 + \gamma_1)^2 + (\gamma_2 + ka)^2}\right] = \frac{4\pi a \gamma_2}{k}\, \frac{1}{(1 + \gamma_1)^2 + (\gamma_2 + ka)^2} \tag{9.322} $$

Note that the absorption cross section goes like the inverse of the
velocity of the incident particles for low incident energy.
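
The step from the amplitude ratio (9.305) to the closed form (9.322) can be checked numerically for a complex interior wave number. This is only a sketch under stated assumptions: the function names, the sample value K = 1.2 + 0.3i (with a = 1), and the value of k are hypothetical.

```python
import cmath
import math

def gamma0_complex(K, a):
    """gamma_0 = K a cot(K a) - 1 for complex K = K1 + i K2, as in (9.320)."""
    return K * a * cmath.cos(K * a) / cmath.sin(K * a) - 1

def sigma_abs_from_w(k, a, gamma0):
    """(pi/k^2)(1 - |w0|^2), with w0 taken from (9.305)."""
    ka = k * a
    w0 = cmath.exp(-2j * ka) * (1 + gamma0 + 1j * ka) / (1 + gamma0 - 1j * ka)
    return math.pi / k**2 * (1 - abs(w0) ** 2)

def sigma_abs_closed(k, a, gamma1, gamma2):
    """Closed form (9.322)."""
    ka = k * a
    return 4 * math.pi * a * gamma2 / k / ((1 + gamma1) ** 2 + (gamma2 + ka) ** 2)

a, k = 1.0, 0.1
K = complex(1.2, 0.3)                    # hypothetical K1 + i K2 with K2 > 0
g0 = gamma0_complex(K, a)
gamma1, gamma2 = g0.real, -g0.imag       # gamma_0 = gamma_1 - i gamma_2
print(gamma1, gamma2)
print(sigma_abs_from_w(k, a, g0), sigma_abs_closed(k, a, gamma1, gamma2))
```

For an absorbing potential (V_2 > 0, hence K_2 > 0) the value of γ_2 obtained this way is positive, which is what makes |w_0| < 1.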


Problem 9-24: Show that, for the potential $V = -(V_1 + iV_2)$,

$$ \gamma_1 = \frac{K_1 a \sin K_1 a \cos K_1 a + K_2 a \sinh K_2 a \cosh K_2 a}{\sin^2 K_1 a \cosh^2 K_2 a + \cos^2 K_1 a \sinh^2 K_2 a} - 1 $$

and

$$ \gamma_2 = \frac{K_1 a \sinh K_2 a \cosh K_2 a - K_2 a \sin K_1 a \cos K_1 a}{\sin^2 K_1 a \cosh^2 K_2 a + \cos^2 K_1 a \sinh^2 K_2 a} $$

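Problem 9-25: Defining a resonance as an energy
$E_r$ such that

$$ 1 + \gamma_1(E_r) = 0 $$

in the neighborhood of the resonance

$$ 1 + \gamma_1(E) \approx (E - E_r)\, \gamma_1'(E_r) $$

Show, then, that

$$ \sigma_{\rm abs}^{(0)} = \frac{4\pi a \gamma_2}{k\, [\gamma_1'(E_r)]^2}\, \frac{1}{(E - E_r)^2 + \tfrac{1}{4}\Gamma^2} $$

where

$$ \Gamma = \frac{2[\gamma_2(E_r) + ka]}{\gamma_1'(E_r)} $$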


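The general formula for the absorption cross section is easily obtained
from (9.305) and (9.301),

$$ \sigma_{\rm abs}^{(0)} = \frac{\pi}{k^2}\, \frac{4\gamma_2 ka}{(1 + \gamma_1)^2 + (\gamma_2 + ka)^2} \tag{9.323} $$

Expanding about the resonance energy $E_r$, for which

$$ 1 + \gamma_1(E_r) = 0 $$

we obtain the formula

$$ \sigma_{\rm abs}^{(0)} = \frac{\pi}{k^2}\, \frac{4\gamma_2 ka}{(E - E_r)^2 [\gamma_1'(E_r)]^2 + (\gamma_2 + ka)^2} \tag{9.324} $$

With the definitions

$$ \Gamma_r = \frac{2\gamma_2(E_r)}{\gamma_1'(E_r)} \tag{9.325} $$

and

$$ \Gamma_s = \frac{2ka}{\gamma_1'(E_r)} \tag{9.326} $$

this becomes

$$ \sigma_{\rm abs}^{(0)} = \frac{\pi}{k^2}\, \frac{\Gamma_r \Gamma_s}{(E - E_r)^2 + \tfrac{1}{4}(\Gamma_r + \Gamma_s)^2} \tag{9.327} $$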


The scattering and total cross sections for the $l = 0$ partial wave are
obtained from

$$ 1 - w_0 = e^{-2ika}\left\{e^{2ika} - \frac{1 + \gamma_1 - i(\gamma_2 - ka)}{1 + \gamma_1 - i(\gamma_2 + ka)}\right\} = e^{-2ika}\left\{(e^{2ika} - 1) - \frac{2ika}{1 + \gamma_1 - i(\gamma_2 + ka)}\right\} \tag{9.328} $$

If the second term in the curly bracket is neglected, the cross
section is that for a “hard sphere” of radius $a$, as shown above. The first
term is therefore referred to as the “potential scattering” term.

If, on the other hand, we neglect the first term, we get, for $1 + \gamma_1 \approx 0$,
a resonant-type scattering. Near a resonance, and for not too strong
absorption, this term should dominate. (Otherwise, there is interference
between the “potential-scattered” wave and the “resonant-scattered”
one.) When the resonance term does dominate, it is clear that

$$ \sigma_{\rm sc}^{(0)} = \frac{\pi}{k^2}\, \frac{4k^2 a^2}{(1 + \gamma_1)^2 + (\gamma_2 + ka)^2} \tag{9.329} $$

Expanding $1 + \gamma_1 = a\, g_0'(a)/g_0(a)$ around the resonance, we then obtain
the result

$$ \sigma_{\rm sc}^{(0)} = \frac{\pi}{k^2}\, \frac{\Gamma_s^2}{(E - E_r)^2 + \tfrac{1}{4}(\Gamma_r + \Gamma_s)^2} \tag{9.330} $$

We note that this approaches a constant value as the energy $\to 0$.


Equations (9.327) and (9.330) may be combined to give the total
resonance cross section

$$ \sigma_{\rm tot}^{(0)} = \frac{\pi}{k^2}\, \frac{\Gamma \Gamma_s}{(E - E_r)^2 + \tfrac{1}{4}\Gamma^2} \tag{9.331} $$

where

$$ \Gamma = \Gamma_r + \Gamma_s = \frac{2[\gamma_2(E_r) + ka]}{\gamma_1'(E_r)} \tag{9.332} $$

These formulas for the resonance cross sections are the well-known
“Breit-Wigner” formulas.
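
The way (9.327) and (9.330) add up to (9.331) can be verified with a few lines of arithmetic. The sketch below assumes hypothetical numerical values for the widths, the resonance energy, and k; the function name `breit_wigner` is likewise only illustrative.

```python
import math

def breit_wigner(E, E_r, gamma_r, gamma_s, k):
    """Return (sigma_abs, sigma_sc, sigma_tot) from (9.327), (9.330), (9.331)."""
    gamma = gamma_r + gamma_s                      # total width, (9.332)
    denom = (E - E_r) ** 2 + 0.25 * gamma**2
    pref = math.pi / k**2
    sigma_abs = pref * gamma_r * gamma_s / denom   # (9.327)
    sigma_sc = pref * gamma_s**2 / denom           # (9.330)
    sigma_tot = pref * gamma * gamma_s / denom     # (9.331)
    return sigma_abs, sigma_sc, sigma_tot

# Hypothetical widths and energies, in arbitrary but consistent units.
ab, sc, tot = breit_wigner(E=1.02, E_r=1.0, gamma_r=0.01, gamma_s=0.03, k=0.5)
print(ab + sc, tot)

# Without absorption (gamma_r = 0) the peak value is 4*pi/k^2, as in (9.316).
print(breit_wigner(1.0, 1.0, 0.0, 0.03, 0.5)[1], 4 * math.pi / 0.5**2)
```

Only products and squares of the widths enter these three formulas, so the check does not depend on the sign convention adopted for the widths.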


Problem 9-26: Prove that the scattering cross sec- 
tion may in general be written 


An. 1 l? ; l(E-—E 
a = vx {sin’ ka Pra E-E) tir sin ka aaa 

The first term is called “potential scattering” and the second “resonance
scattering.” The third term then represents an interference between
them.

Show that interference enhances the cross section at energies below
the resonant energy, but gives rise to a destructive interference above it.
For $ka \ll 1$, $\Gamma \ll E_r$, sketch the behavior of the cross section as a
function of energy over a region of energy around the resonance extending
over a range substantially larger than $\Gamma$ on each side of it.


Problem 9-27: Find the bound states for a $\delta$-function
potential in one dimension [$V = -V_0\, \delta(x)$]. Calculate also, as a function
of energy, the reflection and transmission coefficients for a free
particle of energy $E$ and positive momentum, scattered from the potential.


Problem 9-28: Derive a formula for the differential 
scattering cross section of a particle impinging on a central potential, 
given that there exists only one very weakly bound state for the system. 
Sketch the energy dependence of the cross section. 


PRELUDE TO CHAPTER 10 


This final chapter is concerned with some elementary 
notions of the quantum mechanics of many-body 
problems. We stop short of the Green’s function 
methods which dominate the literature of the subject 
at the present time, but provide the background on the 
basis of which these methods are built. 

Our approach is to consider first the treatment of 
systems of identical noninteracting particles. The 
method of second quantization is first introduced in 
this context; the introduction of interactions then 
represents a separate and distinct step. 

Consider the problem first from the viewpoint of 
Schrodinger wave mechanics. Making the assumption 
of indistinguishability of identical particles, we see 
that states must be either symmetric or antisymmetric 
in exchange of two particles. This leads to the distinc- 
tion between two kinds of particles; when the wave 
function is symmetric, we call the particles bosons; 
when antisymmetric, fermions. 

The second quantization theory of bosons is 
developed as follows: Since, for noninteracting 
particles, states of many-particle systems may be 
specified in terms of individual-particle states, these 
many-particle states may be completely specified by 
designating the occupation of the single-particle states. 
In complete analogy with harmonic oscillator theory, 
individual-particle creation and annihilation operators 
may be defined which, operating on a given state, 
create a system containing either one particle more or 
less. These operators obey the commutation relations 


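$$ [a_\alpha, a_{\alpha'}^\dagger] = \delta_{\alpha\alpha'} $$


$a_\alpha$ being the operator which annihilates a particle in
the single-particle state $|\alpha\rangle$, $a_{\alpha'}^\dagger$ that which creates a
particle in state $|\alpha'\rangle$. Operators


$$ |\psi\rangle = \sum_\alpha a_\alpha |\alpha\rangle, \qquad \langle\psi| = \sum_{\alpha'} a_{\alpha'}^\dagger \langle\alpha'| $$


can be defined, as well as Schrödinger-like operators
$\psi(\mathbf{r})$, $\psi^\dagger(\mathbf{r})$ which in effect annihilate and create a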
particle at a given point, and their commutation 
relations follow. It is then shown that many-particle 
operators may be introduced. If the “free” particles 
are subject to scattering by a potential, an operator 
may be defined which describes the scattering of a 
particle in any state into any other, with an amplitude 
determined by the operator. Scattering is described in 
terms of a product of two operators, one of which 
annihilates a particle in one state, the second of which 
creates a particle in another. 

At this point the generalization to fermions is 
given. It is shown that the antisymmetry condition is
satisfied if the creation and annihilation operators 
satisfy anticommutation, rather than commutation 
rules. The rest of the chapter applies to either bosons 
or fermions. 

Two-particle (interaction) operators are then 
introduced. These describe the annihilation of two 
particles in initial states, and their creation in final 
states, i.e., they describe the mutual scattering of 
particles under interaction. It is in fact shown that 
valid many-particle operators are formed by replacing 
the expression for mean values in Schrödinger wave
functions by $\psi$ operators. The order of these operators
must, of course, be such as to make the resultant 
operator Hermitian. 

The scattering of free particles in momentum 
eigenstates is then considered as an example, and it is 
shown that the formalism ensures conservation of 
momentum provided the interaction is translation 
invariant. 

We next develop, using second quantization, the 
well-known Hartree-Fock method. It is shown that 
this approximation neglects all scatterings except 
those in which two fermi particles are scattered back 
into their original states, or are exchanged, i.e., each 
is scattered into the state originally occupied by the 
other. The basic features of Hartree-Fock theory are 
developed in some detail. 

As a particular result of interest, we show that 
plane waves (momentum eigenstates) are solutions of 




the Hartree-Fock equations for translationally in- 
variant potentials (e.g., Coulomb or Yukawa inter- 
actions). The various contributions to the energy are 
calculated. 

For such a “Fermi gas” the dependence of kinetic
and exchange energies on density is calculated (the 
direct interaction or “Coulomb” energy does not 
depend on density). 

Since the kinetic energy per particle is positive 
and increases with increasing density in such a way 
that it dominates at high density while the exchange 
energy which operates between particles of the same 
spin, is negative and decreases with increasing density 
in such a way that it dominates at low density, we ask 
whether at sufficiently low densities the spins of all 
particles would not tend all to align, creating spontaneously
a ferromagnetic state. It is found that this is
true for sufficiently small densities, at least slightly
smaller, however, than those existing in any good
metal.

As an introduction to the development of density 
matrix theory we then calculate the number of excited 
states of a many-body system per unit energy interval. 

The theory of statistical operators (‘‘density 
matrices’’) is developed, permitting us to calculate 
ensemble averages of dynamical variables which are 
macroscopically indistinguishable though microscopi- 
cally different. The statistical operator then gives 
complete information about macroscopic systems. Its 
dynamics (time variation) is determined. 

The statistical operator is determined for the 
canonical ensemble (for which the number of particles 
is fixed) in thermal equilibrium with its surroundings, 
and then for the grand canonical ensemble, for which 
only the average number of particles is fixed. 

As an illustration of the preceding theory, the 
ensemble average of the number of particles in an 
energy state is calculated for both bosons and fermions. 
These are the familiar Bose-Einstein and Fermi-Dirac 
distribution functions. 

The linear response theory (originally due to 
Kubo) is then developed. This is a theory of response, 
defined by the perturbation of the density matrix in the 




linear (first-order perturbation) approximation, to an 
arbitrary external stimulus. This result has a wide 
range of applicability; of particular interest is the 
response of an electron system to an electromagnetic 
perturbation. In particular, we have calculated the 
transverse conductivity of an electron gas. 

Finally, we apply the above result to the problem 
of propagation of an electromagnetic wave in an 
electron gas —a problem already discussed in simplified 
form in Chapter 5. The dispersion relation for the 
electromagnetic wave is determined (by what is 
essentially a time-dependent Hartree-Fock approxi- 
mation), and it is shown that such waves may be 
propagated in “free-electron” metals at frequencies 
above the plasma frequency. 




10


QUANTUM 
MECHANICS OF 
MANY-BODY 
PROBLEMS 


“Don't argue about the difficulties. The difficulties
will argue for themselves.”


Winston Churchill 


1. Introduction 


In this chapter we develop the bases of the theory of many-body
systems, using the so-called “method of second quantization.”

To develop an understanding of the technique, it is best to consider
first a system of noninteracting particles. In such a case we know that the
Hamiltonian for an N-body system has the form

$$ H = \sum_{n=1}^{N} H_n \tag{10.1} $$


where $H_n$ is the Hamiltonian of the $n$th particle. If we consider identical
particles, the Hamiltonians all have the same form, but are functions of
the coordinates $\mathbf{r}_n$ and momenta $\mathbf{p}_n$ of the individual particles. The wave
function is necessarily constructed from products of single-particle states.
If

$$ H_n = H^{(1)}(\mathbf{r}_n, \mathbf{p}_n) \tag{10.2} $$

in the Schrödinger scheme there will be solutions of the form

$$ \Psi = \psi_{\alpha_1}(\mathbf{r}_1)\, \psi_{\alpha_2}(\mathbf{r}_2) \cdots \psi_{\alpha_N}(\mathbf{r}_N) \tag{10.3} $$

For if we substitute this into

$$ H\Psi = E\Psi \tag{10.4} $$

and divide through by $\Psi$, we get

$$ \sum_n \frac{H^{(1)}(\mathbf{r}_n, \mathbf{p}_n)\, \psi_{\alpha_n}}{\psi_{\alpha_n}} = E \tag{10.5} $$

Each term on the left-hand side must be a constant, so the $\psi_{\alpha_n}$'s satisfy

$$ H^{(1)} \psi_{\alpha_n} = E_{\alpha_n} \psi_{\alpha_n} \tag{10.6} $$

and the total energy is

$$ E = \sum_n E_{\alpha_n} \tag{10.7} $$

Let the eigenstates of the one-particle Hamiltonian be designated by
$|\alpha\rangle$ [Schrödinger wave function $\langle\mathbf{r}|\alpha\rangle$] and the energies by $E_\alpha$. The
single-particle states and energies above are then chosen from this set.

It should be noted that in (10.3) equally valid solutions of the same
energy are obtained if the $\alpha_n$'s are permuted among themselves in any
way. Alternatively, the $\alpha_n$'s may be left as they are and the $\mathbf{r}_n$'s permuted
to get the same solution. The most general solution at the energy is an
arbitrary linear combination of these various permutations.

However, the choice may be limited if we make the hypothesis of
indistinguishability of identical particles. This hypothesis implies that, if
any two particles are interchanged in the wave function, the resulting
distribution function $|\Psi|^2$ will be unaltered. Thus, in general it would be
necessary that

$$ \Psi(\ldots \mathbf{r}_i \ldots \mathbf{r}_j \ldots) = e^{i\gamma}\, \Psi(\ldots \mathbf{r}_j \ldots \mathbf{r}_i \ldots) \tag{10.8} $$

If the phase $\gamma$ applies to all interchanges, to permute the $i$th and $j$th
particles a second time leads to the original wave function; therefore

$$ e^{2i\gamma} = 1 \tag{10.9} $$

so that $\gamma = 0$ or $\pi$. Thus, interchanging the states occupied by two particles
either leaves the wave function unaltered or changes its sign. The
second implies that two particles cannot be in the same state, since then
the wave function would equal its negative.

There appear to be two different kinds of particles in nature: those
for which $\gamma = 0$ being known as bosons and those for which $\gamma = \pi$ as
fermions.

We now describe our many-particle system in a different way, by
simply specifying the occupancy of the single-particle levels $|\alpha\rangle$, that is, by
specifying the choice of $\alpha_1, \alpha_2, \ldots, \alpha_N$ from among these levels. A
many-particle state of noninteracting particles may then be specified by the
state vector

$$ |n_1 n_2 \ldots n_\alpha \ldots\rangle $$

For an N-particle system,

$$ \sum_\alpha n_\alpha = N \tag{10.10} $$


A formalism may now be developed in which the filling of single-particle
levels is treated analogously to the adding of quanta to an oscillator,
as discussed in Chapter 9. (Note then that each single-particle state
has its oscillator analog.) We thus introduce for each state $|\alpha\rangle$, in analogy
with (9.30), creation operators $a_\alpha^\dagger$ with the property

$$ a_\alpha^\dagger\, |\ldots n_\alpha \ldots\rangle = \sqrt{n_\alpha + 1}\, |\ldots (n_\alpha + 1) \ldots\rangle \tag{10.11} $$

and annihilation operators $a_\alpha$ such that

$$ a_\alpha\, |\ldots n_\alpha \ldots\rangle = \sqrt{n_\alpha}\, |\ldots (n_\alpha - 1) \ldots\rangle \tag{10.12} $$
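Two results are worth noting:

(a)
$$ a_\alpha a_\alpha^\dagger - a_\alpha^\dagger a_\alpha = 1 \tag{10.13} $$

For

$$ a_\alpha a_\alpha^\dagger\, |n_\alpha\rangle = a_\alpha \sqrt{n_\alpha + 1}\, |n_\alpha + 1\rangle = (n_\alpha + 1)\, |n_\alpha\rangle \tag{10.14} $$

and

$$ a_\alpha^\dagger a_\alpha\, |n_\alpha\rangle = a_\alpha^\dagger \sqrt{n_\alpha}\, |n_\alpha - 1\rangle = n_\alpha\, |n_\alpha\rangle \tag{10.15} $$

Subtracting, we obtain (10.13). (In the derivation, we have ignored the
occupation of the other states, which is irrelevant.) Writing

$$ [a_\alpha, a_\alpha^\dagger] = a_\alpha a_\alpha^\dagger - a_\alpha^\dagger a_\alpha = 1 $$

and noting that

$$ [a_\alpha, a_{\alpha'}^\dagger] = 0, \qquad \alpha' \neq \alpha $$

we obtain commutation relations

$$ [a_\alpha, a_{\alpha'}^\dagger] = \delta_{\alpha\alpha'} \tag{10.16} $$

It is also obvious that

$$ [a_\alpha^\dagger, a_{\alpha'}^\dagger] = 0 \tag{10.17} $$

and

$$ [a_\alpha, a_{\alpha'}] = 0 \tag{10.18} $$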


(b) $a_\alpha^\dagger a_\alpha\, |n_\alpha\rangle = n_\alpha\, |n_\alpha\rangle$, as obtained in (10.15) above. Thus the
states $|n_\alpha\rangle$ are eigenstates of $a_\alpha^\dagger a_\alpha$ with eigenvalue $n_\alpha$. We therefore call
$a_\alpha^\dagger a_\alpha$ the “number operator”; its eigenvalue is the number of particles in
the state $\alpha$.
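
Results (a) and (b) can be made concrete by writing the operators for a single level as matrices in the basis |0>, |1>, ..., |n_max>. The following is a minimal sketch, not part of the original text; the truncation at `n_max` and the helper names are assumptions made for illustration. Because the basis is cut off, the commutator equals 1 on every state except the highest one, where the truncation shows up as a spurious entry.

```python
import math

def annihilation_matrix(n_max):
    """Matrix elements <m|a|n> in the truncated basis |0>, ..., |n_max>."""
    dim = n_max + 1
    a = [[0.0] * dim for _ in range(dim)]
    for n in range(1, dim):
        a[n - 1][n] = math.sqrt(n)          # a|n> = sqrt(n)|n-1>, cf. (10.12)
    return a

def dagger(m):
    """Hermitian conjugate; the matrix is real, so this is the transpose."""
    return [list(col) for col in zip(*m)]

def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(len(y)))
             for j in range(len(y[0]))] for i in range(len(x))]

n_max = 5                                    # truncation level (assumed)
a = annihilation_matrix(n_max)
a_dag = dagger(a)
number = matmul(a_dag, a)                    # number operator a† a
aa_dag = matmul(a, a_dag)
commutator = [[aa_dag[i][j] - number[i][j] for j in range(n_max + 1)]
              for i in range(n_max + 1)]

print([round(number[n][n], 12) for n in range(n_max + 1)])       # diagonal of a† a
print([round(commutator[n][n], 12) for n in range(n_max + 1)])   # diagonal of [a, a†]
```

The diagonal of the number operator reproduces the occupation numbers, in line with (10.15), and the commutator is the unit matrix apart from the last diagonal entry, which is an artifact of the cutoff and has no counterpart in the untruncated theory.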
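Let us now define operators

$$ |\psi\rangle = \sum_\alpha a_\alpha |\alpha\rangle \tag{10.19} $$

and

$$ \langle\psi| = \sum_\alpha \langle\alpha|\, a_\alpha^\dagger \tag{10.20} $$

In terms of Schrödinger wave functions

$$ \psi(\mathbf{r}) = \sum_\alpha a_\alpha \phi_\alpha(\mathbf{r}) \tag{10.21} $$

and

$$ \psi^\dagger(\mathbf{r}) = \sum_\alpha a_\alpha^\dagger \phi_\alpha^*(\mathbf{r}) \tag{10.22} $$

where

$$ \psi(\mathbf{r}) = \langle\mathbf{r}|\psi\rangle \tag{10.23} $$

and

$$ \phi_\alpha(\mathbf{r}) = \langle\mathbf{r}|\alpha\rangle \tag{10.24} $$

Consider the operator

$$ \langle\psi|\psi\rangle = \int \psi^\dagger(\mathbf{r})\, \psi(\mathbf{r})\, d^3r \tag{10.25} $$

This may be expanded to give

$$ \sum_{\alpha\alpha'} \langle\alpha'|\alpha\rangle\, a_{\alpha'}^\dagger a_\alpha = \sum_\alpha a_\alpha^\dagger a_\alpha = \sum_\alpha n_\alpha \tag{10.26} $$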

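so it is the operator for total number of particles. The operator for the
total energy of the system is

$$ \langle\psi|H^{(1)}|\psi\rangle = \int \psi^\dagger(\mathbf{r})\, H^{(1)}\, \psi(\mathbf{r})\, d^3r \tag{10.27} $$

where $H^{(1)}$ is the one-particle operator on the single-particle states. For,
expanding the $|\psi\rangle$ and $\langle\psi|$,

$$ \langle\psi|H^{(1)}|\psi\rangle = \sum_{\alpha\alpha'} a_{\alpha'}^\dagger a_\alpha \langle\alpha'|H^{(1)}|\alpha\rangle = \sum_{\alpha\alpha'} a_{\alpha'}^\dagger a_\alpha E_\alpha \langle\alpha'|\alpha\rangle = \sum_\alpha a_\alpha^\dagger a_\alpha E_\alpha = \sum_\alpha n_\alpha E_\alpha \tag{10.28} $$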

from which the stated result follows. 
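If the particles are subject to a perturbing potential $V(\mathbf{r})$ the
second-quantized operator is, similarly,

$$ \langle\psi|V|\psi\rangle = \sum_{\alpha\alpha'} a_{\alpha'}^\dagger a_\alpha \int \phi_{\alpha'}^*(\mathbf{r})\, V(\mathbf{r})\, \phi_\alpha(\mathbf{r})\, d^3r = \sum_{\alpha\alpha'} a_{\alpha'}^\dagger a_\alpha \langle\alpha'|V|\alpha\rangle \tag{10.29} $$

This is clearly correct, since the potential causes transitions (scatterings)
from state $|\alpha\rangle$ to states $|\alpha'\rangle$, with amplitude $\langle\alpha'|V|\alpha\rangle$.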

The results obtained heretofore obviously apply to particles any number
of which may occupy a given state, that is, bosons. The amendment
necessary to make the theory applicable to fermions was first shown by
Jordan and Wigner to be the replacement of the commutators in (10.16)-
(10.18) by anticommutators

$$ [\xi, \eta]_+ = \xi\eta + \eta\xi \tag{10.30} $$

that is, to assume the relations

$$ [c_\alpha, c_{\alpha'}^\dagger]_+ = \delta_{\alpha\alpha'} \tag{10.31} $$

$$ [c_\alpha, c_{\alpha'}]_+ = [c_\alpha^\dagger, c_{\alpha'}^\dagger]_+ = 0 \tag{10.32} $$

Let us verify that the operators satisfying these conditions produce states 
with the desired properties. 


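(i) We note first that, since $c_\alpha^\dagger c_\alpha^\dagger = 0$, it is not possible to put two
particles into any state; nor, since $c_\alpha c_\alpha = 0$, is it possible to annihilate
two from a state.


(ii) The eigenvalues of $n_\alpha = c_\alpha^\dagger c_\alpha$ may be derived as follows:

$$ n_\alpha^2 = c_\alpha^\dagger c_\alpha c_\alpha^\dagger c_\alpha = c_\alpha^\dagger (1 - c_\alpha^\dagger c_\alpha)\, c_\alpha = c_\alpha^\dagger c_\alpha = n_\alpha \tag{10.33} $$

Therefore the eigenvalues of the number operator are $n_\alpha = 0$ or $n_\alpha = 1$.
This is the statement, for noninteracting particles, of the Pauli exclusion
principle, which says that at most one fermion can be put in a given state.

(iii) The effect of $c_\alpha$, $c_\alpha^\dagger$ on a state may be calculated as follows:
Let

$$ c_\alpha^\dagger |0\rangle = A_\alpha |1\rangle \tag{10.34} $$

where $A_\alpha$ is a constant which may be taken to be real. Taking the magnitude
squared of (10.34)

$$ \langle 0| c_\alpha c_\alpha^\dagger |0\rangle = A_\alpha^2 $$

But $c_\alpha c_\alpha^\dagger = 1 - n_\alpha$; therefore,

$$ A_\alpha^2 = 1, \qquad A_\alpha = \pm 1 $$

By a similar argument

$$ c_\alpha |1\rangle = B_\alpha |0\rangle $$

where

$$ B_\alpha^2 = 1 $$

It is essential to note that the sign to be chosen depends not only on
the state in question, but on other states. For

$$ c_{\alpha'}^\dagger c_\alpha^\dagger\, |n_\alpha = 0, n_{\alpha'} = 0\rangle = c_{\alpha'}^\dagger c_\alpha^\dagger\, |0_\alpha 0_{\alpha'}\rangle = -c_\alpha^\dagger c_{\alpha'}^\dagger\, |0_\alpha 0_{\alpha'}\rangle $$

and

$$ c_{\alpha'} c_\alpha\, |1_\alpha 1_{\alpha'}\rangle = -c_\alpha c_{\alpha'}\, |1_\alpha 1_{\alpha'}\rangle $$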


These conditions can be met in the following way: Arrange the states
$|\alpha\rangle$ in order (say, the order of increasing energy). Then if the operators
are applied in this order, take the constant to be 1. If they are applied in
any other order, the constant will then be $(-1)^p$, where $p$ is the number
of permutations required to get from the original order to the one in
question.
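
One way to realize this sign rule concretely is sketched below. It is an illustration under stated assumptions rather than the text's own construction: a state is stored as a mapping from occupation tuples to amplitudes, and the sign attached to acting on level α is taken as (-1) raised to the number of occupied levels that come after α in the chosen ordering. With this choice, filling the levels in the standard order gives the constant +1. The function names (`create`, `annihilate`, `combine`) are hypothetical.

```python
def _apply(alpha, vec, creating):
    """Apply c_alpha† (creating=True) or c_alpha (creating=False) to vec."""
    out = {}
    filled = 1 if creating else 0
    for occ, amp in vec.items():
        if occ[alpha] == filled:
            continue                          # c†|1> = 0 and c|0> = 0
        sign = (-1) ** sum(occ[alpha + 1:])   # occupied levels later in the ordering
        new = occ[:alpha] + (filled,) + occ[alpha + 1:]
        out[new] = out.get(new, 0) + sign * amp
    return out

def create(alpha, vec):
    return _apply(alpha, vec, True)

def annihilate(alpha, vec):
    return _apply(alpha, vec, False)

def combine(u, v):
    """Return the sum u + v, dropping components with zero amplitude."""
    out = dict(u)
    for occ, amp in v.items():
        out[occ] = out.get(occ, 0) + amp
    return {occ: amp for occ, amp in out.items() if amp != 0}

vacuum = {(0, 0, 0): 1}                         # three single-particle levels, all empty
in_order = create(1, create(0, vacuum))         # level 0 filled first, then level 1
reversed_order = create(0, create(1, vacuum))   # the opposite order of application
print(in_order, reversed_order)
```

The two results differ only by an overall sign, so their sum vanishes, which is the anticommutation of two creation operators in (10.32); acting twice with the same creation operator returns an empty mapping, which expresses the exclusion principle of (i).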


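2. Commutation Rules for the
$\psi$ Operators


Once the commutation rules for the operators $c_\alpha$, $c_\alpha^\dagger$ are known,
those for $\psi(\mathbf{r})$, $\psi^\dagger(\mathbf{r})$ as defined by (10.21) and (10.22) follow. Since $\psi$,
$\psi^\dagger$ can be defined for either bosons or fermions, commutation rules may
be calculated for either case. Consider, for instance,

$$ [\psi(\mathbf{r}), \psi^\dagger(\mathbf{r}')]_\eta $$

where $\eta = \mp$ according as we are dealing with bosons (−) or fermions (+).
Expanding the $\psi$'s,¹

$$ [\psi(\mathbf{r}), \psi^\dagger(\mathbf{r}')]_\eta = \sum_{\alpha\alpha'} \phi_\alpha(\mathbf{r})\, \phi_{\alpha'}^*(\mathbf{r}')\, [a_\alpha, a_{\alpha'}^\dagger]_\eta = \sum_\alpha \phi_\alpha(\mathbf{r})\, \phi_\alpha^*(\mathbf{r}') = \sum_\alpha \langle\mathbf{r}|\alpha\rangle \langle\alpha|\mathbf{r}'\rangle = \langle\mathbf{r}|\mathbf{r}'\rangle = \delta(\mathbf{r} - \mathbf{r}') \tag{10.35} $$

¹We use $a$, $a^\dagger$ to represent either boson or fermion operators, and $c$, $c^\dagger$ specifically for
fermions.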


It is also easily verified that

$$ [\psi(\mathbf{r}), \psi(\mathbf{r}')]_\eta = 0 \tag{10.36} $$

and

$$ [\psi^\dagger(\mathbf{r}), \psi^\dagger(\mathbf{r}')]_\eta = 0 \tag{10.37} $$

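The symmetry or antisymmetry of wave functions (according as the
particles are bosons or fermions) is easily seen by putting two particles, at $\mathbf{r}$ and
$\mathbf{r}'$, into an initial state with no particles present:

$$ \psi^\dagger(\mathbf{r}')\, \psi^\dagger(\mathbf{r})\, |00\ldots\rangle = \sum_{\alpha\alpha'} \phi_{\alpha'}^*(\mathbf{r}')\, \phi_\alpha^*(\mathbf{r})\, a_{\alpha'}^\dagger a_\alpha^\dagger\, |0_\alpha 0_{\alpha'}\rangle $$

Since $\alpha$, $\alpha'$ are dummy indices, we may also interchange them to get

$$ \psi^\dagger(\mathbf{r}')\, \psi^\dagger(\mathbf{r})\, |00\ldots\rangle = \sum_{\alpha\alpha'} \phi_\alpha^*(\mathbf{r}')\, \phi_{\alpha'}^*(\mathbf{r})\, a_\alpha^\dagger a_{\alpha'}^\dagger\, |0_\alpha 0_{\alpha'}\rangle = -\eta \sum_{\alpha\alpha'} \phi_\alpha^*(\mathbf{r}')\, \phi_{\alpha'}^*(\mathbf{r})\, a_{\alpha'}^\dagger a_\alpha^\dagger\, |0_\alpha 0_{\alpha'}\rangle $$

Therefore,

$$ \psi^\dagger(\mathbf{r}')\, \psi^\dagger(\mathbf{r})\, |00\ldots 0\rangle = \tfrac{1}{2} \sum_{\alpha\alpha'} \left\{\phi_{\alpha'}^*(\mathbf{r}')\, \phi_\alpha^*(\mathbf{r}) - \eta\, \phi_\alpha^*(\mathbf{r}')\, \phi_{\alpha'}^*(\mathbf{r})\right\} a_{\alpha'}^\dagger a_\alpha^\dagger\, |0_\alpha 0_{\alpha'}\rangle \tag{10.38} $$

from which it follows that the wave function of the two particles is
antisymmetric for fermions and symmetric for bosons.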


3. Representation of Two-Particle 
Operators 


The preceding discussion has been confined to systems of noninteracting
particles. The question now arises: how do we handle the complications
introduced by interactions of the particles? We confine our attention
to two-particle interactions, which seem to be most common in familiar
physical systems.

We know that, for any but the simplest potentials, it is not possible 
to find exactly even the state of a two-particle system, while the states of 
three-particle systems are almost always difficult to determine. We cannot, 
therefore, expect simple analytic solutions for many-body problems, but 
will have to develop approximate methods. 

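Let us consider a system of identical particles in which all pairs
interact with a potential $v(\mathbf{r}, \mathbf{r}')$ when the particles of the pair are at
positions $\mathbf{r}$ and $\mathbf{r}'$. Our viewpoint is that we can still use single-particle
states as a basis, and consider the interactions as producing transitions
or scatterings between these states. If the particles are initially in states
$|\alpha\rangle$ and $|\alpha'\rangle$ [with wave functions $\phi_\alpha(\mathbf{r})$, $\phi_{\alpha'}(\mathbf{r}')$],

$$ v(\mathbf{r}, \mathbf{r}')\, \phi_\alpha(\mathbf{r})\, \phi_{\alpha'}(\mathbf{r}') = \sum_{\beta\beta'} \phi_\beta(\mathbf{r})\, \phi_{\beta'}(\mathbf{r}') \int \phi_\beta^*(\mathbf{r})\, \phi_{\beta'}^*(\mathbf{r}')\, v(\mathbf{r}, \mathbf{r}')\, \phi_\alpha(\mathbf{r})\, \phi_{\alpha'}(\mathbf{r}')\, d^3r\, d^3r' \tag{10.39} $$

Thus, the potential operating on states $|\alpha\rangle$, $|\alpha'\rangle$ produces pairs of states
$|\beta\rangle$, $|\beta'\rangle$ with amplitudes

$$ \langle\beta'\beta|v|\alpha\alpha'\rangle = \int \phi_{\beta'}^*(\mathbf{r}')\, \phi_\beta^*(\mathbf{r})\, v(\mathbf{r}, \mathbf{r}')\, \phi_\alpha(\mathbf{r})\, \phi_{\alpha'}(\mathbf{r}')\, d^3r\, d^3r' \tag{10.40} $$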


We now show that the appropriate “second-quantized” operator for
the potential in question is

$$ \tfrac{1}{2} \int \psi^\dagger(\mathbf{r}')\, \psi^\dagger(\mathbf{r})\, v(\mathbf{r}, \mathbf{r}')\, \psi(\mathbf{r})\, \psi(\mathbf{r}')\, d^3r\, d^3r' \tag{10.41} $$

Because of the commutation (or anticommutation) rules for the $\psi$ and
$\psi^\dagger$ operators, the order of the factors in the integrand of (10.41) is
important. It is easily seen that the operator (10.41) is Hermitian; if
$\psi(\mathbf{r})$ and $\psi(\mathbf{r}')$ were interchanged it would not be. The structure of the
operator follows the same pattern as for single-particle operators. We
write down the formal expression for the mean value of $v$, and then
interpret the “wave functions” $\psi$, $\psi^\dagger$ as operators.

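If we expand the $\psi$'s and $\psi^\dagger$'s in (10.41) in terms of the single-particle
wave functions $\phi_\alpha$ and their conjugates, it becomes

$$ \tfrac{1}{2} \sum a_{\beta'}^\dagger a_\beta^\dagger a_\alpha a_{\alpha'} \int \phi_{\beta'}^*(\mathbf{r}')\, \phi_\beta^*(\mathbf{r})\, v(\mathbf{r}, \mathbf{r}')\, \phi_\alpha(\mathbf{r})\, \phi_{\alpha'}(\mathbf{r}')\, d^3r\, d^3r' = \tfrac{1}{2} \sum \langle\beta'\beta|v|\alpha\alpha'\rangle\, a_{\beta'}^\dagger a_\beta^\dagger a_\alpha a_{\alpha'} \tag{10.42} $$

This describes the annihilation of states $|\alpha\rangle$ and $|\alpha'\rangle$ and the creation of
states $|\beta\rangle$ and $|\beta'\rangle$, with the amplitude $\langle\beta'\beta|v|\alpha\alpha'\rangle$, precisely as in
(10.40). The factor of $\tfrac{1}{2}$ allows for the fact that every scattering appears
twice in the sum; a given term and that in which both $\beta, \beta'$ and $\alpha, \alpha'$ are
interchanged are identical. The operator (10.41) then seems to be the
correct one. Note, however, that it takes care automatically of the proper
symmetry of the states under exchange of coordinates of arbitrary pairs
of particles.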

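A particular example of the preceding discussion is the case of free
particles interacting through a potential $v(\mathbf{r} - \mathbf{r}')$ which depends only on
the relative position of the particles. Since this potential is invariant
under a translation of the coordinate system, we expect the interaction
to conserve momentum, and we see that this does indeed follow from the
formalism. If the particles are free they are individually in eigenstates
of momentum $\mathbf{p} = \hbar\mathbf{k}$, so we designate the states $|\alpha\rangle$ in terms of the
wave-number eigenvalue $\mathbf{k}$. The second-quantized Hamiltonian operator
consists of the free-particle Hamiltonian and the interaction term. The
first is

$$ \int \psi^\dagger(\mathbf{r})\, H^{(1)}\, \psi(\mathbf{r})\, d^3r = \sum_{\mathbf{k}\mathbf{k}'} a_{\mathbf{k}'}^\dagger a_{\mathbf{k}}\, \frac{1}{\Omega} \int e^{-i\mathbf{k}'\cdot\mathbf{r}} \left(-\frac{\hbar^2}{2m}\nabla^2\right) e^{i\mathbf{k}\cdot\mathbf{r}}\, d^3r = \sum_{\mathbf{k}} \epsilon_{\mathbf{k}}\, a_{\mathbf{k}}^\dagger a_{\mathbf{k}} \tag{10.43} $$

where

$$ \epsilon_{\mathbf{k}} = \frac{\hbar^2 k^2}{2m} \tag{10.44} $$

(The free-particle wave functions were assumed to be normalized in
volume $\Omega$.) The interaction term is

$$ \tfrac{1}{2} \int \psi^\dagger(\mathbf{r}')\, \psi^\dagger(\mathbf{r})\, v(\mathbf{r} - \mathbf{r}')\, \psi(\mathbf{r})\, \psi(\mathbf{r}')\, d^3r\, d^3r' = \frac{1}{2\Omega^2} \sum_{\mathbf{k}_1 \mathbf{k}_2 \mathbf{k}_3 \mathbf{k}_4} a_{\mathbf{k}_4}^\dagger a_{\mathbf{k}_3}^\dagger a_{\mathbf{k}_2} a_{\mathbf{k}_1} \int e^{-i\mathbf{k}_4\cdot\mathbf{r}'} e^{-i\mathbf{k}_3\cdot\mathbf{r}}\, v(\mathbf{r} - \mathbf{r}')\, e^{i\mathbf{k}_2\cdot\mathbf{r}} e^{i\mathbf{k}_1\cdot\mathbf{r}'}\, d^3r\, d^3r' \tag{10.45} $$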

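This may be simplified by introducing relative and center-of-mass
coordinates such that

$$ \boldsymbol{\rho} = \mathbf{r}' - \mathbf{r}, \qquad \mathbf{R} = \tfrac{1}{2}(\mathbf{r} + \mathbf{r}') \tag{10.46} $$

Then

$$ \mathbf{r} = \mathbf{R} - \tfrac{1}{2}\boldsymbol{\rho}, \qquad \mathbf{r}' = \mathbf{R} + \tfrac{1}{2}\boldsymbol{\rho} \tag{10.47} $$

Substituting these in (10.45) the interaction operator becomes

$$ \frac{1}{2\Omega^2} \sum_{\mathbf{k}_1 \mathbf{k}_2 \mathbf{k}_3 \mathbf{k}_4} a_{\mathbf{k}_4}^\dagger a_{\mathbf{k}_3}^\dagger a_{\mathbf{k}_2} a_{\mathbf{k}_1} \int e^{-i\mathbf{k}_4\cdot(\mathbf{R} + \frac{1}{2}\boldsymbol{\rho})}\, e^{-i\mathbf{k}_3\cdot(\mathbf{R} - \frac{1}{2}\boldsymbol{\rho})}\, v(\boldsymbol{\rho})\, e^{i\mathbf{k}_2\cdot(\mathbf{R} - \frac{1}{2}\boldsymbol{\rho})}\, e^{i\mathbf{k}_1\cdot(\mathbf{R} + \frac{1}{2}\boldsymbol{\rho})}\, d^3\rho\, d^3R \tag{10.48} $$

If we do the $\mathbf{R}$ integral we get $\Omega\, \delta_{\mathbf{k}_1 + \mathbf{k}_2,\, \mathbf{k}_3 + \mathbf{k}_4}$, which ensures conservation
of momentum. But let us now introduce

$$ \mathbf{k}_3 = \mathbf{k}_2 + \mathbf{q} \tag{10.49} $$

and sum over $\mathbf{q}$ rather than $\mathbf{k}_3$. Also, since

$$ \mathbf{k}_1 + \mathbf{k}_2 = \mathbf{k}_3 + \mathbf{k}_4 $$

it follows that

$$ \mathbf{k}_4 = \mathbf{k}_1 - \mathbf{q} \tag{10.50} $$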




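The coefficient of $\boldsymbol\rho$ in the exponent of the integrand is then

$$\tfrac12[-\mathbf k_1+\mathbf q+\mathbf k_2+\mathbf q-\mathbf k_2+\mathbf k_1]=\mathbf q$$

Therefore, we obtain for the interaction operator

$$\tfrac12\int\psi^\dagger(\mathbf r')\psi^\dagger(\mathbf r)\,v(\mathbf r-\mathbf r')\,\psi(\mathbf r)\psi(\mathbf r')\,d^3r\,d^3r' = \tfrac12\sum_{\mathbf k_1,\mathbf k_2,\mathbf q}v_{\mathbf q}\,a^\dagger_{\mathbf k_1-\mathbf q}a^\dagger_{\mathbf k_2+\mathbf q}a_{\mathbf k_2}a_{\mathbf k_1} \tag{10.51}$$

where

$$v_{\mathbf q}=\frac1\Omega\int e^{i\mathbf q\cdot\boldsymbol\rho}\,v(\boldsymbol\rho)\,d^3\rho \tag{10.52}$$

Equation (10.51) describes a transfer between particles of an amount of momentum $\hbar\mathbf q$, with amplitude $v_{\mathbf q}$, which is the $\mathbf q$th Fourier component of the interaction considered as a function of the separation of the particles.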
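The momentum-conserving Kronecker delta produced by the $\mathbf R$ integration can be illustrated numerically. The following is only a sketch under simplifying assumptions that are not in the text: a one-dimensional periodic box of length `length` (so that the allowed wave numbers are $2\pi n/\text{length}$) and a plain Riemann sum in place of the integral. The function name `r_integral` is hypothetical.

```python
import cmath
import math


def r_integral(n1, n2, n3, n4, length=1.0, points=64):
    """Riemann sum of exp(i (k1 + k2 - k3 - k4) R) over one period.

    Wave numbers are k = 2*pi*n/length (1-D periodic box, an assumption
    made only for this illustration).
    """
    dk = 2 * math.pi * (n1 + n2 - n3 - n4) / length
    step = length / points
    return sum(cmath.exp(1j * dk * j * step) for j in range(points)) * step


# k1 + k2 == k3 + k4: the integral equals the "volume" (here the length).
conserving = r_integral(1, 2, 0, 3)
# k1 + k2 != k3 + k4: the integral vanishes.
violating = r_integral(1, 2, 0, 4)
```

Only the momentum-conserving combination survives, which is why the four-fold sum in (10.45) collapses to the three-fold sum in (10.51).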

It is important to note that a term like (10.51) describes multiple 
scatterings of all orders between particles. This is, of course, the reason 
why interaction problems are in general difficult. If the interaction energy 
is sufficiently small, the higher-order processes may be negligible. If it 
is large, no simple approximation method is available. If it is attractive 
and large, bound states may occur, necessitating an extension of the theory 
beyond the scattering formalism. 


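Problem 10-1: It is sometimes said that a pair of fermions acts like a boson. To see the extent to which this is true, define pair creation and annihilation operators

$$a^\dagger_{\mathbf k\mathbf k'}=c^\dagger_{\mathbf k}c^\dagger_{\mathbf k'},\qquad a_{\mathbf k\mathbf k'}=c_{\mathbf k'}c_{\mathbf k}$$

where the $c$'s are fermion operators and $\mathbf k\neq\mathbf k'$. Verify the commutation relations

$$[a_{\mathbf k\mathbf k'},a^\dagger_{\mathbf k\mathbf k'}]=1-n_{\mathbf k}-n_{\mathbf k'}$$

$$[a_{\mathbf k\mathbf k'},a^\dagger_{\mathbf k''\mathbf k'''}]=0\quad\text{when }(\mathbf k,\mathbf k')\neq(\mathbf k'',\mathbf k''')$$

Thus, our operators behave like boson operators only when they operate on the empty state. Otherwise, show that

$$[a_{\mathbf k\mathbf k'},(1-n_{\mathbf k}-n_{\mathbf k'})]=-2a_{\mathbf k\mathbf k'}$$

$$[a^\dagger_{\mathbf k\mathbf k'},(1-n_{\mathbf k}-n_{\mathbf k'})]=2a^\dagger_{\mathbf k\mathbf k'}$$

and hence show that $a_{\mathbf k\mathbf k'}$, $a^\dagger_{\mathbf k\mathbf k'}$, $(1-n_{\mathbf k}-n_{\mathbf k'})$ obey the same commutation rules as the spin operators

$$\sigma_\pm=\tfrac12(\sigma_x\pm i\sigma_y)\quad\text{and}\quad\sigma_z$$

(This result has been used by P. W. Anderson in the theory of superconductivity.)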
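The relations quoted in Problem 10-1 for a single pair can be spot-checked with explicit matrices. The sketch below is an illustration, not a proof: it assumes a two-mode Fock space and a Jordan-Wigner matrix representation of the fermion operators (a choice made here, not taken from the text), and all helper names are hypothetical.

```python
def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def kron(a, b):
    """Kronecker product, used to build two-mode operators."""
    return [[a[i][j] * b[k][l]
             for j in range(len(a[0])) for l in range(len(b[0]))]
            for i in range(len(a)) for k in range(len(b))]


def transpose(a):
    """Transpose; equals the Hermitian conjugate for these real matrices."""
    return [list(row) for row in zip(*a)]


def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def scale(s, a):
    return [[s * x for x in row] for row in a]


def commutator(a, b):
    return sub(matmul(a, b), matmul(b, a))


LOWER = [[0, 1], [0, 0]]   # one-mode annihilator in the basis (|0>, |1>)
Z = [[1, 0], [0, -1]]      # Jordan-Wigner sign string
ONE2 = [[1, 0], [0, 1]]

c_k = kron(LOWER, ONE2)    # annihilates mode k
c_kp = kron(Z, LOWER)      # annihilates mode k'; anticommutes with c_k

pair = matmul(c_kp, c_k)                            # a_{kk'} = c_{k'} c_k
pair_dag = matmul(transpose(c_k), transpose(c_kp))  # a^dagger = c_k^dag c_k'^dag
n_k = matmul(transpose(c_k), c_k)
n_kp = matmul(transpose(c_kp), c_kp)
identity = [[int(i == j) for j in range(4)] for i in range(4)]
s_op = sub(sub(identity, n_k), n_kp)                # 1 - n_k - n_k'
```

The diagonal of `s_op` is $+1$ on the empty state and $-1$ on the doubly occupied state, which is the sense in which the pair behaves like a boson only on the empty state.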


4. Hartree—Fock Method 


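Consider a system of identical fermions of spin $\tfrac12$, all in a single-particle potential $V(\mathbf r)$ and subject to two-body interactions represented by the interaction potential $v(\mathbf r-\mathbf r')$. Taking explicit account of spin, and using arbitrary basis functions $|\alpha\sigma\rangle$, where $\sigma$ is the spin quantum number and $\alpha$ stands for all other quantum numbers, we may write

$$\psi(\mathbf r,\sigma)=\sum_\alpha c_{\alpha\sigma}\,\varphi_{\alpha\sigma}(\mathbf r) \tag{10.53}$$

where

$$\varphi_{\alpha\sigma}(\mathbf r)=\langle\mathbf r|\alpha\sigma\rangle \tag{10.54}$$

and $c_{\alpha\sigma}$, $c^\dagger_{\alpha\sigma}$ are the annihilation and creation operators for fermions in states $|\alpha\sigma\rangle$. The corresponding creation operator is

$$\psi^\dagger(\mathbf r,\sigma)=\sum_\alpha c^\dagger_{\alpha\sigma}\,\varphi^*_{\alpha\sigma}(\mathbf r) \tag{10.55}$$

The “wave functions” $\varphi_{\alpha\sigma}$ must satisfy

$$\int\varphi^*_{\alpha\sigma}(\mathbf r)\,\varphi_{\alpha'\sigma'}(\mathbf r)\,d^3r=\delta_{\alpha\alpha'}\delta_{\sigma\sigma'} \tag{10.56}$$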


The many-body Hamiltonian of the system is then 


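$$\begin{aligned}
H={}&\sum_{\sigma'}\int\psi^\dagger(\mathbf r',\sigma')\left[-\frac{\hbar^2}{2m}\nabla'^2+V(\mathbf r')\right]\psi(\mathbf r',\sigma')\,d^3r'\\
&+\tfrac12\sum_{\sigma',\sigma''}\int\psi^\dagger(\mathbf r'',\sigma'')\psi^\dagger(\mathbf r',\sigma')\,v(\mathbf r'-\mathbf r'')\,\psi(\mathbf r',\sigma')\psi(\mathbf r'',\sigma'')\,d^3r'\,d^3r''\\
={}&\sum_{\alpha\alpha'\sigma'}c^\dagger_{\alpha'\sigma'}c_{\alpha\sigma'}\int\varphi^*_{\alpha'\sigma'}(\mathbf r')\left[-\frac{\hbar^2}{2m}\nabla'^2+V(\mathbf r')\right]\varphi_{\alpha\sigma'}(\mathbf r')\,d^3r'\\
&+\tfrac12\sum_{\alpha\alpha'\beta\beta'\sigma'\sigma''}c^\dagger_{\beta'\sigma''}c^\dagger_{\beta\sigma'}c_{\alpha\sigma'}c_{\alpha'\sigma''}\,(\beta'\beta|v|\alpha\alpha')
\end{aligned} \tag{10.57}$$

using, in the last term, the notation of (10.40).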


Problem 10-2: Given a single-particle potential operator containing spin, for example, $\boldsymbol\sigma\cdot\mathbf p$, show that the second-quantized operator may be taken to be

$$\sum_{\sigma',\sigma''}\int\psi^\dagger(\mathbf r,\sigma'')\,(\boldsymbol\sigma\cdot\mathbf p)_{\sigma''\sigma'}\,\psi(\mathbf r,\sigma')\,d^3r$$

and write it in terms of creation and annihilation operators for eigenstates of momentum and spin.


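Problem 10-3: Consider an interaction operator containing spin, e.g., the “tensor force” operator

$$u=(\boldsymbol\sigma_1\cdot\mathbf R)(\boldsymbol\sigma_2\cdot\mathbf R)\,v_0(R)$$

where $\mathbf R=\mathbf r-\mathbf r'$. Show that the interaction operator may be taken to be

$$\tfrac12\sum_{\sigma_1,\sigma_1';\,\sigma_2,\sigma_2'}\int\psi^\dagger(\mathbf r',\sigma_1')\,\psi^\dagger(\mathbf r'',\sigma_2')\,u\,\psi(\mathbf r'',\sigma_2)\,\psi(\mathbf r',\sigma_1)\,d^3r'\,d^3r''$$

and write it in terms of creation and annihilation operators for eigenstates of momentum and spin.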


To try to find exact solutions for this Hamiltonian is an impossible 
task. We are therefore led to seek approximate methods which produce 
adequate solutions for particular purposes. The Hartree-Fock method, 
which we now develop, provides such an approximation. The concept 
behind the method is this: Since the major difficulties in dealing with 
(10.57) arise from the presence of the interaction terms, which contain 
products of four operators, we replace pairs consisting of a creation and 
an annihilation operator by their mean values. Thus, we are left only with 
terms of the sort that occur in problems in which there are only single- 
particle potentials. The problem is then to solve the resulting single- 
particle equations. The single-particle states will of course be coupled, 
since the mean values, which occur in the expansion of the resulting 
one-particle potentials, will depend on the states of all particles. 

Our development of the method depends on the following observation: If we could find single-particle states $|\alpha\sigma\rangle$ for which

$$[H,c_{\alpha\sigma}]=-E_\alpha c_{\alpha\sigma} \tag{10.58}$$

these states would be such that the annihilation of a particle from one of them in an $N$-body system of energy $E_N$ would produce an $(N-1)$-body system of energy

$$E_{N-1}=E_N-E_\alpha \tag{10.59}$$

$E_\alpha$ would then be, in the usual terminology, the “ionization energy” or “work function” of the removed particle. The many-body system could then be considered to be made up of particles in such single-particle states.

The proof of the proposition made in the previous paragraph follows if one lets both sides of (10.58) operate on the $N$-body state in which there are $N_{\alpha\sigma}$ particles in the single-particle state $|\alpha\sigma\rangle$. (The proof holds for both Bose-Einstein and Fermi-Dirac particles.) Designating the $N$-body state $|N(N_{\alpha\sigma})\rangle$,

$$(Hc_{\alpha\sigma}-c_{\alpha\sigma}H)\,|N(N_{\alpha\sigma})\rangle=-E_\alpha c_{\alpha\sigma}\,|N(N_{\alpha\sigma})\rangle$$

so that

$$H\{c_{\alpha\sigma}|N(N_{\alpha\sigma})\rangle\}=(E_N-E_\alpha)\{c_{\alpha\sigma}|N(N_{\alpha\sigma})\rangle\}$$

or

$$H\,|(N-1)(N_{\alpha\sigma}-1)\rangle=(E_N-E_\alpha)\,|(N-1)(N_{\alpha\sigma}-1)\rangle \tag{10.60}$$

Unfortunately, exact states $|\alpha\sigma\rangle$ satisfying (10.58) do not exist, since clearly the commutators of interaction terms contain three-operator products; however, when creation and annihilation pairs are replaced by their mean values, the solution of (10.58) becomes possible.

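Rather than working directly with (10.58), however, it is simpler to calculate first the commutator $[H,\psi(\mathbf r)]$. $[H,c_{\alpha\sigma}]$ can then be made to satisfy (10.58) by properly choosing the basis functions $\varphi_{\alpha\sigma}$.

Let us calculate the commutator of each term of $H$ [as defined by the first form given in (10.57)] with $\psi(\mathbf r,\sigma)$. The commutator with the first term is

$$K_1=\Big[\sum_{\sigma'}\int\psi^\dagger(\mathbf r',\sigma')\,H_0\,\psi(\mathbf r',\sigma')\,d^3r',\ \psi(\mathbf r,\sigma)\Big]$$

where

$$H_0=-\frac{\hbar^2}{2m}\nabla'^2+V(\mathbf r') \tag{10.61}$$

Now

$$[AB,C]_-=ABC-CAB=A[B,C]_+-[A,C]_+B \tag{10.62}$$

Putting

$$A=\psi^\dagger(\mathbf r',\sigma'),\qquad B=H_0\psi(\mathbf r',\sigma'),\qquad C=\psi(\mathbf r,\sigma)$$

the quantity $K_1$ becomes

$$K_1=-\sum_{\sigma'}\int d^3r'\,[\psi^\dagger(\mathbf r',\sigma'),\psi(\mathbf r,\sigma)]_+\,H_0\,\psi(\mathbf r',\sigma')=-H_0\,\psi(\mathbf r,\sigma) \tag{10.63}$$

where in the final form $H_0$ acts on the unprimed coordinate $\mathbf r$.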
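The operator identity (10.62) holds for any three operators, so it can be spot-checked with arbitrary matrices. The sketch below uses small random integer matrices (an illustrative choice made here; the helper names are hypothetical) so that the comparison is exact.

```python
import random


def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def add(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def sub(a, b):
    return [[x - y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def random_matrix(n):
    return [[random.randint(-3, 3) for _ in range(n)] for _ in range(n)]


random.seed(0)
A, B, C = (random_matrix(3) for _ in range(3))

lhs = sub(matmul(matmul(A, B), C), matmul(C, matmul(A, B)))  # [AB, C]_-
anti_bc = add(matmul(B, C), matmul(C, B))                    # [B, C]_+
anti_ac = add(matmul(A, C), matmul(C, A))                    # [A, C]_+
rhs = sub(matmul(A, anti_bc), matmul(anti_ac, B))            # A[B,C]_+ - [A,C]_+ B
```

The identity is useful here because the field operators obey simple anticommutation relations, so both anticommutators on the right are known.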


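To calculate the second commutator

$$K_2=\Big[\tfrac12\sum_{\sigma',\sigma''}\int\psi^\dagger(\mathbf r'',\sigma'')\psi^\dagger(\mathbf r',\sigma')\,v(\mathbf r''-\mathbf r')\,\psi(\mathbf r',\sigma')\psi(\mathbf r'',\sigma'')\,d^3r'\,d^3r'',\ \psi(\mathbf r,\sigma)\Big]$$

we must calculate

$$[\psi^\dagger(x'')\psi^\dagger(x')\psi(x')\psi(x''),\ \psi(x)]_-$$

where $x$ stands for $(\mathbf r,\sigma)$. We expand this out as follows, using the anticommutation relations for the $\psi$'s:

$$\begin{aligned}
[P,\psi(x)]_-&=[\psi^\dagger(x'')\psi^\dagger(x')\psi(x')\psi(x''),\ \psi(x)]_-\\
&=\psi^\dagger(x'')\psi^\dagger(x')\psi(x')\psi(x'')\psi(x)-\psi(x)P\\
&=\psi^\dagger(x'')\psi^\dagger(x')\psi(x)\psi(x')\psi(x'')-\psi(x)P\\
&=\psi^\dagger(x'')\,[\delta(x'-x)-\psi(x)\psi^\dagger(x')]\,\psi(x')\psi(x'')-\psi(x)P\\
&=-\delta(x'-x)\,\psi^\dagger(x'')\psi(x'')\psi(x')+[-\delta(x''-x)+\psi(x)\psi^\dagger(x'')]\,\psi^\dagger(x')\psi(x')\psi(x'')-\psi(x)P\\
&=-\delta(x'-x)\,\psi^\dagger(x'')\psi(x'')\psi(x')-\delta(x''-x)\,\psi^\dagger(x')\psi(x')\psi(x'')
\end{aligned}$$

Therefore,

$$K_2=-\sum_{\sigma'}\int\psi^\dagger(\mathbf r',\sigma')\psi(\mathbf r',\sigma')\,v(\mathbf r'-\mathbf r)\,d^3r'\;\psi(\mathbf r,\sigma) \tag{10.64}$$

Putting together (10.63) and (10.64), we find that

$$[H,\psi(\mathbf r,\sigma)]=-\Big[H_0+\sum_{\sigma'}\int\psi^\dagger(\mathbf r',\sigma')\psi(\mathbf r',\sigma')\,v(\mathbf r'-\mathbf r)\,d^3r'\Big]\psi(\mathbf r,\sigma) \tag{10.65}$$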


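Making now the Hartree-Fock approximation, the right-hand side becomes

$$-\Big[H_0\psi(\mathbf r,\sigma)+\sum_{\sigma'}\int\langle\psi^\dagger(\mathbf r',\sigma')\psi(\mathbf r',\sigma')\rangle\,v(\mathbf r'-\mathbf r)\,d^3r'\;\psi(\mathbf r,\sigma)-\sum_{\sigma'}\int\langle\psi^\dagger(\mathbf r',\sigma')\psi(\mathbf r,\sigma)\rangle\,v(\mathbf r'-\mathbf r)\,\psi(\mathbf r',\sigma')\,d^3r'\Big] \tag{10.66}$$

Problem 10-4: Show that, if $H$ as given by (10.57) is approximated by the operator $H'$ obtained by “contracting” all possible pairs consisting of one creation and one annihilation operator (that is, by replacing such pairs by the mean value of their product) in the interaction term, $[H',\psi(\mathbf r,\sigma)]$ is given by (10.66).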


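Let us now substitute

$$\psi(\mathbf r,\sigma)=\sum_\alpha c_{\alpha\sigma}\,\varphi_{\alpha\sigma}(\mathbf r) \tag{10.67}$$

into the Hartree-Fock approximation for $[H,\psi(\mathbf r,\sigma)]$, the $\varphi_{\alpha\sigma}$'s being the wave functions of an as yet unspecified set of basis states. The second term, which we call the “direct interaction” term, becomes

$$\sum_{\sigma'\beta\beta'\alpha}\langle c^\dagger_{\beta'\sigma'}c_{\beta\sigma'}\rangle\,c_{\alpha\sigma}\int\varphi^*_{\beta'\sigma'}(\mathbf r')\varphi_{\beta\sigma'}(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\alpha\sigma}(\mathbf r) = \sum_{\beta\sigma'\alpha}n_{\beta\sigma'}\int|\varphi_{\beta\sigma'}(\mathbf r')|^2\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\alpha\sigma}(\mathbf r)\,c_{\alpha\sigma} \tag{10.68}$$

The third, or “exchange,” term is

$$-\sum_{\beta\beta'\alpha\sigma'}\langle c^\dagger_{\beta'\sigma'}c_{\beta\sigma}\rangle\,c_{\alpha\sigma'}\int\varphi^*_{\beta'\sigma'}(\mathbf r')\,\varphi_{\beta\sigma}(\mathbf r)\,\varphi_{\alpha\sigma'}(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r' = -\sum_{\beta\alpha}n_{\beta\sigma}\int\varphi^*_{\beta\sigma}(\mathbf r')\varphi_{\alpha\sigma}(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\beta\sigma}(\mathbf r)\,c_{\alpha\sigma} \tag{10.69}$$

If (10.68) and (10.69) are now substituted into (10.65) it follows that

$$[H,\psi(\mathbf r,\sigma)]_{\text{Hartree-Fock}}=-\sum_\alpha c_{\alpha\sigma}\Big\{H_0\varphi_{\alpha\sigma}(\mathbf r)+\sum_{\beta\sigma'}n_{\beta\sigma'}\int|\varphi_{\beta\sigma'}(\mathbf r')|^2\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\alpha\sigma}(\mathbf r)-\sum_\beta n_{\beta\sigma}\int\varphi^*_{\beta\sigma}(\mathbf r')\varphi_{\alpha\sigma}(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\beta\sigma}(\mathbf r)\Big\} \tag{10.70}$$

Substituting on the left-hand side

$$[H,\psi(\mathbf r,\sigma)]=\sum_\alpha\varphi_{\alpha\sigma}(\mathbf r)\,[H,c_{\alpha\sigma}] \tag{10.71}$$

We then see, on comparing (10.70) and (10.71), that in the Hartree-Fock approximation it is possible to choose states such that $[H,c_{\alpha\sigma}]=-E_\alpha c_{\alpha\sigma}$ provided that their wave functions satisfy

$$H_0\varphi_{\alpha\sigma}(\mathbf r)+\sum_{\beta\sigma'}n_{\beta\sigma'}\int|\varphi_{\beta\sigma'}(\mathbf r')|^2\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\alpha\sigma}(\mathbf r)-\sum_\beta n_{\beta\sigma}\int\varphi^*_{\beta\sigma}(\mathbf r')\varphi_{\alpha\sigma}(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi_{\beta\sigma}(\mathbf r)=E_\alpha\varphi_{\alpha\sigma}(\mathbf r) \tag{10.72}$$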


These are the so-called ‘“‘Hartree—Fock equations.” Let us note several 
facts about them. 


Problem 10-5: Prove that, if a particle in state $|\alpha\sigma\rangle$ is added to an $(N-1)$-body system to create the $N$-body system described by (10.72), the energy is increased by $E_\alpha$ in the Hartree-Fock approximation.


(a) They are coupled equations; that is, they are equations for the set of wave functions of the basis states which has the property that, in the approximation in question, the many-body system can be described in terms of occupancy of these states.

(b) The states of the basis are mutually orthogonal. To show this, multiply Eq. (10.72) by $\varphi^*_\beta$, multiply the equation for $\varphi^*_\beta$, viz.,

$$\Big[-\frac{\hbar^2}{2m}\nabla^2+V\Big]\varphi^*_\beta(\mathbf r)+\sum_{\alpha'}n_{\alpha'}\int|\varphi_{\alpha'}(\mathbf r')|^2\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi^*_\beta(\mathbf r)-\sum_{\alpha'}n_{\alpha'}\int\varphi_{\alpha'}(\mathbf r')\varphi^*_\beta(\mathbf r')\,v(\mathbf r-\mathbf r')\,d^3r'\;\varphi^*_{\alpha'}(\mathbf r)=E_\beta\varphi^*_\beta(\mathbf r) \tag{10.73}$$

by $\varphi_\alpha(\mathbf r)$, subtract one equation from the other, and integrate over $\mathbf r$. (Spin labels are suppressed here.) By virtue of the Hermitian property of the operator $(-\hbar^2/2m)\nabla^2+V$, the left-hand sides cancel, leaving

$$(E_\alpha-E_\beta)\int\varphi^*_\beta(\mathbf r)\,\varphi_\alpha(\mathbf r)\,d^3r=0$$

so that states of different energy are orthogonal.

(c) While the $\beta$ sums in (10.72) range over all occupied states, including $|\alpha\rangle$, the $\beta=\alpha$ terms cancel.

(d) The sum in the first summation, the “direct-interaction” term, is over states of both spins. In the second, the “exchange” term, the sum is over all states whose spin is the same as that of the state whose wave function is being determined.

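(e) The energy of the system is not the sum of the energies of the individual particles. This is physically evident from the interpretation of the $E_\alpha$'s as ionization energies of the individual particles from the $N$-particle system. The total energy can be calculated as the mean value of the Hamiltonian (10.57). In the interaction term, the only combinations of operators which have nonzero mean value are those for which either

(i) $\beta'=\alpha'$, $\beta=\alpha$   or   (ii) $\beta'=\alpha$, $\beta=\alpha'$, $\sigma'=\sigma''$.

Thus the mean value of the interaction term is

$$\tfrac12\sum\langle c^\dagger_{\alpha'\sigma''}c^\dagger_{\alpha\sigma'}c_{\alpha\sigma'}c_{\alpha'\sigma''}\rangle\,(\alpha'\alpha|v|\alpha\alpha')+\tfrac12\sum\langle c^\dagger_{\alpha\sigma''}c^\dagger_{\alpha'\sigma'}c_{\alpha\sigma'}c_{\alpha'\sigma''}\rangle\,(\alpha\alpha'|v|\alpha\alpha') \tag{10.74}$$

Both the mean values can be reduced to number operators:

$$c^\dagger_{\alpha'\sigma''}c^\dagger_{\alpha\sigma'}c_{\alpha\sigma'}c_{\alpha'\sigma''}=-c^\dagger_{\alpha'\sigma''}c^\dagger_{\alpha\sigma'}c_{\alpha'\sigma''}c_{\alpha\sigma'}=c^\dagger_{\alpha'\sigma''}c_{\alpha'\sigma''}c^\dagger_{\alpha\sigma'}c_{\alpha\sigma'}-\delta_{\alpha\alpha'}\delta_{\sigma'\sigma''}\,c^\dagger_{\alpha'\sigma''}c_{\alpha\sigma'}$$

which has mean value

$$n_{\alpha'\sigma''}n_{\alpha\sigma'}-\delta_{\alpha\alpha'}\delta_{\sigma'\sigma''}\,n_{\alpha\sigma'}$$

and

$$c^\dagger_{\alpha\sigma''}c^\dagger_{\alpha'\sigma'}c_{\alpha\sigma'}c_{\alpha'\sigma''}=-c^\dagger_{\alpha\sigma''}c_{\alpha\sigma'}c^\dagger_{\alpha'\sigma'}c_{\alpha'\sigma''}+\delta_{\alpha\alpha'}\,c^\dagger_{\alpha\sigma''}c_{\alpha'\sigma''}$$

which has the mean value

$$-n_{\alpha\sigma'}n_{\alpha'\sigma''}\,\delta_{\sigma'\sigma''}+\delta_{\alpha\alpha'}\,n_{\alpha\sigma''}$$

When these are substituted in (10.74), the second terms cancel each other; the first ones give

$$\tfrac12\sum n_{\alpha'\sigma''}n_{\alpha\sigma'}\,(\alpha'\alpha|v|\alpha\alpha')-\tfrac12\sum n_{\alpha'\sigma''}n_{\alpha\sigma'}\,\delta_{\sigma'\sigma''}\,(\alpha\alpha'|v|\alpha\alpha') \tag{10.75}$$

It is easily verified that this is exactly half the sum of the interaction parts of the $E_\alpha$'s. This is because, in summing the $E_\alpha$'s, all interactions are taken into account twice: once in the equation for each of the two particles.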


5. Hartree-Fock for Free 
Interacting Particles 


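Consider a system of identical particles which are free, i.e., not subject to an external potential, but which interact with the potential $v(|\mathbf r-\mathbf r'|)$. We show that, in this case, plane waves (momentum eigenstates) constitute a solution of the Hartree-Fock equations.

The fact that momentum eigenfunctions provide a solution is suggested by the fact that the Hamiltonian of (10.72) with $V=0$ is invariant under a spatial translation. It may be verified directly by putting, for $\varphi_\alpha$ and $\varphi_\beta$,

$$\frac{1}{\sqrt\Omega}\,e^{i\mathbf k\cdot\mathbf r},\qquad\frac{1}{\sqrt\Omega}\,e^{i\mathbf k'\cdot\mathbf r} \tag{10.76}$$

respectively, where the system is assumed to be enclosed in a volume $\Omega$. Substitution in (10.72) gives

$$\frac{\hbar^2k^2}{2m}\,e^{i\mathbf k\cdot\mathbf r}+\sum_{\mathbf k'\sigma'}n_{\mathbf k'\sigma'}\,\frac1\Omega\int v(|\mathbf r-\mathbf r'|)\,d^3r'\cdot e^{i\mathbf k\cdot\mathbf r}-\sum_{\mathbf k'}n_{\mathbf k'\sigma}\,\frac1\Omega\int e^{-i\mathbf k'\cdot\mathbf r'}\,v(|\mathbf r-\mathbf r'|)\,e^{i\mathbf k\cdot\mathbf r'}\,d^3r'\cdot e^{i\mathbf k'\cdot\mathbf r}=E_{\mathbf k}\,e^{i\mathbf k\cdot\mathbf r} \tag{10.77}$$

Taking out $e^{i\mathbf k\cdot\mathbf r}$ as a common factor, we find an expression for $E_{\mathbf k}$:

$$E_{\mathbf k}=\frac{\hbar^2k^2}{2m}+\frac N\Omega\int v(|\mathbf r-\mathbf r'|)\,d^3r'-\sum_{\mathbf k'}n_{\mathbf k'\sigma}\,\frac1\Omega\int e^{i(\mathbf k-\mathbf k')\cdot\mathbf R}\,v(R)\,d^3R \tag{10.78}$$

where

$$\mathbf R=\mathbf r-\mathbf r' \tag{10.79}$$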

and $N$ is the total number of particles. (We let $\Omega\to\infty$, $N\to\infty$ in such a way that the particle density $N/\Omega$ is a constant.) As $\Omega\to\infty$ the second term becomes constant; we therefore pay no more attention to it, since it does not affect the dynamical properties of the particle. If we write

$$v(\mathbf k)=\frac1\Omega\int e^{i\mathbf k\cdot\mathbf r}\,v(r)\,d^3r \tag{10.80}$$

the second term is $Nv(0)$ and the third, the exchange term, is

$$-\sum_{\mathbf k'}n_{\mathbf k'\sigma}\,v(\mathbf k-\mathbf k')$$

Let us consider the last term for the case of a Yukawa potential

$$v(r)=\frac{g^2}{r}\,e^{-\lambda r} \tag{10.81}$$

It is then a straightforward calculation to show that

$$v(\mathbf k-\mathbf k')=\frac{4\pi g^2}{\Omega}\,\frac{1}{\lambda^2+|\mathbf k-\mathbf k'|^2} \tag{10.82}$$
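The Fourier transform (10.82) can be spot-checked numerically. After the angular integration, (10.80) with the Yukawa potential reduces to the radial integral $(4\pi g^2/k\Omega)\int_0^\infty\sin(kr)\,e^{-\lambda r}\,dr$. The sketch below evaluates it with a midpoint rule; the cutoff at $40/\lambda$, the step count, the sample values of $k$ and $\lambda$, the choice $\Omega=1$, and the function names are all assumptions made for this illustration.

```python
import math


def yukawa_ft_numeric(k, lam, g2=1.0, omega=1.0, steps=200_000):
    """Midpoint-rule estimate of (1/omega) * integral of exp(i k.r) v(r)."""
    r_max = 40.0 / lam          # exp(-40) makes the neglected tail negligible
    h = r_max / steps
    radial = sum(
        math.sin(k * (j + 0.5) * h) * math.exp(-lam * (j + 0.5) * h)
        for j in range(steps)
    ) * h
    return 4 * math.pi * g2 * radial / (k * omega)


def yukawa_ft_closed(k, lam, g2=1.0, omega=1.0):
    """Closed form (10.82), with k standing for |k - k'|."""
    return 4 * math.pi * g2 / (omega * (lam ** 2 + k ** 2))


numeric = yukawa_ft_numeric(1.5, 1.0)
closed = yukawa_ft_closed(1.5, 1.0)
```

Letting `lam` tend to zero in `yukawa_ft_closed` gives the Coulomb form $4\pi g^2/\Omega k^2$ used later in (10.90).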
The sum over $\mathbf k'$ in (10.78) may now be replaced by an integral. The number of states per unit volume of $\mathbf k$ space is $\Omega/8\pi^3$;¹ therefore, the exchange energy of the particle in state $\mathbf k$ is

$$v_{\rm ex}(\mathbf k)=-\frac{g^2}{2\pi^2}\int\frac{d^3k'}{\lambda^2+|\mathbf k-\mathbf k'|^2} \tag{10.83}$$

where the integral is over the occupied part of $\mathbf k'$ space. If the cosine of the angle between $\mathbf k$ and $\mathbf k'$ is put equal to $\mu$, (10.83) can be expressed as

$$v_{\rm ex}(k)=-\frac{g^2}{\pi}\int_0^{k_F}k'^2\,dk'\int_{-1}^{1}\frac{d\mu}{\lambda^2+k^2+k'^2-2kk'\mu}$$

where we have assumed that the states are occupied up to $k=k_F$. The value of $k_F$ is determined by integrating the density of states up to $k=k_F$ and putting it equal to the number of particles of the appropriate spin.


¹ To see this consider a rectangular “box” of dimensions $a\times b\times c$. Free-particle solutions in it have the form $\sin(l_1\pi x/a)\sin(l_2\pi y/b)\sin(l_3\pi z/c)$, so the possible $\mathbf k$ vectors are $(l_1\pi/a,\ l_2\pi/b,\ l_3\pi/c)$. The volume of $\mathbf k$ space occupied by each is $\pi^3/abc=\pi^3/\Omega$, so that the density of states is $\Omega/\pi^3$. But identical states in which any of the components may be negative are found in the other seven octants of $\mathbf k$ space. Therefore, the density of independent states is $\Omega/8\pi^3$.


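Thus

$$\frac{\Omega}{8\pi^3}\cdot\frac{4\pi}{3}\,k_{F\sigma}^3=N_\sigma$$

so that

$$k_{F\sigma}=\left(\frac{6\pi^2N_\sigma}{\Omega}\right)^{1/3} \tag{10.84}$$

On integrating over $\mu$, (10.83) becomes

$$v_{\rm ex}(k)=-\frac{g^2}{2\pi k}\int_0^{k_F}k'\,dk'\,\ln\frac{\lambda^2+(k+k')^2}{\lambda^2+(k-k')^2}$$

By evaluating the integral it may be verified that

$$v_{\rm ex}(k)=-\frac{g^2}{2\pi}\left[\frac{k_F^2-k^2+\lambda^2}{2k}\,\ln\frac{\lambda^2+(k_F+k)^2}{\lambda^2+(k_F-k)^2}+2k_F-2\lambda\tan^{-1}\frac{2\lambda k_F}{\lambda^2+k^2-k_F^2}\right] \tag{10.85}$$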


Problem 10-6: Prove the formula (10.85). 


In the case of Coulomb forces $g^2=e^2$ and $\lambda=0$, so that

$$v_{\rm ex}(k)=-\frac{e^2}{2\pi}\left[\frac{k_F^2-k^2}{k}\,\ln\frac{k_F+k}{k_F-k}+2k_F\right] \tag{10.86}$$

Problem 10-7: Defining the effective mass $m^*$ by

$$\frac{1}{m^*}=\frac{1}{\hbar^2}\frac{d^2E(k)}{dk^2}$$

show that at $k=k_F$ the effective mass of an electron in an electron gas is zero in the Hartree-Fock approximation.


Problem 10-8: Prove that the exchange potential energy of an electron in an electron gas is twice as great at $k=0$ as at $k=k_F$, and that the average value over the occupied levels is halfway between that at $k=0$ and that at $k=k_F$.


6. Kinetic and Exchange 
Energies of a “Fermi Gas” 
of Electrons 


A many-body system of electrons is often referred to as a “Fermi gas” of electrons. In this section we calculate the kinetic and exchange energies of such a system. We ignore the direct Coulomb energy. If the electron system were considered to be embedded in a uniform distribution of positive charge, so as to ensure over-all charge neutrality, the Coulomb energies would cancel. In an actual solid, which is a practical realization of a many-electron system, the positive charges are not uniformly distributed, but rather appear as a lattice of nuclei (or ion cores). There are then Coulomb energies of quadrupole or higher order. We confine our attention here, however, to the idealized system.

We first calculate the kinetic and exchange energies of the spin-up electrons and the spin-down ones separately. Let there be $N_+$ of the first, $N_-$ of the second, where the total number is

$$N=N_++N_-$$

We then consider two distinct problems:

(1) That of finding the condition that the ground state energy correspond to $N_+=N_-$; this, we see, occurs at sufficiently high electron densities.

(2) That of finding the total kinetic and exchange energies when $N_+=N_-$.

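Consider first the kinetic energy of $N_+$ electrons of a given spin. Assuming the lowest electron states to be filled, it is

$$T=\frac{\Omega}{8\pi^3}\,\frac{\hbar^2}{2m}\int_0^{k_F}k^2\,4\pi k^2\,dk=\frac{\Omega}{20\pi^2}\,\frac{\hbar^2k_F^5}{m}=\frac{\Omega\hbar^2}{20\pi^2m}\left(\frac{6\pi^2N_+}{\Omega}\right)^{5/3} \tag{10.87}$$

by (10.84). Putting $\Omega=1$, so that we calculate energy per unit volume, we find that

$$T=AN_+^{5/3} \tag{10.88}$$

where

$$A=\frac{\pi^{4/3}\,2^{-1/3}\,3^{5/3}}{5}\,\frac{\hbar^2}{m} \tag{10.89}$$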


Next, let us calculate the exchange energy for the particles of each spin. A general formula for the exchange energy can be evaluated from (10.82). If we put $g=e$, $\lambda=0$,

$$-v(\mathbf k-\mathbf k')=-\frac{4\pi e^2}{\Omega}\,\frac{1}{|\mathbf k-\mathbf k'|^2} \tag{10.90}$$

is the exchange energy of interaction of particles with momenta $\hbar\mathbf k$, $\hbar\mathbf k'$ and having the same spin. It follows that the exchange energy is

$$\epsilon=-\frac{2\pi e^2}{\Omega}\left(\frac{\Omega}{8\pi^3}\right)^2\int d^3k\int d^3k'\,\frac{1}{|\mathbf k-\mathbf k'|^2} \tag{10.91}$$

where the $\mathbf k$ integrals are over spheres of radius $k_F$. The integral may be evaluated to give $4\pi^2k_F^4$. The exchange energy is, then,

$$\epsilon=-\frac{2\pi e^2}{\Omega}\left(\frac{\Omega}{8\pi^3}\right)^2 4\pi^2\left(\frac{6\pi^2N_+}{\Omega}\right)^{4/3} \tag{10.92}$$

Again putting $\Omega=1$, this becomes

$$\epsilon=-BN_+^{4/3} \tag{10.93}$$

where

$$B=2^{-5/3}\,\pi^{-1/3}\,3^{4/3}\,e^2 \tag{10.94}$$


Problem 10-9: Show that

$$\iint_{(k_F)}\frac{d^3k'\,d^3k}{|\mathbf k-\mathbf k'|^2}=4\pi^2k_F^4$$

[Hint: Use the fact that

$$\frac{1}{|\mathbf k-\mathbf k'|}=\sum_l\frac{k_<^{\,l}}{k_>^{\,l+1}}\,P_l(\cos\theta)$$

where $k_<$, $k_>$ are, respectively, the lesser and greater of $k$, $k'$, and $\theta$ is the angle between the vectors $\mathbf k$ and $\mathbf k'$.]


Problem 10-10: As an alternative to the procedure employed above, use (10.86) to calculate the exchange energy of $N$ particles of a given spin.
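The route suggested in Problem 10-10 can be followed numerically: summing the single-particle exchange energy (10.86) over the occupied states, with a factor $\tfrac12$ so that each pair is counted once, should reproduce $-BN_+^{4/3}$. The sketch below assumes units with $e=1$ and $\Omega=1$, an arbitrary sample value of $k_F$, and a midpoint rule for the $k$ integral; the function names are hypothetical.

```python
import math


def vex_coulomb(k, kf, e2=1.0):
    """Single-particle exchange energy (10.86)."""
    if abs(kf - k) < 1e-14 * kf:
        return -e2 * kf / math.pi   # the logarithmic term vanishes at k = kf
    log_term = (kf ** 2 - k ** 2) / k * math.log((kf + k) / (kf - k))
    return -e2 / (2 * math.pi) * (log_term + 2 * kf)


def exchange_energy(kf, e2=1.0, omega=1.0, steps=20000):
    """(1/2) * sum over occupied k of vex(k), as a midpoint-rule integral."""
    h = kf / steps
    integral = sum(
        4 * math.pi * ((j + 0.5) * h) ** 2 * vex_coulomb((j + 0.5) * h, kf, e2)
        for j in range(steps)
    ) * h
    return 0.5 * omega / (8 * math.pi ** 3) * integral


kf = 1.3                                                 # sample Fermi wave number
n_plus = kf ** 3 / (6 * math.pi ** 2)                    # (10.84) with omega = 1
B = 2 ** (-5 / 3) * math.pi ** (-1 / 3) * 3 ** (4 / 3)   # (10.94) with e = 1
eps_numeric = exchange_energy(kf)
eps_formula = -B * n_plus ** (4 / 3)                     # (10.93)
```

Both values equal $-e^2k_F^4/8\pi^3$, which corresponds to an exchange energy of $-\tfrac34 e^2k_F/\pi$ per particle.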


7. Condition of Instability of the 
Zero-Spin State 


It may be noted that for high enough densities the kinetic energy 
dominates the exchange, while for low density the reverse is the case. 
Now, so far as kinetic energy is concerned, it is least when there are equal 
number of particles of the two spins. Therefore, at high density we 
expect the lowest-energy state to correspond to equipartition of the 
electrons between the spin states. 

The exchange energy, on the other hand, tends to favor lining up the electron spins, since the (negative) exchange energy operates only between particles of parallel spin. Therefore, at low enough density we expect that the state of equipartition between the spin states will be unstable, and that the energy can be lowered by having more electrons in one spin state than in the other. These considerations might constitute a basis for a model theory of ferromagnetism (spontaneous magnetization). We therefore investigate the conditions under which the equal or unequal spin occupancies correspond to the lowest energy.

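The total energy is

$$U=T+\epsilon=AN_+^{5/3}+AN_-^{5/3}-BN_+^{4/3}-BN_-^{4/3} \tag{10.95}$$

The energy is a maximum or minimum when $\partial U/\partial N_+=0$. Remembering that, since $N_-=N-N_+$, $\partial/\partial N_-=-\partial/\partial N_+$, we differentiate $U$ to get

$$\frac{\partial U}{\partial N_+}=\tfrac53A\,(N_+^{2/3}-N_-^{2/3})-\tfrac43B\,(N_+^{1/3}-N_-^{1/3}) \tag{10.96}$$

This is zero for $N_+=N_-$, but to determine whether it corresponds to a maximum or minimum of the energy, we must calculate the second derivative for $N_+=N_-$:

$$\frac{\partial^2U}{\partial N_+^2}=\tfrac{10}{9}A\,(N_+^{-1/3}+N_-^{-1/3})-\tfrac49B\,(N_+^{-2/3}+N_-^{-2/3})=\tfrac{20}{9}A\left(\frac N2\right)^{-1/3}-\tfrac89B\left(\frac N2\right)^{-2/3} \tag{10.97}$$

for the values in question. If this is $>0$, the equal distribution gives minimum energy; if it is $<0$, a maximum. The condition for unequal spin distribution is, therefore,

$$2B\left(\frac N2\right)^{-2/3}>5A\left(\frac N2\right)^{-1/3}$$

or

$$\left(\frac N2\right)^{1/3}<\frac{2B}{5A} \tag{10.98}$$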


A convenient and commonly used parameter is the dimensionless constant $r_s$, the radius, in units of the Bohr radius $a_0$, of the sphere which contains on the average one electron. It may be expressed in terms of $N$ by the formula

$$\frac1N=\frac43\pi r_s^3a_0^3 \tag{10.99}$$

or

$$\left(\frac N2\right)^{1/3}=\left(\frac{3}{8\pi}\right)^{1/3}\frac{1}{r_sa_0} \tag{10.100}$$

The condition (10.98) may then be written (for $\Omega=1$)

$$\left(\frac{3}{8\pi}\right)^{1/3}\frac{1}{r_sa_0}<\frac{2B}{5A}$$

or

$$r_s>\pi^{4/3}\,3^{2/3}\,2^{-2/3}=6.03 \tag{10.101}$$

This corresponds to a density a little lower than that found in any metal in which the electrons can reasonably be represented as free.
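The threshold in (10.101) can be evaluated directly from the constants $A$ and $B$. The sketch below works in units with $\hbar=m=e=1$ (so that $a_0=1$) and $\Omega=1$; these unit choices and the function names are assumptions made for the illustration.

```python
import math

A = math.pi ** (4 / 3) * 2 ** (-1 / 3) * 3 ** (5 / 3) / 5   # (10.89), hbar = m = 1
B = 2 ** (-5 / 3) * math.pi ** (-1 / 3) * 3 ** (4 / 3)      # (10.94), e = 1


def d2u(n_total):
    """Second derivative (10.97) evaluated at N+ = N- = n_total / 2."""
    half = n_total / 2
    return 20 / 9 * A * half ** (-1 / 3) - 8 / 9 * B * half ** (-2 / 3)


def density_from_rs(rs):
    """Invert (10.99) for N, with omega = 1 and a0 = 1."""
    return 3 / (4 * math.pi * rs ** 3)


# Critical r_s from (10.98) combined with (10.100).
rs_critical = (3 / (8 * math.pi)) ** (1 / 3) * 5 * A / (2 * B)
```

For `rs` below `rs_critical` the second derivative is positive (equal spin populations are stable); above it the sign reverses.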

The concepts behind this calculation, however, form the basis of a 
quite reasonable approach to the problem of ferromagnetism. The 
necessary modification is to take account of the fact that the electrons 
are not free, and must be represented by more complicated wave func- 
tions, with a consequent modification in their energy spectrum. 

It is particularly worth noting that the calculation given would not be modified by taking account of the mutual Coulomb repulsion of electrons, since the Coulomb potential operates between all pairs of electrons, and so is independent of the electron spins.


8. Total Kinetic and Exchange 
Energies of the Electron Gas 


It is instructive to write the total kinetic and exchange energies in terms of the parameter $r_s$, in the case in which $N_+=N_-=N/2$. The kinetic energy is

$$T_0=2A\left(\frac N2\right)^{5/3} \tag{10.102}$$

The kinetic energy per particle is

$$t_0=\frac1N\,T_0=2^{-2/3}AN^{2/3} \tag{10.103}$$

This may be written in terms of $r_s$ by using (10.100):

$$t_0=2^{-2/3}A\left(\frac{3}{4\pi}\right)^{2/3}\frac{1}{r_s^2a_0^2} \tag{10.104}$$

If we substitute for $A$ from (10.89), we find that

$$t_0=\frac{2^{-4/3}\,3^{7/3}\,\pi^{2/3}}{5}\left(\frac{me^4}{2\hbar^2}\right)\frac{1}{r_s^2}=\frac{2.21}{r_s^2}\ \text{Rydbergs} \tag{10.105}$$

since $me^4/2\hbar^2$ is the Rydberg, the energy of the ground state of the hydrogen atom.

The exchange energy per particle is

$$\epsilon_0=\frac{\epsilon}{N}=-2^{-1/3}BN^{1/3}=-2^{-5/3}\,\pi^{-2/3}\,3^{5/3}\,\frac{1}{r_s}\,\frac{e^2}{2a_0} \tag{10.106}$$

$$=-\frac{0.916}{r_s}\ \text{Rydbergs} \tag{10.107}$$
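The numerical coefficients in (10.105) and (10.107) follow from $A$, $B$, and (10.100). The sketch below recomputes them in units with $\hbar=m=e=1$, in which $a_0=1$ and the Rydberg $me^4/2\hbar^2$ equals $\tfrac12$; the unit choice and the function names are assumptions made for this check.

```python
import math

A = math.pi ** (4 / 3) * 2 ** (-1 / 3) * 3 ** (5 / 3) / 5   # (10.89)
B = 2 ** (-5 / 3) * math.pi ** (-1 / 3) * 3 ** (4 / 3)      # (10.94)
RYDBERG = 0.5   # m e^4 / (2 hbar^2) in units hbar = m = e = 1


def kinetic_per_particle(rs):
    """Kinetic energy per particle, (10.104), with a0 = 1."""
    return 2 ** (-2 / 3) * A * (3 / (4 * math.pi)) ** (2 / 3) / rs ** 2


def exchange_per_particle(rs):
    """Exchange energy per particle, -2^(-1/3) B N^(1/3), with a0 = 1."""
    return -2 ** (-1 / 3) * B * (3 / (4 * math.pi)) ** (1 / 3) / rs


kinetic_coeff = kinetic_per_particle(1.0) / RYDBERG     # coefficient of 1/rs^2
exchange_coeff = exchange_per_particle(1.0) / RYDBERG   # coefficient of 1/rs
```

The kinetic term scales as $1/r_s^2$ and the exchange term as $1/r_s$, which is why the kinetic energy dominates at high density (small $r_s$).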


If one goes beyond the Hartree-Fock approximation, it is possible to develop the energy per particle as an expansion in $r_s$. The expansion is one which converges more rapidly the smaller $r_s$, or the larger the density. The next term in the expansion has been shown to be proportional to $\ln r_s$.


9. Excited States of Many-Body 
Systems 


So far, we have been considering only the ground state of many-body systems. If such systems are in a “temperature bath” they will be in excited states. For a system of many particles, there are many states within a very small energy range. Suppose, for instance, we consider $N$ free, noninteracting particles in a volume $\Omega$. The total number of states in the $3N$-dimensional volume $d^3k_1\,d^3k_2\cdots d^3k_N$ of the space of all their $\mathbf k$ vectors is $(2\Omega/8\pi^3)^N\,d^3k_1\,d^3k_2\cdots d^3k_N$. The number of states in an energy range $(E,\ E+\Delta E)$ is determined by the volume between the surfaces

$$\sum_{n=1}^{N}\frac{\hbar^2k_n^2}{2m}=(E,\ E+\Delta E).$$

This was calculated in Chapter 7 and found to be

$$\left(\frac{2m\pi}{\hbar^2}\right)^{3N/2}\frac{E^{(3N/2)-1}\,\Delta E}{\Gamma(3N/2)}$$

Therefore the number of states per unit energy range is

$$\nu(E)=\left(\frac{2\Omega}{8\pi^3}\right)^{N}\left(\frac{2m\pi}{\hbar^2}\right)^{3N/2}\frac{E^{(3N/2)-1}}{\Gamma(3N/2)} \tag{10.108}$$

If the average energy per particle is $E_0$, so that $E=NE_0$, and if we use Stirling's formula for the $\Gamma$ function, the last two factors can be combined into

$$\sqrt{\frac{3}{4\pi N}}\left(\frac{2e}{3}\right)^{3N/2}E_0^{(3N/2)-1}$$

for large $N$ ($e$ here being the base of natural logarithms). Thus

$$\nu(E)\approx\left(\frac{2\Omega}{8\pi^3}\right)^{N}\left(\frac{4\pi meE_0}{3\hbar^2}\right)^{3N/2}\sqrt{\frac{3}{4\pi N}}\,\frac{1}{E_0} \tag{10.109}$$


. (20\" (Elev) \"" soon. /|_ 3 

a (2) ( ule ) 10 VENE (10.110) 
$E_0$(eV) being the average electron energy in electron volts. We see that, if $\Omega$ is a macroscopic volume, the density of states is very large indeed.
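The size of $\nu(E)$ and the quality of the Stirling step between (10.108) and (10.109) can be examined by working with logarithms, which avoids overflow. The sketch below uses the standard-library `math.lgamma` for $\ln\Gamma$; the units ($\hbar=m=1$), the sample values of $N$, $E_0$, and $\Omega$, and the function names are assumptions made for the illustration.

```python
import math


def ln_nu_exact(n, e0, omega=1.0, m=1.0, hbar=1.0):
    """Natural log of (10.108) with E = n * e0."""
    total_e = n * e0
    return (n * math.log(2 * omega / (8 * math.pi ** 3))
            + 1.5 * n * math.log(2 * math.pi * m / hbar ** 2)
            + (1.5 * n - 1) * math.log(total_e)
            - math.lgamma(1.5 * n))


def ln_nu_stirling(n, e0, omega=1.0, m=1.0, hbar=1.0):
    """Natural log of the Stirling form (10.109)."""
    return (n * math.log(2 * omega / (8 * math.pi ** 3))
            + 1.5 * n * math.log(4 * math.pi * m * math.e * e0
                                 / (3 * hbar ** 2))
            + 0.5 * math.log(3 / (4 * math.pi * n))
            - math.log(e0))


difference = ln_nu_exact(1000, 2.0) - ln_nu_stirling(1000, 2.0)
```

Already for a thousand particles the two logarithms agree to a small fraction of a unit, while $\ln\nu$ itself grows in proportion to $N$.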

In fact, due to collisions, the width of the many-particle states is 
much greater than the distance between them. 

Therefore, it is not reasonable to try to determine the many-particle 
states. A macroscopic specimen with a given total energy therefore is not 
subject to an exact quantum treatment, but may be treated statistically. 
Such a statistical treatment may be formulated in terms of the density 
matrix, the theory of which we now develop. 


10. Density Matrix 


There are two ways of developing a theory of the density matrix. 
In one, the existence of quantum states for the system is assumed, and 
it is described in terms of the probability of occupancy of these states. 
In the other, due to Fano,² a more operational approach is adopted: the
“statistical operator” (density matrix) is defined in terms of the statistical
averages of dynamical operators. We outline both approaches, and show 
that they lead to identical results. 


FIRST APPROACH 

Suppose that the energy states of the system are designated by $|\alpha\rangle$. Let us imagine an ensemble of macroscopically “identical” systems. By virtue of the remarks above, they are not all in the same state (and in fact any single system is continually undergoing transitions from one state to another, so that a single system at sufficiently large intervals of time also effectively constitutes an ensemble). We therefore let the statistical probability that the system be in state $|\alpha\rangle$ be $p_\alpha$. The statistical operator $\rho$ is then defined as

$$\rho=\sum_\alpha|\alpha\rangle\,p_\alpha\,\langle\alpha| \tag{10.111}$$


² Fano, U., Rev. Mod. Phys. 29, No. 1, 74-93 (1957).


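Let us enumerate some of its properties:

(a) $\mathrm{Tr}\,\rho=1$. This follows from the fact that, if $|n\rangle$ is a complete set of states,

$$\mathrm{Tr}\,\rho=\sum_n\langle n|\rho|n\rangle=\sum_{n,\alpha}\langle n|\alpha\rangle\,p_\alpha\,\langle\alpha|n\rangle$$

Since

$$\sum_n|n\rangle\langle n|=1$$

$$\mathrm{Tr}\,\rho=\sum_\alpha\langle\alpha|\alpha\rangle\,p_\alpha=\sum_\alpha p_\alpha=1 \tag{10.112}$$

(b) The ensemble average of a dynamical variable $K$ is

$$\langle K\rangle=\mathrm{Tr}\,\rho K \tag{10.113}$$

Proof

The average of $K$ in a given quantum state $|\alpha\rangle$ is $\langle\alpha|K|\alpha\rangle$. Therefore,

$$\langle K\rangle=\sum_\alpha p_\alpha\langle\alpha|K|\alpha\rangle$$

But using an arbitrary basis $|n\rangle$,

$$\begin{aligned}
\mathrm{Tr}\,(\rho K)&=\sum_n\langle n|\rho K|n\rangle\\
&=\sum_{n,n'}\langle n|\rho|n'\rangle\langle n'|K|n\rangle\\
&=\sum_{n,n',\alpha}\langle n|\alpha\rangle\,p_\alpha\,\langle\alpha|n'\rangle\langle n'|K|n\rangle\\
&=\sum_{\alpha,n}\langle n|\alpha\rangle\,p_\alpha\,\langle\alpha|K|n\rangle\\
&=\sum_\alpha p_\alpha\langle\alpha|K|\alpha\rangle\\
&=\langle K\rangle
\end{aligned} \tag{10.114}$$

This result is clearly independent of the representation $|n\rangle$.

(c) The time dependence of $\rho$ is given by

$$i\hbar\frac{\partial\rho}{\partial t}=-[\rho,H] \tag{10.115}$$

To prove this, we start from the time-dependent states

$$|\alpha(t)\rangle=e^{-iHt}|\alpha\rangle$$

where we have put $\hbar=1$. Then we define

$$\rho(t)=\sum_\alpha|\alpha(t)\rangle\,p_\alpha\,\langle\alpha(t)|=\sum_\alpha e^{-iHt}|\alpha\rangle\,p_\alpha\,\langle\alpha|e^{iHt}=e^{-iHt}\rho(0)\,e^{iHt}$$

Therefore,

$$\frac{\partial\rho(t)}{\partial t}=-i\,e^{-iHt}\,[H,\rho(0)]\,e^{iHt}$$

so that

$$i\frac{\partial\rho(t)}{\partial t}=[H,\rho(t)]=-[\rho(t),H] \tag{10.116}$$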


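(d) We can transform the matrix elements of the density matrix from one basis to another by a unitary transformation. In a basis $|n\rangle$, the matrix elements of $\rho$ are

$$\langle n'|\rho|n\rangle=\sum_\alpha\langle n'|\alpha\rangle\langle\alpha|n\rangle\,p_\alpha \tag{10.117}$$

To transform to another basis $|\nu\rangle$, it is necessary to write the vectors $|n\rangle$ in terms of the $|\nu\rangle$'s,

$$\langle n'|\rho|n\rangle=\sum_{\alpha,\nu,\nu'}\langle n'|\nu'\rangle\langle\nu'|\alpha\rangle\langle\alpha|\nu\rangle\langle\nu|n\rangle\,p_\alpha=\sum_{\nu,\nu'}\langle n'|\nu'\rangle\langle\nu'|\rho|\nu\rangle\langle\nu|n\rangle \tag{10.118}$$

Define a transformation matrix

$$S_{\nu n}=\langle\nu|n\rangle \tag{10.119}$$

Then we may write the matrix equation

$$\rho=S^\dagger\rho'S=S^{-1}\rho'S$$

or, alternatively

$$\rho'=S\rho S^{-1} \tag{10.120}$$

That $S$ is unitary follows from the fact that

$$(S^\dagger S)_{nn'}=\sum_\nu S^\dagger_{n\nu}S_{\nu n'}=\sum_\nu\langle n|\nu\rangle\langle\nu|n'\rangle=\langle n|n'\rangle=\delta_{nn'} \tag{10.121}$$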
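Properties (a), (b), and (d) can be illustrated with a two-state example. In the sketch below the ensemble states, the probabilities, the Hermitian operator $K$, and the unitary matrix $S$ are arbitrary choices made only for the illustration, and the helper names are hypothetical.

```python
import math


def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def dagger(a):
    """Hermitian conjugate."""
    return [[a[j][i].conjugate() for j in range(len(a))]
            for i in range(len(a[0]))]


def trace(a):
    return sum(a[i][i] for i in range(len(a)))


theta = 0.3
states = [[math.cos(theta), math.sin(theta)],      # |alpha_1>
          [-math.sin(theta), math.cos(theta)]]     # |alpha_2>
probs = [0.7, 0.3]
K = [[1, 2 - 1j], [2 + 1j, -1]]                    # a Hermitian "observable"

# rho = sum_alpha |alpha> p_alpha <alpha|, as in (10.111)
rho = [[sum(p * s[i] * s[j].conjugate() for p, s in zip(probs, states))
        for j in range(2)] for i in range(2)]

# Ensemble average computed state by state: sum_alpha p_alpha <alpha|K|alpha>
ensemble_avg = sum(
    p * sum(s[i].conjugate() * K[i][j] * s[j]
            for i in range(2) for j in range(2))
    for p, s in zip(probs, states))

# Change of basis with a unitary S: rho' = S rho S^dagger, K' = S K S^dagger
r = 1 / math.sqrt(2)
S = [[r, 1j * r], [1j * r, r]]
rho_new = matmul(matmul(S, rho), dagger(S))
K_new = matmul(matmul(S, K), dagger(S))
```

The trace of `rho` is 1, `trace(matmul(rho, K))` reproduces `ensemble_avg`, and the same number is obtained from the transformed matrices, as (10.112) to (10.121) require.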




11. Alternative Method (Fano) 
The alternative procedure is to define $\rho$ operationally as that operator with the property that the statistical expectation value of $K$ is

$$\langle K\rangle=\mathrm{Tr}\,\rho K \tag{10.122}$$

This equation now provides a definition of $\rho$, and does not follow from other postulates. From this definition we can derive other properties of $\rho$, as follows.

(a) $\mathrm{Tr}\,\rho=1$. This follows immediately by taking $K$ equal to the unit operator.

(b) $\rho$ is Hermitian. This follows if it can be proven that

$$\langle n|\rho|n'\rangle=\langle n'|\rho|n\rangle^* \tag{10.123}$$

To prove (10.123), we make use of the fact that $\langle K\rangle=\mathrm{Tr}\,(\rho K)$ must be real, and must therefore be equal to its complex conjugate. Now,

$$\mathrm{Tr}\,(\rho K)=\sum_{n,n'}\langle n|\rho|n'\rangle\langle n'|K|n\rangle$$

But the complex conjugate of $\langle K\rangle$ is then

$$\langle K\rangle=\sum_{n,n'}\langle n|\rho|n'\rangle^*\langle n|K|n'\rangle$$

since $K$ is Hermitian. Now, in the equation

$$\sum_{n,n'}\langle n|\rho|n'\rangle\langle n'|K|n\rangle=\sum_{n,n'}\langle n'|\rho|n\rangle^*\langle n'|K|n\rangle \tag{10.124}$$

compare coefficients of the arbitrary matrix elements $\langle m'|K|m\rangle$ on the two sides; it follows that

$$\langle m|\rho|m'\rangle=\langle m'|\rho|m\rangle^* \tag{10.125}$$

as required.


(c) Time variation. If, in the Heisenberg representation, we introduce the time-variable operator

$$K(t)=e^{iHt}Ke^{-iHt} \tag{10.126}$$

the mean value of $K$ as a function of time is given by

$$\mathrm{Tr}\,\rho K(t)=\mathrm{Tr}\,\rho\,e^{iHt}Ke^{-iHt}=\mathrm{Tr}\,e^{-iHt}\rho\,e^{iHt}K$$

since a trace is unchanged by a cyclic permutation of its factors. Therefore,

$$\langle K(t)\rangle=\mathrm{Tr}\,\rho(t)K \tag{10.127}$$

where

$$\rho(t)=e^{-iHt}\rho\,e^{iHt} \tag{10.128}$$

so that

$$\frac{\partial\rho(t)}{\partial t}=-i\,[H,\rho(t)] \tag{10.129}$$

or

$$i\frac{\partial\rho(t)}{\partial t}=-[\rho(t),H]$$

as before.

If a system is in a stationary (i.e., equilibrium) state, so that $\rho$ does not depend on time,

$$[H,\rho]=0$$

or

$$\sum_n\big[\langle n'|H|n\rangle\langle n|\rho|n''\rangle-\langle n'|\rho|n\rangle\langle n|H|n''\rangle\big]=0$$

If the basis states $|n\rangle$ are energy eigenstates, this becomes

$$(E_{n'}-E_{n''})\,\langle n'|\rho|n''\rangle=0$$


Thus the matrix elements of p between different energy states are zero. 
If the energy spectrum is nondegenerate, p is diagonal. 
(d) If, for any basis, p is diagonalized, then the diagonal matrix 
elements are the probabilities of occupancy of the states of that basis. 
Assume $\rho$ to be diagonalized in the basis $|n\rangle$, and calculate for an arbitrary operator $K$

$$\langle K\rangle=\sum_{n,n'}\langle n|\rho|n'\rangle\langle n'|K|n\rangle=\sum_n\langle n|\rho|n\rangle\langle n|K|n\rangle \tag{10.130}$$

$\langle n|K|n\rangle$ is the quantum mean value of the operator when the system is in the state $|n\rangle$. To get the ensemble average, we must multiply the quantum mean values in each state by the probabilities of occupancy of those states. Thus $\langle n|\rho|n\rangle$ must be the probability that systems of the ensemble be in state $|n\rangle$.


EXAMPLE: MEAN VALUE OF PARTICLE DENSITY 

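Let us calculate the mean value of the particle density in a system with statistical operator $\rho$. We use the second-quantization formalism. Then the density operator is

$$\int\psi^\dagger(\mathbf x')\,\delta(\mathbf x'-\mathbf x)\,\psi(\mathbf x')\,d^3x'=\psi^\dagger(\mathbf x)\psi(\mathbf x) \tag{10.131}$$

where $\mathbf x$ is the position vector of an arbitrary point. The mean density is, then,

$$\rho(\mathbf x)=\mathrm{Tr}\,\rho\,\psi^\dagger(\mathbf x)\psi(\mathbf x)=\mathrm{Tr}\,\psi(\mathbf x)\,\rho\,\psi^\dagger(\mathbf x)$$

Let us expand $\psi(\mathbf x)$, $\psi^\dagger(\mathbf x)$ in plane waves (momentum eigenfunctions), and use as basis the set of states of occupation of these momentum states. Then

$$\rho(\mathbf x)=\frac1\Omega\sum_{\mathbf k,\mathbf k'}e^{i(\mathbf k-\mathbf k')\cdot\mathbf x}\,\mathrm{Tr}\,c_{\mathbf k}\,\rho\,c^\dagger_{\mathbf k'} \tag{10.132}$$

Considering for the moment only the occupancy of the two states $\mathbf k$ and $\mathbf k'$,

$$c^\dagger_{\mathbf k'}\,|n_{\mathbf k},n_{\mathbf k'}\rangle=(1-n_{\mathbf k'})\,|n_{\mathbf k},n_{\mathbf k'}+1\rangle \tag{10.133}$$

and

$$\langle n_{\mathbf k},n_{\mathbf k'}|\,c_{\mathbf k}=(1-n_{\mathbf k})\,\langle n_{\mathbf k}+1,n_{\mathbf k'}| \tag{10.134}$$

Therefore, on substituting in (10.132),

$$\begin{aligned}
\rho(\mathbf x)&=\frac1\Omega\sum_{\text{all }n}\sum_{\mathbf k,\mathbf k'}e^{i(\mathbf k-\mathbf k')\cdot\mathbf x}\,(1-n_{\mathbf k})(1-n_{\mathbf k'})\,\langle\ldots(n_{\mathbf k}+1)\ldots n_{\mathbf k'}\ldots|\rho|\ldots n_{\mathbf k}\ldots(n_{\mathbf k'}+1)\ldots\rangle\\
&=\frac1\Omega\sum_{n_{\mathbf k''},\,\mathbf k''\neq\mathbf k,\mathbf k'}\ \sum_{\mathbf k,\mathbf k'}e^{i(\mathbf k-\mathbf k')\cdot\mathbf x}\,\langle\ldots1_{\mathbf k}\ldots0_{\mathbf k'}\ldots|\rho|\ldots0_{\mathbf k}\ldots1_{\mathbf k'}\ldots\rangle
\end{aligned} \tag{10.135}$$

The outer sum is over all states of occupancy of all single-particle states except $\mathbf k$ and $\mathbf k'$. We see, then, that the Fourier component of density with wave number $\mathbf q$ is associated with the annihilation of particles of momentum $\mathbf k'$ and the creation of those with momentum $\mathbf k'+\mathbf q$, whatever $\mathbf k'$ may be.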


12. Density Matrix of the 
Canonical Ensemble 


The canonical ensemble is an ensemble in thermal equilibrium with 
its surroundings, but containing a fixed number of particles. It can, 
therefore, exchange energy, but not particles, with its surroundings. It 
is part of a larger system. The remainder of the system may be designated 
the “heat reservoir.” 

Let the number of energy states of N particles with a total energy in
the range $(E, E+\Delta E)$ be $\nu(E, N)\,\Delta E$. It has been calculated earlier for
free particles [see formula (10.110)]. Suppose that our large system has
$N_0$ particles and energy $E_0$, and our subsystem N particles; we are
interested in the probability distribution of each of its energy states.
That for the states in the energy range $(E, E+dE)$ is
$[\nu(E, N)\,\nu(E_0-E, N_0-N)\,dE/\nu(E_0, N_0)]$. Therefore, the probability of
occupancy per energy state in the range $(E, E+dE)$ is

$$\frac{\nu(E_0-E,\ N_0-N)}{\nu(E_0,\ N_0)}$$


For the canonical ensemble, the N's are fixed. Now

$$\ln p(E, N) = \ln \nu(E_0-E, N_0-N) - \ln \nu(E_0, N_0), \qquad p(E, N) = \nu(E_0-E, N_0-N) \times \text{constant} \qquad (10.136)$$


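If we expand

$$\nu(E_0-E, N_0-N) = \nu(E_0, N_0) - E\left(\frac{\partial \nu}{\partial E}\right)_{E_0,N_0} - N\left(\frac{\partial \nu}{\partial N}\right)_{E_0,N_0} + \cdots$$

$$\phantom{\nu(E_0-E, N_0-N)} = \nu(E_0, N_0)\left[1 - E\left(\frac{\partial \ln \nu}{\partial E}\right)_{E_0,N_0} - N\left(\frac{\partial \ln \nu}{\partial N}\right)_{E_0,N_0} + \cdots\right] \qquad (10.137)$$

we may write

$$\ln \nu(E_0-E, N_0-N) = \ln \nu(E_0, N_0) - E\left(\frac{\partial \ln \nu}{\partial E}\right)_{E_0,N_0} - N\left(\frac{\partial \ln \nu}{\partial N}\right)_{E_0,N_0} \qquad (10.138)$$

The expansion to linear terms in E and N is valid in the limit of large
systems of which the subsystem is a very small part, since
$E(\partial \ln \nu/\partial E)_{E_0,N_0}$ is the fraction of the number of states of the whole
system which lie in the range of energy of the subsystem, and the series
is essentially an expansion in powers of this fraction.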

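From (10.136) and (10.138) it follows that

$$p(E, N) = \exp\left[-E\left(\frac{\partial \ln \nu}{\partial E}\right)_{E_0,N_0} - N\left(\frac{\partial \ln \nu}{\partial N}\right)_{E_0,N_0}\right] \qquad (10.139)$$

The coefficient $(\partial \ln \nu/\partial E)_{E_0,N_0}$ may be identified with $1/kT$, k being the
Boltzmann constant and T the temperature. Thus

$$p(E, N) = \frac{1}{Z}\,e^{-E/kT} \qquad (10.140)$$

where Z is a constant to be chosen so that $\mathrm{Tr}\,\rho = 1$.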

The quantities $p(E_n, N)$ are now the probabilities $p_n$ entering into
the definition (10.111) of the density matrix. The statistical operator
may be written

$$\rho = \frac{e^{-\beta H}}{\mathrm{Tr}\,e^{-\beta H}} = \frac{1}{Z}\,e^{-\beta H} \qquad (10.141)$$

where H is the Hamiltonian for the system and $\beta = 1/kT$. Matrix elements
may then be calculated using an arbitrary basis. Mean values may also
be calculated with an arbitrary complete set of states as basis, since the
trace is independent of the basis used.
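As a numerical illustration of (10.141), the following sketch builds the canonical statistical operator for a small Hermitian matrix and checks that its trace is unity and that a mean value does not depend on the basis. The 3-level Hamiltonian, the value of beta, and the function name are hypothetical choices made only for this example.

```python
import numpy as np

def canonical_density_matrix(H, beta):
    """Return rho = exp(-beta*H) / Tr exp(-beta*H) for a Hermitian matrix H."""
    energies, vectors = np.linalg.eigh(H)
    # Shift by the lowest energy for numerical stability; the shift cancels in the ratio.
    weights = np.exp(-beta * (energies - energies.min()))
    weights /= weights.sum()
    # Sum over eigenstates of p_n |n><n| (columns of `vectors` are the |n>).
    return (vectors * weights) @ vectors.conj().T

# Hypothetical 3-level Hamiltonian (arbitrary energy units) and inverse temperature.
H = np.array([[0.0, 0.3, 0.0],
              [0.3, 1.0, 0.2],
              [0.0, 0.2, 2.0]])
beta = 1.5
rho = canonical_density_matrix(H, beta)

# Mean energy Tr(rho H) evaluated in the original basis ...
mean_energy = np.trace(rho @ H).real

# ... and in another orthonormal basis defined by a random orthogonal matrix Q.
rng = np.random.default_rng(0)
Q, _ = np.linalg.qr(rng.normal(size=(3, 3)))
rho_rot = Q.T @ rho @ Q
H_rot = Q.T @ H @ Q
mean_energy_rot = np.trace(rho_rot @ H_rot).real

print(np.trace(rho).real, mean_energy, mean_energy_rot)
```

The two printed mean energies agree to rounding error, which is the basis independence of the trace used in the text.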


13. Grand Canonical Ensemble 


The above calculation remains valid, except that N may now vary.
$p(E, N)$ is given by (10.139). The quantity $-(1/\beta)(\partial \ln \nu/\partial N)_{E_0,N_0}$ is known
as the "chemical potential" and is designated as $\mu$. The statistical operator
has the form

$$\rho = \frac{e^{-\beta(H-\mu N)}}{\mathrm{Tr}\,e^{-\beta(H-\mu N)}} \qquad (10.142)$$

where N is now the number operator for the system.

The exponential operators in both cases (canonical and grand
canonical ensemble) are to be understood in the sense of series expansions.
It is sometimes convenient to take the subsystems to be single
particles (e.g., single electrons in an electron gas or a metal, or molecules
in a gas). We again define the denominator, $\mathrm{Tr}\,e^{-\beta(H-\mu N)}$, as Z.


14. Bose-Einstein and Fermi- 
Dirac Distributions 


Let us calculate, for a many-particle system, the mean number of
particles in a given single-particle state $|\alpha\rangle$, that is,

$$\langle n_\alpha\rangle = \mathrm{Tr}\,\rho\,c_\alpha^\dagger c_\alpha \qquad (10.143)$$

We of course use as a basis the states of occupation of the single-particle
states of which $|\alpha\rangle$ is one. Now if we work with the grand canonical
ensemble, so that we do not assume the total number of particles to be
known,

$$\mathrm{Tr}\,\rho\,c_\alpha^\dagger c_\alpha = \frac{1}{Z}\,\mathrm{Tr}\,c_\alpha\,e^{-\beta(H-\mu N)}\,c_\alpha^\dagger \qquad (10.144)$$

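But

$$e^{-\beta(H-\mu N)}\,c_\alpha^\dagger = e^{-\beta(E_\alpha-\mu)}\,c_\alpha^\dagger\,e^{-\beta(H-\mu N)} \qquad (10.145)$$

since the state created by operating with $c_\alpha^\dagger$ first has one additional
particle and an extra energy $E_\alpha$ more than the one on which it operates.
Therefore,

$$\langle n_\alpha\rangle = \frac{1}{Z}\,e^{-\beta(E_\alpha-\mu)}\,\mathrm{Tr}\,c_\alpha c_\alpha^\dagger\,e^{-\beta(H-\mu N)}$$
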
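Using the commutation rule

$$c_\alpha c_\alpha^\dagger + \eta\,c_\alpha^\dagger c_\alpha = 1$$

where $\eta = +1$ or $-1$ according as the particles are fermions or bosons,
we obtain

$$\langle n_\alpha\rangle = \frac{1}{Z}\,e^{-\beta(E_\alpha-\mu)}\,\mathrm{Tr}\,(1-\eta\,c_\alpha^\dagger c_\alpha)\,e^{-\beta(H-\mu N)} = e^{-\beta(E_\alpha-\mu)} - \eta\,e^{-\beta(E_\alpha-\mu)}\,\langle n_\alpha\rangle$$

It follows that

$$\langle n_\alpha\rangle = \frac{e^{-\beta(E_\alpha-\mu)}}{1+\eta\,e^{-\beta(E_\alpha-\mu)}} = \frac{1}{e^{\beta(E_\alpha-\mu)}+\eta} \qquad (10.146)$$

For $\eta = 1$ this is the Fermi-Dirac distribution; for $\eta = -1$, the
Bose-Einstein.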
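The result (10.146) can be checked numerically in the simplest setting. The sketch below assumes non-interacting particles, so that the grand canonical trace factorizes and the mean occupation of one single-particle state involves only a sum over that state's own occupation numbers. The function names, the numerical values, and the truncation `n_max` of the boson sum are hypothetical choices for illustration.

```python
import numpy as np

def mean_occupation_from_trace(E, mu, beta, eta, n_max=2000):
    """<n> = Tr(rho n) for one single-particle state, summed over occupation numbers.

    eta = +1: fermions (n = 0, 1); eta = -1: bosons (n = 0, 1, ..., n_max).
    The boson sum converges only for E > mu.
    """
    n = np.arange(2) if eta == 1 else np.arange(n_max + 1)
    weights = np.exp(-beta * (E - mu) * n)  # Boltzmann factor of each occupation
    return float((n * weights).sum() / weights.sum())

def mean_occupation_closed_form(E, mu, beta, eta):
    """Eq. (10.146): 1 / (exp(beta*(E - mu)) + eta)."""
    return float(1.0 / (np.exp(beta * (E - mu)) + eta))

E, mu, beta = 1.0, 0.5, 1.0
fermi_trace = mean_occupation_from_trace(E, mu, beta, eta=+1)
fermi_formula = mean_occupation_closed_form(E, mu, beta, eta=+1)
bose_trace = mean_occupation_from_trace(E, mu, beta, eta=-1)
bose_formula = mean_occupation_closed_form(E, mu, beta, eta=-1)
print(fermi_trace, fermi_formula, bose_trace, bose_formula)
```

The direct sums and the closed forms agree, with the fermion value below 1 and the boson value unrestricted.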
The chemical potential $\mu$ is determined by the condition

$$\sum_\alpha \langle n_\alpha\rangle = N \qquad (10.147)$$

where N is the total number of particles. (Note that the chemical potential
is determined by the mean number of particles in the systems of the
ensemble.) If now $N(E_\alpha)$ is the density of single-particle states, this
condition becomes

$$\int N(E_\alpha)\,\langle n_\alpha\rangle\,dE_\alpha = N \qquad (10.148)$$
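In practice condition (10.147) is solved numerically for the chemical potential. A minimal sketch for fermions follows, using bisection, which works because the total occupation increases monotonically with the chemical potential. The equally spaced level scheme, the particle number, and the function names are assumptions made for the example only.

```python
import numpy as np

def fermi(E, mu, beta):
    """Fermi function 1 / (exp(beta*(E - mu)) + 1), clipped to avoid overflow."""
    x = np.clip(beta * (E - mu), -700.0, 700.0)
    return 1.0 / (np.exp(x) + 1.0)

def chemical_potential(levels, n_particles, beta, tol=1e-12):
    """Solve sum_alpha <n_alpha> = N for mu by bisection."""
    lo = levels.min() - 50.0 / beta   # total occupation is essentially 0 here
    hi = levels.max() + 50.0 / beta   # every level is essentially full here
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if fermi(levels, mid, beta).sum() < n_particles:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

levels = np.arange(10, dtype=float)   # hypothetical levels 0, 1, ..., 9
beta = 2.0
mu = chemical_potential(levels, n_particles=5, beta=beta)
total = fermi(levels, mu, beta).sum()
print(mu, total)
```

For this symmetric level scheme at half filling the solution lies midway between the fifth and sixth levels, at 4.5, as particle-hole symmetry requires.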


For fermions, $\langle n_\alpha\rangle$ must always be $\le 1$; it is therefore the probability
of occupancy of a state and is designated $f(E_\alpha)$, which is called the
"Fermi function."

It is important to distinguish here between the $p_n$ of (10.111) and the
Fermi function $f(E_\alpha)$, particularly when the subsystem is a single particle.
The probability $p_n$ is an a priori probability, while $f(E_\alpha)$ is the probability
of occupancy of state $E_\alpha$ in a many-body system, taking account of the
commutation properties of the particle operators which embody the
assumption of indistinguishability of particles and, in the case of fermions,
of the exclusion principle.


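Problem 10-11: Show that, for the Fermi-Dirac
distribution

$$\Delta n_\alpha = \frac{e^{(1/2)\beta(E_\alpha-\mu)}}{e^{\beta(E_\alpha-\mu)}+1}$$

and for the Bose-Einstein distribution

$$\Delta n_\alpha = \frac{e^{(1/2)\beta(E_\alpha-\mu)}}{e^{\beta(E_\alpha-\mu)}-1}$$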




15. Linear Response Theory 


We consider in this section a problem analogous to certain classical 
problems which have been dealt with in earlier chapters; that is, the 
problem of the response of a system to an external stimulus. The systems 
we consider are many-body ones, described by the statistical operator. 

If the Hamiltonian of the system can be written

$$H = H_0 + H_1 F(t) \qquad (10.149)$$

where $H_0$ is taken to describe the unperturbed system and $H_1 F(t)$ the
external stimulus, it is possible to write $\rho$ in the form

$$\rho = \rho_0 + \rho_1 \qquad (10.150)$$

where

$$i\,\frac{\partial \rho_0}{\partial t} = [H_0, \rho_0] \qquad (10.151)$$

If the system is in a steady state

$$[H_0, \rho_0] = 0 \qquad (10.152)$$


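The essence of the theory of linear response is that we treat the
perturbation only to first order; the response $\rho_1$ is therefore also kept
only to first order. The equation for $\rho_1$ is, then,

$$i\,\frac{\partial \rho_1}{\partial t} = [H_0, \rho_1] + F(t)\,[H_1, \rho_0] \qquad (10.153)$$

The substitution

$$\rho_1 = e^{-iH_0 t}\,f\,e^{iH_0 t} \qquad (10.154)$$

transforms the equation to

$$e^{-iH_0 t}\,i\,\frac{\partial f}{\partial t}\,e^{iH_0 t} = F(t)\,[H_1, \rho_0]$$

or

$$i\,\frac{\partial f}{\partial t} = F(t)\,[H_1(t), \rho_0] \qquad (10.155)$$

$H_1(t)$ being the time-dependent operator

$$H_1(t) = e^{iH_0 t}\,H_1\,e^{-iH_0 t} \qquad (10.156)$$

Integration of Eq. (10.155) gives

$$f = -i\int_{-\infty}^{t} F(t')\,[H_1(t'), \rho_0]\,dt' \qquad (10.157)$$

where it has been assumed that the perturbation was zero at time $-\infty$.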




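Consider now any dynamical variable A whose value is zero in the
unperturbed state; its value in the system subject to the external stimulus
is

$$\langle A\rangle = \mathrm{Tr}\,\rho A = \mathrm{Tr}\,e^{-iH_0 t}\,f\,e^{iH_0 t}\,A = \mathrm{Tr}\,f A(t) \qquad (10.158)$$

where

$$A(t) = e^{iH_0 t}\,A\,e^{-iH_0 t} \qquad (10.159)$$

To simplify (10.158), we note that

$$\mathrm{Tr}\,[H_1(t'), \rho_0]\,A(t) = \mathrm{Tr}\,H_1(t')\,\rho_0\,A(t) - \mathrm{Tr}\,\rho_0\,H_1(t')\,A(t) = \mathrm{Tr}\,\rho_0\,[A(t), H_1(t')] \qquad (10.160)$$

Therefore, finally

$$\langle A\rangle = -i\,\mathrm{Tr}\,\rho_0\int_{-\infty}^{t} F(t')\,[A(t), H_1(t')]\,dt' \qquad (10.161)$$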


This is the linear response formula, originally due to Kubo. 
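A numerical cross-check of (10.161) may be helpful; it is a sketch under stated assumptions, not part of the original treatment. We take a hypothetical two-level system with $H_0 = \mathrm{diag}(0, 1)$, $H_1 = A = \sigma_x$, a thermal $\rho_0$, and a constant perturbation of strength `eps` switched on at t = 0 (units with $\hbar = 1$). The Kubo integral is compared with the exact evolution $\rho(t) = U\rho_0 U^\dagger$ and with the closed form $-2\,\mathrm{eps}\,\tanh(\beta/2)(1-\cos t)$ that the integral gives for this particular model.

```python
import numpy as np

def heisenberg(op, energies, t):
    """e^{i H0 t} op e^{-i H0 t} for a diagonal H0 with the given energies."""
    phase = np.exp(1j * energies * t)
    return phase[:, None] * op * phase.conj()[None, :]

def kubo_response(A, H1, energies, rho0, F, t, steps=2001):
    """<A>(t) = -i Tr rho0 * integral_0^t F(t') [A(t), H1(t')] dt' (F = 0 for t' < 0)."""
    ts = np.linspace(0.0, t, steps)
    A_t = heisenberg(A, energies, t)
    integrand = np.empty(steps, dtype=complex)
    for j, tp in enumerate(ts):
        H1_tp = heisenberg(H1, energies, tp)
        integrand[j] = F(tp) * np.trace(rho0 @ (A_t @ H1_tp - H1_tp @ A_t))
    dt = ts[1] - ts[0]
    # Trapezoidal rule written out explicitly.
    integral = dt * (integrand.sum() - 0.5 * (integrand[0] + integrand[-1]))
    return (-1j * integral).real

def exact_response(A, H1, energies, rho0, eps, t):
    """Tr(U rho0 U^dagger A) with U = exp(-i (H0 + eps*H1) t)."""
    w, V = np.linalg.eigh(np.diag(energies) + eps * H1)
    U = (V * np.exp(-1j * w * t)) @ V.conj().T
    return np.trace(U @ rho0 @ U.conj().T @ A).real

energies = np.array([0.0, 1.0])
sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
beta, eps, t = 1.0, 1e-3, 2.0
p = np.exp(-beta * energies)
rho0 = np.diag(p / p.sum())           # thermal state; Tr(rho0 sigma_x) = 0

kubo = kubo_response(sigma_x, sigma_x, energies, rho0, lambda tp: eps, t)
exact = exact_response(sigma_x, sigma_x, energies, rho0, eps, t)
analytic = -2.0 * eps * np.tanh(0.5 * beta) * (1.0 - np.cos(t))
print(kubo, exact, analytic)
```

The Kubo value reproduces the closed form to quadrature accuracy and differs from the exact evolution only by terms of higher order in `eps`.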

The question of the convergence of (10.161) at the lower limit $-\infty$
is worthy of consideration. If F(t) is "turned on" suddenly at some
finite time (say t = 0) the use of (10.161) is straightforward. Since the
relation between $\langle A\rangle$ and $F(t')$ is linear, if we express F in terms of its
Fourier components

$$F(t') = \int F(\omega)\,e^{i\omega t'}\,d\omega \qquad (10.162)$$

the contributions from the different frequencies $\omega$ are additive. Consider
then the case of a single frequency

$$F(t') = e^{i\omega t'} \qquad (10.163)$$

In this case the integral (10.161) does not converge at the lower limit.
Suppose, however, that the perturbation has been turned on slowly from
some time in the distant past. This may be represented mathematically
by multiplying (10.163) by $e^{\varepsilon t'}$, where we may if we wish take $\varepsilon$
vanishingly small. By this device, convergence is assured.

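Making use of the fact that $\rho_0$ commutes with $H_0$, and that the order
of two terms in the trace of a product may be reversed, we may write
alternative forms for $\langle A\rangle$:

$$\langle A\rangle = -i\lim_{\varepsilon\to 0} e^{i(\omega-i\varepsilon)t}\,\mathrm{Tr}\int_{-\infty}^{t} e^{i(\omega-i\varepsilon)(t'-t)}\,e^{iH_0 t'}\rho_0\,e^{-iH_0 t'}\,[A(t), H_1(t')]\,dt'$$

$$\phantom{\langle A\rangle} = -i\lim_{\varepsilon\to 0} e^{i(\omega-i\varepsilon)t}\,\mathrm{Tr}\,\rho_0\int_{-\infty}^{t} e^{i(\omega-i\varepsilon)(t'-t)}\,[A(t-t'), H_1]\,dt'$$

$$\phantom{\langle A\rangle} = -i\,e^{i\omega t}\,\mathrm{Tr}\,\rho_0\lim_{\varepsilon\to 0}\int_{0}^{\infty} e^{-i(\omega-i\varepsilon)\tau}\,[A(\tau), H_1]\,d\tau \qquad (10.164)$$

where, in the last line, the new variable $\tau = t-t'$ has been introduced.
We see, then, that the response has the same periodic time variation as
the applied perturbation.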


16. An Example: Transverse 
Conductivity of an Electron Gas 


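As an illustration of the linear response formalism, we calculate the
conductivity of an electron gas in a transverse field

$$\mathbf{E} = E_0\,\hat{\mathbf e}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)} \qquad (10.165)$$

where $\mathbf q\cdot\hat{\mathbf e} = 0$. The field may be expressed in terms of a vector potential

$$\mathbf{A} = \frac{c}{i\omega}\,E_0\,\hat{\mathbf e}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)} \qquad (10.166)$$

where $\nabla\cdot\mathbf A = i\mathbf q\cdot\mathbf A = 0$. Since the one-particle Hamiltonian of an
electron of charge $-e$ in a field described by vector potential $\mathbf A$ is

$$H = \frac{1}{2m}\left(\mathbf p + \frac{e}{c}\mathbf A\right)^2 = \frac{p^2}{2m} + \frac{e}{2mc}\,(\mathbf p\cdot\mathbf A + \mathbf A\cdot\mathbf p) \qquad (10.167)$$

to the first order in the field, the many-body Hamiltonian splits into two
parts, an unperturbed one

$$H_0 = \int \psi^\dagger(\mathbf r')\,\frac{p'^2}{2m}\,\psi(\mathbf r')\,d^3r' = \sum_{\mathbf k} \frac{\hbar^2 k^2}{2m}\,c_{\mathbf k}^\dagger c_{\mathbf k} \qquad (10.168)$$

and the perturbation

$$H_1 F(t) = -\frac{ie\hbar}{2m\omega\Omega}\,E_0\,e^{-i\omega t}\sum_{\mathbf K,\mathbf K'} c_{\mathbf K'}^\dagger c_{\mathbf K}\int e^{-i\mathbf K'\cdot\mathbf r'}\,[(2\mathbf K+\mathbf q)\cdot\hat{\mathbf e}]\,e^{i(\mathbf K+\mathbf q)\cdot\mathbf r'}\,d^3r'$$

$$\phantom{H_1 F(t)} = -\frac{ie\hbar}{2m\omega}\,E_0\,e^{-i\omega t}\sum_{\mathbf K} c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf K}\,(2\mathbf K+\mathbf q)\cdot\hat{\mathbf e} \qquad (10.169)$$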


Problem 10-12: Kubo has developed a general 
formula for electrical conductivity along the following lines: If X is the 




electrical polarization of the electrons, we may take 
H, ar E 7 ».¢ eiwt 
From (10.157) and (10.158),

$$\langle j_\mu(t)\rangle = -i\,\mathrm{Tr}\int_{-\infty}^{t} F(t')\,[H_1(t'), \rho_0]\,j_\mu(t)\,dt'$$


Now
(i) Show that

$$[H_1(t'), \rho_0] = -i\rho_0\int_0^{\beta} \dot H_1(t'-i\lambda)\,d\lambda$$

where the dot indicates a time derivative.

(ii) Noting that $\dot X_\nu = j_\nu$, and making a change of variable of integration,
show that the conductivity tensor is given by


B —id 
(jut) =Tr fo da fo F(t+7+id)poie() ju(0) dr E, 
(iii) Putting $F(t) = e^{i\omega t}$, show that


1— e-& [* ; ; ; 
Ou = SE | inl7)jul0)) em a 


where

$$\langle j_\nu(\tau)\,j_\mu(0)\rangle = \mathrm{Tr}\,\rho_0\,j_\nu(\tau)\,j_\mu(0)$$


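where $\Omega$ is the volume within which the system is enclosed. Also, to be
better able to check the dimensionality of our formulas, we have reintroduced
the quantum constant $\hbar$ throughout. Now the many-particle operator
for the current density is

$$\mathbf J = -\frac{e}{2m}\int \psi^\dagger(\mathbf r')\left[\left(\mathbf p' + \frac{e}{c}\mathbf A(\mathbf r')\right)\delta(\mathbf r'-\mathbf r) + \delta(\mathbf r'-\mathbf r)\left(\mathbf p' + \frac{e}{c}\mathbf A(\mathbf r')\right)\right]\psi(\mathbf r')\,d^3r' = \mathbf J_p + \mathbf J_d \qquad (10.170)$$

where $\mathbf J_p$ is given by

$$\mathbf J_p = -\frac{e}{2m}\int \psi^\dagger(\mathbf r')\,[\mathbf p'\,\delta(\mathbf r'-\mathbf r) + \delta(\mathbf r'-\mathbf r)\,\mathbf p']\,\psi(\mathbf r')\,d^3r' \qquad (10.171)$$

and

$$\mathbf J_d = -\frac{e^2}{mc}\,\psi^\dagger(\mathbf r)\,\mathbf A(\mathbf r)\,\psi(\mathbf r) \qquad (10.172)$$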


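Again, putting

$$\psi = \Omega^{-1/2}\sum_{\mathbf k} c_{\mathbf k}\,e^{i\mathbf k\cdot\mathbf r}, \text{ etc.,}$$

these become

$$\mathbf J_p = -\frac{e}{2m\Omega}\sum_{\mathbf k,\mathbf k'} c_{\mathbf k'}^\dagger c_{\mathbf k}\int e^{-i\mathbf k'\cdot\mathbf r'}\,[\mathbf p'\,\delta(\mathbf r'-\mathbf r) + \delta(\mathbf r'-\mathbf r)\,\mathbf p']\,e^{i\mathbf k\cdot\mathbf r'}\,d^3r'$$

$$\phantom{\mathbf J_p} = -\frac{e\hbar}{2m\Omega}\sum_{\mathbf k,\mathbf k'} c_{\mathbf k'}^\dagger c_{\mathbf k}\,(\mathbf k+\mathbf k')\,e^{i(\mathbf k-\mathbf k')\cdot\mathbf r} \qquad (10.173)$$

and

$$\mathbf J_d = \frac{ie^2}{m\omega\Omega}\,E_0\,\hat{\mathbf e}\sum_{\mathbf k,\mathbf k'} c_{\mathbf k'}^\dagger c_{\mathbf k}\,e^{i(\mathbf k-\mathbf k')\cdot\mathbf r}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)} \qquad (10.174)$$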


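The mean value of the current density is now seen to be made up of
two parts. The contribution from $\mathbf J_p$ must be calculated from linear
response theory to first order in the field, using (10.161). Since $\mathbf J_d$ is
already of first order in the field, its contribution to the mean current
density is

$$\mathrm{Tr}\,\rho_0\,\mathbf J_d = \frac{ie^2}{m\omega\Omega}\,E_0\,\hat{\mathbf e}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)}\sum_{\mathbf k}\langle n_{\mathbf k}\rangle = \frac{iNe^2}{m\omega\Omega}\,E_0\,\hat{\mathbf e}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)}$$

where N is the total number of electrons. This gives a contribution to the
conductivity

$$\sigma_d = \frac{iNe^2}{m\omega\Omega} \qquad (10.175)$$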
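By (10.161), the mean value of the other part of the current density is

$$\langle \mathbf J_p\rangle = -\frac{i}{\hbar}\,\mathrm{Tr}\,\rho_0\int_{-\infty}^{t} e^{-i\omega t'}\,[\mathbf J_p(t), H_1(t')]\,dt' \qquad (10.176)$$

But

$$\mathbf J_p(t) = -\frac{e\hbar}{2m\Omega}\sum_{\mathbf k,\mathbf k'} e^{iH_0 t/\hbar}\,c_{\mathbf k'}^\dagger c_{\mathbf k}\,e^{-iH_0 t/\hbar}\,(\mathbf k+\mathbf k')\,e^{i(\mathbf k-\mathbf k')\cdot\mathbf r}$$

$$\phantom{\mathbf J_p(t)} = -\frac{e\hbar}{2m\Omega}\sum_{\mathbf k,\mathbf k'} e^{(i/\hbar)(\epsilon_{\mathbf k'}-\epsilon_{\mathbf k})t}\,c_{\mathbf k'}^\dagger c_{\mathbf k}\,(\mathbf k+\mathbf k')\,e^{i(\mathbf k-\mathbf k')\cdot\mathbf r} \qquad (10.177)$$

and

$$H_1(t') = -\frac{ie\hbar}{2m\omega}\,E_0\sum_{\mathbf K} e^{iH_0 t'/\hbar}\,c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf K}\,e^{-iH_0 t'/\hbar}\,(2\mathbf K+\mathbf q)\cdot\hat{\mathbf e}$$

$$\phantom{H_1(t')} = -\frac{ie\hbar}{2m\omega}\,E_0\sum_{\mathbf K} e^{(i/\hbar)(\epsilon_{\mathbf K+\mathbf q}-\epsilon_{\mathbf K})t'}\,c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf K}\,(2\mathbf K+\mathbf q)\cdot\hat{\mathbf e} \qquad (10.178)$$

Therefore,

$$\langle \mathbf J_p\rangle = \frac{e^2\hbar}{4m^2\omega\Omega}\,E_0\sum_{\mathbf k,\mathbf k',\mathbf K} (\mathbf k+\mathbf k')\,[(2\mathbf K+\mathbf q)\cdot\hat{\mathbf e}]\,e^{i(\mathbf k-\mathbf k')\cdot\mathbf r}\ \mathrm{Tr}\,\rho_0\,[c_{\mathbf k'}^\dagger c_{\mathbf k},\ c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf K}]$$

$$\qquad\times \lim_{\varepsilon\to 0}\int_{-\infty}^{t} e^{-i(\omega+i\varepsilon)t'}\,e^{(i/\hbar)(\epsilon_{\mathbf K+\mathbf q}-\epsilon_{\mathbf K})t'}\,e^{(i/\hbar)(\epsilon_{\mathbf k'}-\epsilon_{\mathbf k})t}\,dt' \qquad (10.179)$$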


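By simple algebra, using the anticommutation relations

$$[c_{\mathbf k'}^\dagger c_{\mathbf k},\ c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf K}] = \delta_{\mathbf k,\mathbf K+\mathbf q}\,c_{\mathbf k'}^\dagger c_{\mathbf K} - \delta_{\mathbf k',\mathbf K}\,c_{\mathbf K+\mathbf q}^\dagger c_{\mathbf k} \qquad (10.180)$$

The trace of $\rho_0$ times this is

$$\delta_{\mathbf k,\mathbf K+\mathbf q}\,\delta_{\mathbf K,\mathbf k'}\,[f(\epsilon_{\mathbf k-\mathbf q}) - f(\epsilon_{\mathbf k})] \qquad (10.181)$$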


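$f(\epsilon)$ being the Fermi distribution function defined above. If this is
substituted in (10.179), the equation for the mean current density becomes

$$\langle \mathbf J\rangle = \frac{iNe^2}{m\omega\Omega}\,E_0\,\hat{\mathbf e}\,e^{i(\mathbf q\cdot\mathbf r-\omega t)} + \frac{ie^2\hbar}{4m^2\omega\Omega}\,E_0\,e^{i(\mathbf q\cdot\mathbf r-\omega t)}\sum_{\mathbf k} [f(\epsilon_{\mathbf k-\mathbf q}) - f(\epsilon_{\mathbf k})]\,\frac{(2\mathbf k-\mathbf q)\,[(2\mathbf k-\mathbf q)\cdot\hat{\mathbf e}]}{\omega - (1/\hbar)(\epsilon_{\mathbf k}-\epsilon_{\mathbf k-\mathbf q})} \qquad (10.182)$$

By virtue of the fact that $\mathbf q\cdot\hat{\mathbf e} = 0$ the second term may be simplified.
For the sum to be different from zero, the summand must be even in
$\mathbf k\cdot\hat{\mathbf e}$. Therefore only the component of $\langle \mathbf J\rangle$ in the direction of $\hat{\mathbf e}$ is
nonzero, the current is in the direction of the field, and the conductivity
is a scalar. Making use of this fact and replacing the sum by an integral
(with a factor 2 for spin), we obtain for the conductivity the formula

$$\sigma = \frac{iNe^2}{m\omega\Omega} + \frac{ie^2\hbar}{4\pi^3 m^2\omega}\int [f(\epsilon_{\mathbf k-\mathbf q}) - f(\epsilon_{\mathbf k})]\,\frac{(\mathbf k\cdot\hat{\mathbf e})^2}{\omega - (1/\hbar)(\epsilon_{\mathbf k}-\epsilon_{\mathbf k-\mathbf q})}\,d^3k \qquad (10.183)$$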


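The second term goes to zero as $q \to 0$, that is, in the long-wavelength
limit. It is in general very complicated, though it is not too difficult to get
the leading term in an expansion in powers of q, which is of order $q^2$.
If the integral in (10.183) is broken into two integrals, and in the first
one makes the substitution

$$\mathbf k = -\mathbf k' + \mathbf q$$

and then drops the prime on the variable of integration, the two integrals
combine into

$$\frac{2}{\hbar}\int f(\epsilon_{\mathbf k})\,(\mathbf k\cdot\hat{\mathbf e})^2\,\frac{\epsilon_{\mathbf k-\mathbf q}-\epsilon_{\mathbf k}}{\omega^2 - (1/\hbar^2)(\epsilon_{\mathbf k}-\epsilon_{\mathbf k-\mathbf q})^2}\,d^3k \qquad (10.184)$$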




Expanding the numerator, we see that the term $(\hbar^2/m)\,\mathbf k\cdot\mathbf q$, which
is the only term of order q, vanishes by symmetry. The remaining term,
$\hbar^2 q^2/2m$, is already of order $q^2$, so that to this order the denominator may
be replaced simply by $\omega^2$. The expression (10.184) then takes the approximate
form

$$\frac{\hbar q^2}{m\omega^2}\int f(\epsilon_{\mathbf k})\,(\mathbf k\cdot\hat{\mathbf e})^2\,d^3k \qquad (10.185)$$


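At zero temperature, f = 1 for $k < k_F$, zero for $k > k_F$. In that case
(10.185) becomes

$$\frac{\hbar q^2}{m\omega^2}\,\frac{4\pi}{15}\,k_F^5$$

Finally, substituting in (10.183), we obtain the conductivity up to terms
quadratic in q:

$$\sigma = \frac{iNe^2}{m\omega\Omega}\left(1 + \frac{1}{5}\,\frac{\hbar^2 k_F^2 q^2}{m^2\omega^2}\right) \qquad (10.186)$$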
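The small-q estimate can be tested numerically. The sketch below evaluates, at zero temperature, an integral of the form $\int f(\epsilon_k)(\mathbf k\cdot\hat{\mathbf e})^2\,2(\epsilon_{k-q}-\epsilon_k)/[\omega^2-(\epsilon_k-\epsilon_{k-q})^2]\,d^3k$ and compares it with the leading term $(q^2/\omega^2)(4\pi/15)k_F^5$. It assumes units with $\hbar = m = 1$, free-particle energies $\epsilon_k = k^2/2$, $\mathbf q$ along the polar axis and $\hat{\mathbf e}$ perpendicular to it, and a frequency large enough that the denominator never vanishes; the function name and parameter values are hypothetical.

```python
import numpy as np

def paramagnetic_integral(q, omega, k_f=1.0, order=64):
    """Zero-temperature integral over the Fermi sphere, in units hbar = m = 1.

    Integrand: (k.e)^2 * 2*(eps_{k-q} - eps_k) / (omega^2 - (eps_k - eps_{k-q})^2),
    with q along z and e along x, so (k.e)^2 = k^2 (1 - u^2) cos^2(phi), u = cos(theta).
    The phi integration of cos^2(phi) contributes a factor pi.
    """
    x, w = np.polynomial.legendre.leggauss(order)
    k = 0.5 * k_f * (x + 1.0)          # Gauss-Legendre nodes mapped to [0, k_f]
    wk = 0.5 * k_f * w
    K, U = np.meshgrid(k, x, indexing="ij")
    delta = K * q * U - 0.5 * q**2     # eps_k - eps_{k-q}
    integrand = K**4 * (1.0 - U**2) * (-2.0 * delta) / (omega**2 - delta**2)
    return np.pi * np.einsum("i,j,ij->", wk, w, integrand)

q, omega = 0.05, 1.0
numeric = paramagnetic_integral(q, omega)
leading = (q**2 / omega**2) * 4.0 * np.pi / 15.0
print(numeric, leading, numeric / leading)
```

For these values the ratio differs from unity only by a correction of relative order $q^2 k_F^2/\omega^2$, consistent with the neglected higher terms in the expansion.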


17. Propagation of 
Electromagnetic Waves 
in an Electron Gas 


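In Chapter 5, Eq. (5.550), we gave an equation for the dielectric function
of a material consisting of free electrons and other polarizable
components (ions, valence electrons). If we consider only frequencies
low enough that the variation with $\omega$ of the contribution $\epsilon_1$ from these
other components is negligible, and substitute the conductivity (10.186),
we obtain for the frequency-dependent transverse dielectric response

$$\epsilon = \epsilon_1 + \frac{4\pi i}{\omega}\,\frac{iNe^2}{m\omega\Omega}\left(1 + \frac{1}{5}\,\frac{\hbar^2 k_F^2 q^2}{m^2\omega^2}\right) = \epsilon_1\left\{1 - \frac{\omega_p^2}{\omega^2}\left(1 + \frac{1}{5}\,\frac{\hbar^2 k_F^2 q^2}{m^2\omega^2}\right)\right\} \qquad (10.187)$$

where $\omega_p^2$ is as defined in (5.576). The dispersion relation for waves in
the medium is then given by (5.580),

$$\frac{c^2 q^2}{\omega^2} = \epsilon_1\left[1 - \frac{\omega_p^2}{\omega^2}\left(1 + \frac{1}{5}\,\frac{\hbar^2 k_F^2 q^2}{m^2\omega^2}\right)\right] \qquad (10.188)$$

or

$$c^2 q^2 = \frac{\epsilon_1\,(\omega^2-\omega_p^2)}{1 + (1/5)\,\epsilon_1\,(\omega_p^2/\omega^2)\,(\hbar^2 k_F^2/m^2 c^2)} \qquad (10.189)$$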




It is clear, since $\hbar k_F \ll mc$ for real electron densities, that the second
term in the denominator is very small for $\omega < \omega_p$.

It follows also that waves can only be propagated for w greater than 
the plasma frequency w,. Below that frequency, electromagnetic energy 
is dissipated in creating individual electron excitations. Above it, the 
wave is a coupled mode of particles and electromagnetic field; the motion 
of the particles creates the field which in turn governs their motion. 
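The content of (10.188) and (10.189) can be illustrated with a short calculation. The sketch below evaluates the right-hand side of (10.189) above and below the plasma frequency and checks that it satisfies the implicit relation (10.188). Frequencies are measured in units of $\omega_p$; the parameter `a`, standing for $(\hbar k_F/mc)^2$, and its value $10^{-5}$ are assumptions chosen only to be small, as the text requires.

```python
import numpy as np

def c2q2(omega, omega_p=1.0, eps1=1.0, a=1e-5):
    """Right-hand side of Eq. (10.189); a stands for (hbar*k_F / (m*c))**2."""
    return eps1 * (omega**2 - omega_p**2) / (
        1.0 + 0.2 * eps1 * (omega_p**2 / omega**2) * a
    )

def residual_10_188(omega, omega_p=1.0, eps1=1.0, a=1e-5):
    """Left side minus right side of Eq. (10.188), using c^2 q^2 from (10.189)."""
    x = c2q2(omega, omega_p, eps1, a) / omega**2          # c^2 q^2 / omega^2
    rhs = eps1 * (1.0 - (omega_p**2 / omega**2) * (1.0 + 0.2 * a * x))
    return x - rhs

above = c2q2(1.5)   # omega > omega_p: q^2 > 0, a propagating wave
below = c2q2(0.5)   # omega < omega_p: q^2 < 0, no propagating wave
print(above, below, residual_10_188(1.5), residual_10_188(0.5))
```

A positive value corresponds to real q, a negative value to imaginary q, which is the statement that waves propagate only above the plasma frequency.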


INDEX 




A priori probability, 416 
Advanced potentials, 355 
Algebra of vectors, 72ff 
Allowed frequencies, 25ff 
Ampere’s Law, 104 
Angular momenta 
addition of, 500, 520ff
Angular momentum, 499 
commutation relations for operators,
510-512
ladder operator methods, 512-513 
matrix representation of operators, 515- 
517 
operators in polar coordinates, 513-515 
and spherical harmonics, 515 
stretched state, 513 
Annihilation operator, 505, 565ff
Associated Legendre Functions 
normalization integral, 121 
recursion relation for, 145 
Autocorrelation function, 414 


Basis 
orthonormal, 67 
of vectors, 67, 74 


Bessel equation, 33, 151, 211, 221
second solution, 215
Bessel function, 42, 211
asymptotic expansions, 220
general properties, 212-214
integral representations, 213, 223
integrals, 222 
large order, 343-344 
in radial wave equation, 225 
recursion relations, 221 
spherical, 220 
and waves in cylinder, 242-243 
zeros of, 214, 215-217 
Beta function, 151, 198-199 
Bohr, Peierls and Placzek 
optical theorem for scattering, 227, 277 
Boltzmann constant, 593 
Bose-Einstein distribution function, 561 
derivation of, 594-595 
Bosons, 559, 561, 564 
commutation relations for, 559, 565 
energy of non-interacting, 566 
number operator for, 566 
Boundary conditions 
for eigenvalue problems, 34 
in string problem, 11




Bow waves, 226, 261, 324 
Bra vectors, 72 
Breit-Wigner formula, 501, 557 


Canonical ensemble 
density matrix of, 592-594 
Causal functions, 149, 161 
Central limit theorem, 414, 431 
Cerenkov radiation, 228, 324ff 
fields of, 326 
and supersonic sound waves, 261 
Charge distribution 
in electrostatics, 134 
Charged liquid drop 
Coulomb energy, 319-320 
energy radiated, 321 
kinetic energy, 317 
multipole moments, 321 
radiation from, 316ff
stability, 320 
surface energy, 318-319 
Charged particle 
force on, in electromagnetic theory, 292 
in magnetic field, 500 
Chemical potential, 594 
Clebsch-Gordan coefficients, 500, 521ff
calculation in particular cases, 523-528
Case $j_1 = 1/2$, $j_2 = 1/2$, 523-524
Case $j_1 = 1$, $j_2 = 1/2$, 524-525
Case $j_1 = 1$, $j_2 = 1$, 525-527
technique of calculation, 522-523
“Closed loop” processes, 444
Coherence effects in superconductors, 353 
Coherence length in superconductors, 353, 
406 
Coin-tossing problem, 416-420 
Commutation rules 
for bosons, 565 
for fermions, 567 
for ψ-operators, 568
Commutator, 451, 454-456 
Completeness, 89 
of eigenfunctions, 45-46 
Complex potential 
in scattering problems, 554 
Condensation 
of gas, 225 
in sound waves, 234 
Conductivity 
of electron gas, 562 
Confluent hypergeometric equation, 201ff 


equation reducible to, 211 
second solution for, 204 
Confluent hypergeometric function, 151, 
201ff 
and Hermite functions, 151, 207-210 
integral representation, 201, 202 
and Laguerre functions, 151, 205-206 
series for, 201-202 
and three-dimensional harmonic oscil- 
lator, 501 
Conservation of momentum 
in scattering, 571 
Continuity, equation of 
connection with Maxwell’s equations, 283 
in electromagnetic theory, 282 
Continuous spectrum 
of eigenvalues, 44 
Continuous variables 
in quantum mechanics, 461ff
Correlation functions, 149, 164 
autocorrelation function, 164-165, 441 
cross-correlation, 165-166 
Cosmic ray shower 
fluctuations in, 414, 434-436 
Coulomb gauge, 301 
Creation operation, 505, 507 
Creation-annihilation formalism, 499 
Critical conditions 
for neutron chain reactors, 352, 387, 389 
Cross-section in scattering problems 
for absorption, 276-277, 552, 555 
definition, 274 
in electromagnetic theory, 226 
inequalities satisfied by, 277-280, 553 
of partial waves, 275, 277-280, 553 
for scattering, 274-275, 551-552, 554 
total, 277, 553
Curvilinear coordinate systems, 97 
Current operator 
“diamagnetic” current, 599-600
“paramagnetic” current, 599-600
in second quantization formalism, 599, 
600 


Dead time 

in radioactive counting, 413, 425 
Decay constant 

of quantum state, 485 
Delayed neutrons 

in chain reactors, 386, 389, 390 
Delta-function, 44, 176-177 


Density fluctuations in gas, 421 
Density matrix, 561, 587-594 

of canonical ensemble, 561, 592-594 

definition, 587 

Fano’s approach, 590-591 

of grand canonical ensemble, 561, 594 

properties of, 588-591 

and statistical mean values, 588 

time variation, 588-589 

unitary transformation of, 589 
Diatomic molecule, 547, 548-549 
Dielectric constant, 104, 282 
Dielectric tensor 

in magnetic field, 329-332 
Differential equations with linear coeffi- 

cients, 199-200 

Diffusion 

equation, 351, 357ff 

boundary conditions for, 358 

in gas or solid, 357 

length of thermal neutrons, 352, 393 

of neutrons, 351, 378ff 
Dirac, P. A. M., 67, 71 

formulation of quantum mechanics, 443ff 
Direct product 

of vector spaces, 449, 458 
Disc, 

potential of, 144 
Dispersion relation 

of electromagnetic waves in Fermi gas, 602 
Dispersion relations 

of Kramers-Kronig, 162-163 
Displacement, electric, 281ff
Dissipative term 

harmonic-oscillator, 157-159 
Distribution of energy 

among N particles, 438-440 
Distributions, 44-45 
Dynamical variables 

classical, 457 

non-classical, 457 

in quantum mechanics, 448, 449 


Effective mass, 581 
Eigenfunctions, expansion in, 42 
orthogonality of, 35 
Eigenvalue problem, 3, 67 
Eigenvalues, 68 
calculation by variational method, 38 
of Hermitian operators, 443, 448ff 
Eigenvectors 


of Hermitian operators, 443, 448ff 
Elastic scattering 
by small sphere, 280-281 
Electromagnetic fields, 101-105, 281ff 
energy density of, 227, 293 
energy flux of, 227, 293 
generalized momenta of, 296 
Hamiltonian of, 131, 227, 295-297
Lagrangian of, 131, 227, 295-297, 300 
momentum density of, 293 
normal modes of, 297ff 
stress tensor of, 293-294 
wave equation for, 284ff 
Electromagnetic waves 
angular momentum of, 227 
in a dissipative medium, 352, 395ff 
in enclosures, 226 
Green’s function for, 352, 396-399 
propagation into conducting medium, 400 
resonant, 226 
Electrostatic potential 
expansion in spherical harmonics, 125 
Electron gas 
transverse conductivity, 598-602 
see Fermi gas 
Energy absorption 
of string, 48 
Energy density 
of string, 15 
Energy flux 
of sound waves, 251 
of string, 16 
Equation of continuity 
for diffusion processes, 357 
in neutron diffusion, 379 
Equation of motion 
vibrating string, 5-8 
Equation of state 
for a gas, 234 
Error integral 
in probability theory, 420 
Euler’s angles, 85 
and coordinate transformation, 125 
Exchange energy, 578 
Excitations, spectrum of, 2 


Factorization Method 
for hydrogenic atom, 544-546 
of Infeld and Hull, 97, 106 
for Legendre equation, 107-110 
Faraday’s law of induction, 104 




Fast neutrons 
moderation of, 383 
Fermions, 559, 561, 564 
commutation relations, 567 
Fermi-Dirac distribution function, 561 
derivation of, 594-595 
Fermi gas, 561, 581ff 
energy of, 582-583 
exchange energy, 561, 582-583, 586 
ferromagnetism of, 561, 584-585 
kinetic energy, 561, 582, 585 
Ferromagnetism 
of Fermi gas, 561 
Fick’s Law, 351, 357 
for neutron diffusion, 379 
Field, irrotational, 97, 285, 297 
solenoidal, 297 
Fission of nuclei, 352 
Fluctuations 
in multiplying systems, 434-436 
Fluid mechanics 
continuity equation, 233 
equation of state, 234 
equations of motion, 232ff 
Forbidden bands 
of frequency, 2, 25ff 
Forward scattering amplitude, 275 
Fourier 
integral, 154-155 
series, 14, 89, 154, 351
transform, 149, 150ff 
convolution theorem, 157 
of oscillator equation, 355 
and probability distributions of con- 
tinuous variables, 428-429 
and solution of Poisson’s equation, 168 
theorems on, 155-156 
in three dimensions, 166-168 
in vibrating string problem, 193-195 
Frequency distribution function, 165 


Gamma function, 151, 195-196 
asymptotic form, 340 
for negative argument, 196 
Gauge 
Coulomb, 227 
transformation, 286 
Gauge transformation 
of electromagnetic potentials, 286 
Gauss’ theorem, 104 
Gaussian distribution, 413 


Generalized momentum 
of sound waves, 241 
“Golden rule” formula, 485
Goldstein, L. 
Classical Mechanics, 60, 85 
Grand canonical ensemble 
density matrix of, 594 
Green’s functions, 3 
for diffusion equation, 360 
for electromagnetic waves, 396-399 
in electrostatics, 130 
for free-particle Schroédinger equation, 
376-378 
for heat conduction, 360ff 
for heat conduction equation with various 
boundary conditions, 362-369 
heat conduction in one dimension, 361 
for heat conduction in a sphere, 351, 369 
and image method, 130-132 
for London superconductors, 408-411
for neutron diffusion, 381-383 
in potential problems, 98, 99 
reciprocity theorems for, 264, 372-374 
relations between, for different geo- 
metries, 135 
for sound waves, 225-226, 264-267 
for superconducting electrodynamics, 353 
for vibrating string, 48, 50-52 
Guided waves 
of sound, 242 


Hamilton’s Principle, 60 
Hamiltonian 
of electromagnetic field, 131, 227, 295- 
297 
of sound waves, 225 
of string, 4, 63 
Hankel functions, 217-219 
asymptotic forms, 220 
representing spherical waves, 225 
spherical, 220 
spherical, of large order, 341-348 
Harmonic oscillator, 149, 499-500, 503ff 
classical, 355 
in one dimension, 499, 504ff
coordinate matrix elements, 508 
creation and annihilation operators, 
505-507 
normalization of wave functions, 509- 
510 
velocity matrix elements, 508 


wave functions of, 508-510 
in three dimensions, 499-500 
angular momentum of states, 536, 538 
Laguerre polynomials in solution, 542 
normalization of states, 538 
by operator methods, 534-540 
parity of states, 534 
in polar coordinates, 537, 540-542 
by separation of variables, 533, 534, 
540-542 
wave functions for, 535, 538, 539 
in two dimensions, 499-500 
angular momentum of states, 531-532 
in cylindrical coordinates, 527-530 
operator method, 530-533 
by separation of variables, 529-530 
wave functions, 530, 532-533 
Harmonics 
energy in, 18 
kinetic and potential energies of, 19 
on vibrating string, 2 
Hartree-Fock approximation, 560-561,
573-579
for free interacting particles, 579-581
Hartree-Fock eigenfunctions
orthogonality of, 578
Hartree-Fock equations
derivation of, 573-577
for free interacting particles, 579-580
Heat conduction, 351, 355ff
boundary conditions, 359
equation, 358
equation of continuity for, 358
with general initial and boundary condi-
tions, 372-376
solution in Green’s functions, 372-376
and specific heat, 359
Heat reservoir, 592
Heisenberg matrix mechanics, 67
Heisenberg representation, 443, 469
Helicons, 228, 333
dispersion relation, 333
excitation by a plane wave, 336-338
transmission of, 337
Hermite equation, 80
Hermite polynomials, 67, 79, 499
and one-dimensional harmonic oscillator,
509
Hermite functions, 207-210
as confluent hypergeometric, 209-210
integral representation, 208
orthonormality integrals, 209


recursion relations, 208 
Hermitian operators, 67, 86-88, 443 
eigenvalues and eigenvectors of, 68, 443, 
457 
orthogonality of eigenvectors, 88 
reality of eigenvalues, 68, 87 
Hertz vector, 227, 287 
Hydrogen atom 
Coulomb interaction of, 180-181 
Hydrogenic atom, 126, 501, 542ff 
energy eigenstates, 545 
energy eigenvalues, 545 
factorization methods for, 503, 544-546 
normalization of eigenfunctions, 545 
positive energy states, 545 
recursion formulae for eigenfunctions,
544-545
reduced mass, 543 
by separation of variables, 543 


Images, method of, 99 
Incoming waves, 250 
Indistinguishability of particles, 564 
Inhomogeneous problem 
vibrating string, 47 
Initial and boundary value problem for 
sound waves, 264 
Instability of zero-spin state in fermi gas, 
583 
Interaction 
of charged particles and electromagnetic 
fields, 291 
operators, in second quantization, 569- 
572 
representation, 443, 470ff 
Interference of measurements, 453 
Intrinsic angular momentum 
of electromagnetic waves, 289-290 
Inverse lattice, 172 
Inversion in sphere, 132-134 
Ionization energy 
and Hartree-Fock method, 574 
Irreversible processes, 351, 355 
Irrotational field, 297-298 


JWKB method, 3, 55
example, 356 
Jordan and Wigner 
commutation relations for fermions, 567 




K-operator, 482ff 
Ket vectors, 72 
Kramers—Kronig 
dispersion relations, 149, 162-163 
Kubo 
conductivity derived by, 598-602 
linear response theory, 561, 596-598 


Ladder operators, 499 
Lagrangian, 
in electromagnetic theory, 227, 295-297 
for sound wave, 225, 239-241 
of vibrating string, 4, 58 
Lagrangian equation for string, 60-61 
Laguerre equation, 80, 205 
Laguerre functions, 205-206 
associated, 206-207 
generating function, 207 
Laguerre polynomials, 67, 79 
in two-dimensional harmonic oscillator, 
530 
Laplace equation, 97, 102 
general solution, 111, 112
in spherical polar coordinates, 105ff
Laplace transforms, 149, 182ff 
calculation of, 184-187 
calculation of inverse, 186-189 
convolution theorem, 183
of derivatives, 182-183 
inverse theorem, 150, 188 
of periodic functions, 189 
in resonance phenomena, 190-193 
shifting theorem, 183 
in vibrating string problem, 193 
Large number limit 
and Gaussian distribution law, 418-420 
Lattice 
and Poisson summation formula, 150 
Legendre equation, 80, 106ff
Legendre polynomials, 67, 79, 112-118 
associated, 120-122 
generating function, 112-114 
integrals involving, 114 
recursion relations, 115-116 
and static multipoles, 116 
Lienard—Wiechert potentials, 323 
Lighthill, M. J., 44 
Linear differential equations of second order 
Green’s function for, 147 
theory of, 145ff 
Linear independence, 73 


Linear response functions, 163 

Linear response theory, 561, 596-598 
conductivity derived from, 598-602 

Linear vector space, 71ff, 443, 449ff
dimensionality of, 73 

Liquid drop 
dynamics of charged, 228 

Loaded string, 19 

Longitudinal vibrations of string, 8 

Lorentz condition, 286 

Lorentzian distribution, 441 


Madelung constant, 150, 176 
calculation of, 174-176 
Magnetic permeability, 282 
Magnetoplasma waves in solids, 326ff 
dielectric tensor of, 331-332 
longitudinal waves, 332 
propagation perpendicular to field, 334- 
335 
transverse waves, 333 
Many-body problems, 559, 563ff 
Many-body systems 
density of states, 586-587 
excited states of, 574 
Markoff method, 414, 436ff 
Matrix 
column, 82 
representation of linear operators, 84 
representation of vectors, 82 
row, 83 
transformation of, 83, 85 
unitary, 84 
Maxwell distribution, 440 
Maxwell’s Equations, 104, 262ff 
Mean particle density 
in many-body systems, 591 
Mean value 
definition, 417 
quantum, 443, 451 
quantum statistical, 488 
Measurement 
quantum, 448 
Meson equation, 411 
Method of images 
in electrostatics, 130-134 
in heat conduction, 362-365 
in thermal neutron diffusion, 381-382 
Migration length of neutrons, 352, 394 
Moderation of neutrons in nuclear reactors 
equation for, 383-385 


Moments, multipole, 117-118 
Momentum 
and Fourier transforms, 466, 468 
operators, 462-464 
representation, 443, 465-468 
Multipole moments, 98, 126, 227 
Multipole fields, 305ff 
angular momentum of, 310, 314 
contribution from magnetic moments,
313, 315
electric multipole radiation, 310ff
in electromagnetic theory, 305ff 
energy flux of partial waves, 315 
energy of, 309 
magnetic multipole radiation, 309ff 
potentials for, 306, 313-314 
Multipole radiation, 225ff 
“electric” radiation, 227 
of electromagnetic waves, 227 
“magnetic” radiation, 227


Neutron chain reactor 
critical conditions, 387 
decay of growth of neutrons, 389 
model of, 385-386 
solutions of steady-state equations, 390- 
391 
Neutron diffusion, 351, 352, 378ff 
see Diffusion of neutrons 
‘fast’ neutrons, 378
and fission, 378 
neutron capture, 378 
thermal neutrons, 378-379 
Neumann functions, 219 
definition, 219 
of large order, 345 
spherical, 220 
Normal modes, orthogonality of, 3, 25, 
298-299 
Nuclear reactor, 385ff 
criticality, 352, 389ff
Number operator, 566, 567 


Ohm’s Law, 283 
Operator 
Hermitian, 86-89, 451ff
time-dependent, 469
unitary, 84, 459ff
Optical theorem
in scattering theory, 227, 277




Orthogonal polynomials, 77-81 
Orthonormal set of vectors, 451 
Outgoing waves, 250 
Overlap integral, 179 


Parseval’s Theorem, 155-156, 165 
Pauli exclusion principle, 567 
Periodic lattice, 2, 27
Permeability, magnetic, 104 
Perturbation 
first order, 490-491 
due to oscillating potential, 489 
time-dependent, 488 
Perturbation series, 444, 445 
Perturbation theory 
degenerate case, 494 
expansion, 477-478 
Rayleigh-Schrédinger, 445, 492-495 
of stationary states, 445, 491ff 
with time-independent perturbations, 
476ff 
Wigner-Brillouin, 445, 496-497 
Perturbations 
in string problem, 53 
Plane electromagnetic waves, 301ff 
circularly polarized, 303 
energy density, 302 
as normal modes, 304-305 
Poynting vector, 302 
Plane wave 
expansion in spherical waves, 252-254, 
550 
Plasma frequency, 333, 562, 602 
Plasma modes 
in a magnetic field, 228 
Plasmas, 228 
equations for, 327-328 
in magnetic field, 326 
magnetoplasma dispersion relations, 328 
Poisson bracket, 457, 468 
and commutators, 457 
Poisson distribution, 413, 414, 422-423 
Gaussian approximation in large number 
limit, 424 
Poisson summation formula, 150, 169ff, 
363, 365, 381 
and method of images, 381-382 
vector form, 172-174 
Poisson’s equation, 97, 103, 168ff 
Green’s function, 169 
solution by Fourier transforms, 168-169 


Polarization 
of electromagnetic waves, 287, 291 
Positronium atom, 547 
Potential, 
electrostatic, 97, 101ff
of axially symmetric distribution, 127 
scalar, 101 
vector, 97, 103 
gravitational, 129 
theory, problems in, 138-144 
Potential barriers, 2 
Power spectrum, 149 
of time series, 440 
Probabilities 
in quantum mechanics, 450 
Probability distribution 
of continuous variables, 427ff 
of independent variables, 428 
of sum of random variables, 196 
Probability generating functions, 413, 416ff 
continuous variables, 427 
moments of, 417 
Projection operators, 444, 453 
in perturbation theory, 492-493 
in time-dependent perturbation theory, 
479-482 
Propagator 
in quantum mechanics, 444, 473-475 
Pure radiation field, 301 


Quantum mechanics 
hypotheses of, 448ff 

Quantum states 
degenerate, 452 
non-degenerate, 452 
“width” of, 444


Radiation 
of heat 
boundary condition of insulation, 362-364
boundary conditions for, 359 
radiative boundary conditions, 362-367 
from a moving charge, 322ff 
from moving source of sound, 225-226, 
259-263 
multipole moments of radiating source, 
256-257 
from an oscillating source, 254-259 
Random processes, 413, 415ff
Random walk problem, 414
and diffusion, 414, 433
by Markoff method, 436-438
Rayleigh
sixth power law, 227, 281
Rayleigh-Ritz variational principle, 3,
36-42
Reciprocity theorem
for Green’s functions, 264, 372-374
Reflection of waves on a string, 1-2, 11-13, 
20, 22, 28, 29
Regularly spaced masses on a string, 24 
Renormalization, 444 
Renormalized state, 479 
Representation of operators 
interaction operators, 569 
in second-quantization formalism, 566-568
Resonance, 3, 50, 190 
in harmonic oscillator, 191 
Resonant frequencies 
sound waves, 269 
Retarded potentials, 355 
Root mean square deviation, 413-414, 
417-418 
Rotation operator, 289
and angular momentum in quantum
mechanics, 289


S matrix, 2
for scattering on string, 23 
Scalar potential 
for electromagnetic waves, 285 
Scalar product, 67, 74 
Scattering 
of electromagnetic waves, 226 
of free particles, 560 
operator, 549 
quantum-mechanical, 475 
sound waves, 272ff 
of waves on a string, 20ff 
by nonlinearity, 62-63 
Scattering, quantum, 301, 549ff 
cross-section, 551-553 
by a “hard sphere”, 553
low-energy, 501, 553-556 
partial wave expansion for, 501, 503, 
549-553 
potential, 501, 556-557 
resonant, 501 
scattered wave, 550 
by short wave potential, 501, 549-553 


Schmidt inequality, 75 
Schmidt orthogonality procedure, 67, 76-77 
example, 77-79 
Schrödinger wave mechanics, 67
representation, 443, 469-470 
time-dependent wave equation, 352 
wave equation, 462-465 
wave functions, 443, 463, 469 
Schwinger’s method 
in angular momentum theory, 499, 517-520
Second quantization formalism, 499, 559, 
563ff 
Self-energy 
of system, 482 
Separation of variables, 2, 30 
for spherical wave equation, 225 
Sixth power law 
for scattering by small spheres, 227, 281 
Skin effect, 352-353, 401-402 
Slowing-down length 
in neutron diffusion, 393 
Solenoidal field, 103, 297-298 
Sound waves, 225, 231ff
approximations in deriving wave equation, 236
derivation of equations, 232-233
energy of, 237 
formula for radiated energy, 258 
Green’s functions for, 245-248 
group velocity, 239 
in guides, 225, 242-243, 270-272 
Hamiltonian of, 240-241 
incident wave, 273 
inhomogeneous equation, 244 
Lagrangian of, 239-240 
momentum density, 238 
phase velocity, 239 
propagation down cylindrical tube, 270-272
in spherical enclosure, 270 
velocity of, 234 
scattering of, 272ff 
scattered wave, 273
Specific heat 
of solid, 359 
Spherical Bessel functions 
large order, 338ff
as solutions of radial wave equation, 250 
Spherical harmonics, 97, 111, 121 
addition theorem for, 122-124 
orthogonality of, 122 


Spherical wave equation, 225 
Spherical waves, 249ff 
Spheroid
potential of, 142-143
Spin, 449, 511, 515-516
of scalar field, 288
of vector field, 288-291
Spinor operators, 499, 518
Square root law
for deviations, 418
Statistical operator, 561
(see density matrix)
Steepest descents
method of, 151, 228, 338-340
Stefan’s Law, 359
Stern-Gerlach experiment, 454
Stirling’s formula, 196-198
in probability theory, 419
Stochastic processes, 415, 440
Stokes’ theorem, 102
Sturm-Liouville problem, 91-94, 153
Sturm-Liouville equation, 2, 32
Sum of random variables
central limit theorem, 431
distribution of, 429
Sum rules, 459
Summation of series
by Poisson formula, 169ff
Superconductors
electrodynamics of, 351, 403
London model, 403-404
Pippard equation, 406-408
Supercurrent, 403
Superposition
of quantum states, 450
Symmetry
of eigenfunctions, 40
Symmetry and antisymmetry
of wave functions, 569


T matrix (operator), 478, 482-483 
Temperature
definition of, 593
Thermal neutrons
diffusion of, 379-383
Three-center integral, 180
Time-dependent perturbation, 445
Time evolution
operator, 444, 472ff
of quantum systems, 443, 468ff
Transform methods
equations with linear coefficients, 199-201
in potential problems, 168-169
in string problem, 193-195
for wave equation, 245-248
Transition
probability of, 444, 483, 485
to definite final state, 485-487
Transmission of waves
on vibrating string, 20, 22, 28, 29
Trial functions
in variational principles, 39
Tschebycheff polynomials, 67, 79
equation, 81
Two-center integrals, 178-180
Two-particle interactions, 560, 569 


Uncertainty principle, 443, 455, 456 
Unitary matrix, 23 
Unitary operators, 443, 459-461 
Unitary transformation, 67 
of density matrix, 589 
Units 
in electromagnetic theory, 228, 348-349 


Vectors, 71ff 
bra, 72 
conjugate, 72 
ket, 72 
Vector field 
irrotational, 279ff 
solenoidal, 297ff 
Vector potential 
for electromagnetic fields, 285ff, 403ff, 
598-599 


Vector space, 73 
Vector waves, 226 
Velocity potential 
for sound waves, 225, 235 
Velocity of sound, 234 
Vibrating string, 1, 3, 5ff 
Vibrating string equation 
by Fourier and Laplace transforms, 193-195
Virial theorem, 470 


Waves 
electromagnetic, 281ff 
sound, 231ff 
on string, 1, 3ff 
Waves, converging, 225, 250, 275-276, 550 
diverging, 225, 250, 275-276, 550 
partial, 276, 549ff 
reflected, 20-24 
scattered, 273-276, 550, 554 
transmitted, 273-276, 550-554 
Wave equation 
for electromagnetic fields, 284ff 
in one dimension, 7ff 
Wave equation, spherical, 225, 249ff 
Green’s function for, 306 
Waves in enclosures, 268-270 
spherical enclosures, 270 
Weighting function, 79-80 
Width, of quantum states, 444 
Work function, 574 


Young’s modulus of string, 8 
Yukawa 
and classical meson theory, 353 
meson equation, 411 


A CATALOG OF SELECTED 
DOVER BOOKS 


IN SCIENCE AND MATHEMATICS 




QUALITATIVE THEORY OF DIFFERENTIAL EQUATIONS, V.V. Nemytskii 
and V.V. Stepanov. Classic graduate-level text by two prominent Soviet mathemati- 
cians covers classical differential equations as well as topological dynamics and 
ergodic theory. Bibliographies. 523pp. 5% x 84. 65954-2 Pa. $14.95 


MATRICES AND LINEAR ALGEBRA, Hans Schneider and George Phillip 
Barker. Basic textbook covers theory of matrices and its applications to systems of 
linear equations and related topics such as determinants, eigenvalues and differential 
equations. Numerous exercises. 432pp. 5% x 8%. 66014-1 Pa. $10.95 


QUANTUM THEORY, David Bohm. This advanced undergraduate-level text pre- 
sents the quantum theory in terms of qualitative and imaginative concepts, followed 
by specific applications worked out in mathematical detail. Preface. Index. 655pp. 
5% x 8b. 65969-0 Pa. $14.95 


ATOMIC PHYSICS (8th edition), Max Born. Nobel laureate’s lucid treatment of 
kinetic theory of gases, elementary particles, nuclear atom, wave-corpuscles, atomic 
structure and spectral lines, much more. Over 40 appendices, bibliography. 495pp. 
5% x 814. 65984-4 Pa. $13.95 


ELECTRONIC STRUCTURE AND THE PROPERTIES OF SOLIDS: The 
Physics of the Chemical Bond, Walter A. Harrison. Innovative text offers basic 
understanding of the electronic structure of covalent and ionic solids, simple metals, 
transition metals and their compounds. Problems. 1980 edition. 582pp. 6% x 9%. 
66021-4 Pa. $16.95 


BOUNDARY VALUE PROBLEMS OF HEAT CONDUCTION, M. Necati 
Ozisik. Systematic, comprehensive treatment of modern mathematical methods of 
solving problems in heat conduction and diffusion. Numerous examples and prob- 
lems. Selected references. Appendices. 505pp. 5% x 874. 65990-9 Pa. $12.95 


A SHORT HISTORY OF CHEMISTRY (3rd edition), J.R. Partington. Classic 
exposition explores origins of chemistry, alchemy, early medical chemistry, nature of 
atmosphere, theory of valency, laws and structure of atomic theory, much more. 
428pp. 5% x 8%. (Available in U.S. only) 65977-1 Pa. $11.95 


A HISTORY OF ASTRONOMY, A. Pannekoek. Well-balanced, carefully reasoned
study covers such topics as Ptolemaic theory, work of Copernicus, Kepler, Newton,
Eddington’s work on stars, much more. Illustrated. References. 521pp. 5% x 8.
65994-1 Pa. $12.95


PRINCIPLES OF METEOROLOGICAL ANALYSIS, Walter J. Saucier. Highly 
respected, abundantly illustrated classic reviews atmospheric variables, hydrostatics, 
static stability, various analyses (scalar, cross-section, isobaric, isentropic, more). For 
intermediate meteorology students. 454pp. 6' x 94%. 65979-8 Pa. $14.95 


CATALOG OF DOVER BOOKS 


RELATIVITY, THERMODYNAMICS AND COSMOLOGY, Richard C. 
Tolman. Landmark study extends thermodynamics to special, general relativity; also 
applications of relativistic mechanics, thermodynamics to cosmological models. 
501pp. 5% x 84. 65383-8 Pa. $13.95


APPLIED ANALYSIS, Cornelius Lanczos. Classic work on analysis and design of 
finite processes for approximating solution of analytical problems. Algebraic equa- 
tions, matrices, harmonic analysis, quadrature methods, much more. 559pp. 5% x 84. 

65656-X Pa. $13.95 


INTRODUCTION TO ANALYSIS, Maxwell Rosenlicht. Unusually clear, accessi- 
ble coverage of set theory, real number system, metric spaces, continuous functions, 
Riemann integration, multiple integrals, more. Wide range of problems. 
Undergraduate level. Bibliography. 254pp. 5% x 8%. 65038-3 Pa. $8.95 


INTRODUCTION TO QUANTUM MECHANICS With Applications to 
Chemistry, Linus Pauling & E. Bright Wilson, Jr. Classic undergraduate text by Nobel 
Prize winner applies quantum mechanics to chemical and physical problems. 
Numerous tables and figures enhance the text. Chapter bibliographies. Appendices. 
Index. 468pp. 5% x 87. 64871-0 Pa. $12.95 


ASYMPTOTIC EXPANSIONS OF INTEGRALS, Norman Bleistein & Richard A. 
Handelsman. Best introduction to important field with applications in a variety of sci- 
entific disciplines. New preface. Problems. Diagrams. Tables. Bibliography. Index. 
448pp. 5% x 84. 65082-0 Pa. $12.95 


MATHEMATICS APPLIED TO CONTINUUM MECHANICS, Lee A. Segel. 
Analyzes models of fluid flow and solid deformation. For upper-level math, science 
and engineering students. 608pp. 5% x 8'4. 65369-2 Pa. $14.95 


ELEMENTS OF REAL ANALYSIS, David A. Sprecher. Classic text covers funda- 
mental concepts, real number system, point sets, functions of a real variable, Fourier 
series, much more. Over 500 exercises. 352pp. 5% x 84. 65385-4 Pa. $11.95 


PHYSICAL PRINCIPLES OF THE QUANTUM THEORY, Werner Heisenberg. 
Nobel Laureate discusses quantum theory, uncertainty, wave mechanics, work of 
Dirac, Schroedinger, Compton, Wilson, Einstein, etc. 184pp. 5% x 87. 

60113-7 Pa. $6.95 


INTRODUCTORY REAL ANALYSIS, A.N. Kolmogorov, S.V. Fomin. Translated 
by Richard A. Silverman. Self-contained, evenly paced introduction to real and func- 
tional analysis. Some 350 problems. 403pp. 5% x 84. 61226-0 Pa. $10.95 


PROBLEMS AND SOLUTIONS IN QUANTUM CHEMISTRY AND 
PHYSICS, Charles S. Johnson, Jr. and Lee G. Pedersen. Unusually varied problems, 
detailed solutions in coverage of quantum mechanics, wave mechanics, angular 
momentum, molecular spectroscopy, scattering theory, more. 280 problems plus 139 
supplementary exercises. 430pp. 6% x 944. 65236-X Pa. $13.95 




ASYMPTOTIC METHODS IN ANALYSIS, N.G. de Bruijn. An inexpensive, com- 
prehensive guide to asymptotic methods—the pioneering work that teaches by 
explaining worked examples in detail. Index. 224pp. 5% x 8%.  64221-6 Pa. $7.95 


OPTICAL RESONANCE AND TWO-LEVEL ATOMS, L. Allen and J. H. Eberly. 
Clear, comprehensive introduction to basic principles behind all quantum optical 
resonance phenomena. 53 illustrations. Preface. Index. 256pp. 5% x 8’. 

65533-4 Pa. $8.95 


COMPLEX VARIABLES, Francis J. Flanigan. Unusual approach, delaying com- 
plex algebra till harmonic functions have been analyzed from real variable view- 
point. Includes problems with answers. 364pp. 5% x 8'4. 61388-7 Pa. $9.95 


ATOMIC SPECTRA AND ATOMIC STRUCTURE, Gerhard Herzberg. One of 
best introductions; especially for specialist in other fields. Treatment is physical 
rather than mathematical. 80 illustrations. 257pp. 5% x 8's. 60115-3 Pa. $7.95 


APPLIED COMPLEX VARIABLES, John W. Dettman. Step-by-step coverage of 
fundamentals of analytic function theory—plus lucid exposition of five important 
applications: Potential Theory; Ordinary Differential Equations; Fourier Transforms, 
Laplace Transforms; Asymptotic Expansions. 66 figures. Exercises at chapter ends. 
512pp. 5% x 84. 64670-X Pa. $12.95 


ULTRASONIC ABSORPTION: An Introduction to the Theory of Sound 
Absorption and Dispersion in Gases, Liquids and Solids, A.B. Bhatia. Standard ref- 
erence in the field provides a clear, systematically organized introductory review of 
fundamental concepts for advanced graduate students, research workers. Numerous 
diagrams. Bibliography. 440pp. 5% x 8'4. 64917-2 Pa. $11.95 


UNBOUNDED LINEAR OPERATORS: Theory and Applications, Seymour 
Goldberg. Classic presents systematic treatment of the theory of unbounded linear 
operators in normed linear spaces with applications to differential equations. 
Bibliography. 199pp. 5% x 84. 64830-3 Pa. $7.95


LIGHT SCATTERING BY SMALL PARTICLES, H.C. van de Hulst. Compre- 
hensive treatment including full range of useful approximation methods for 
researchers in chemistry, meteorology and astronomy. 44 illustrations. 470pp. 5% x 87. 

64228-3 Pa. $12.95 


CONFORMAL MAPPING ON RIEMANN SURFACES, Harvey Cohn. Lucid, 
insightful book presents ideal coverage of subject. 334 exercises make book perfect 
for self-study. 55 figures. 352pp. 5% x 84. 64025-6 Pa. $11.95 


OPTICKS, Sir Isaac Newton. Newton’s own experiments with spectroscopy, colors, 
lenses, reflection, refraction, etc., in language the layman can follow. Foreword by 
Albert Einstein. 532pp. 5% x 8’. 60205-2 Pa. $12.95 


GENERALIZED INTEGRAL TRANSFORMATIONS, A.H. Zemanian. Graduate-
level study of recent generalizations of the Laplace, Mellin, Hankel, K. Weierstrass,
convolution and other simple transformations. Bibliography. 320pp. 5% x 84.
65375-7 Pa. $8.95




THE ELECTROMAGNETIC FIELD, Albert Shadowitz. Comprehensive under- 
graduate text covers basics of electric and magnetic fields, builds up to electromag- 
netic theory. Also related topics, including relativity. Over 900 problems. 768pp. 
5% x 84%. 65660-8 Pa. $18.95 


FOURIER SERIES, Georgi P. Tolstov. Translated by Richard A. Silverman. A valu- 
able addition to the literature on the subject, moving clearly from subject to subject 
and theorem to theorem. 107 problems, answers. 336pp. 5% x 8%. 63317-9 Pa. $9.95 


THEORY OF ELECTROMAGNETIC WAVE PROPAGATION, Charles Herach 
Papas. Graduate-level study discusses the Maxwell field equations, radiation from 
wire antennas, the Doppler effect and more. xiii + 244pp. 5% x 8%. 65678-0 Pa. $6.95 


DISTRIBUTION THEORY AND TRANSFORM ANALYSIS: An Introduction to 
Generalized Functions, with Applications, A.H. Zemanian. Provides basics of distri- 
bution theory, describes generalized Fourier and Laplace transformations. Numerous 
problems. 384pp. 5% x 84. 65479-6 Pa. $11.95 


THE PHYSICS OF WAVES, William C. Elmore and Mark A. Heald. Unique 
overview of classical wave theory. Acoustics, optics, electromagnetic radiation, more. 
Ideal as classroom text or for self-study. Problems. 477pp. 5% x 8%. 

64926-1 Pa. $13.95 


CALCULUS OF VARIATIONS WITH APPLICATIONS, George M. Ewing. 
Applications-oriented introduction to variational theory develops insight and pro- 
motes understanding of specialized books, research papers. Suitable for advanced 
undergraduate/graduate students as primary, supplementary text. 352pp. 5% x 8’. 
64856-7 Pa. $9.95 


A TREATISE ON ELECTRICITY AND MAGNETISM, James Clerk Maxwell. 
Important foundation work of modern physics. Brings to final form Maxwell’s theo- 
ry of electromagnetism and rigorously derives his general equations of field theory. 
1,084pp. 5% x 84. 60636-8, 60637-6 Pa., Two-vol. set $25.90 


AN INTRODUCTION TO THE CALCULUS OF VARIATIONS, Charles Fox.
Graduate-level text covers variations of an integral, isoperimetrical problems, least
action, special relativity, approximations, more. References. 279pp. 5% x 8'4.
65499-0 Pa. $8.95


HYDRODYNAMIC AND HYDROMAGNETIC STABILITY, S. Chandrasekhar. 
Lucid examination of the Rayleigh-Benard problem; clear coverage of the theory of 
instabilities causing convection. 704pp. 5% x 8%. 64071-X Pa. $14.95 


CALCULUS OF VARIATIONS, Robert Weinstock. Basic introduction covering 
isoperimetric problems, theory of elasticity, quantum mechanics, electrostatics, etc. 
Exercises throughout. 326pp. 5% x 8'4. 63069-2 Pa. $9.95 


DYNAMICS OF FLUIDS IN POROUS MEDIA, Jacob Bear. For advanced stu-
dents of ground water hydrology, soil mechanics and physics, drainage and irrigation
engineering and more. 335 illustrations. Exercises, with answers. 784pp. 6's x 9%.
65675-6 Pa. $19.95




NUMERICAL METHODS FOR SCIENTISTS AND ENGINEERS, Richard 
Hamming. Classic text stresses frequency approach in coverage of algorithms, poly- 
nomial approximation, Fourier approximation, exponential approximation, other 
topics. Revised and enlarged 2nd edition. 721pp. 5% x 84. 65241-6 Pa. $15.95 


THEORETICAL SOLID STATE PHYSICS, Vol. I: Perfect Lattices in
Equilibrium; Vol. II: Non-Equilibrium and Disorder, William Jones and Norman H. 
March. Monumental reference work covers fundamental theory of equilibrium 
properties of perfect crystalline solids, non-equilibrium properties, defects and dis- 
ordered systems. Appendices. Problems. Preface. Diagrams. Index. Bibliography. 
Total of 1,301pp. 5% x 8%. Two volumes. Vol. I: 65015-4 Pa. $16.95 

Vol. II: 65016-2 Pa. $16.95


OPTIMIZATION THEORY WITH APPLICATIONS, Donald A. Pierre. Broad 
spectrum approach to important topic. Classical theory of minima and maxima, cal- 
culus of variations, simplex technique and linear programming, more. Many prob- 
lems, examples. 640pp. 5% x 8'4. 65205-X Pa. $16.95 


THE CONTINUUM: A Critical Examination of the Foundation of Analysis, 
Hermann Weyl. Classic of 20th-century foundational research deals with the con- 
ceptual problem posed by the continuum. 156pp. 5% x 8'4. 67982-9 Pa. $6.95 


ESSAYS ON THE THEORY OF NUMBERS, Richard Dedekind. Two classic 
essays by great German mathematician: on the theory of irrational numbers; and on 
transfinite numbers and properties of natural numbers. 115pp. 5% x 8%. 

21010-3 Pa. $5.95 


THE FUNCTIONS OF MATHEMATICAL PHYSICS, Harry Hochstadt. 
Comprehensive treatment of orthogonal polynomials, hypergeometric functions, 
Hill’s equation, much more. Bibliography. Index. 322pp. 5% x 8%. 65214-9 Pa. $9.95 


NUMBER THEORY AND ITS HISTORY, Oystein Ore. Unusually clear, accessi- 
ble introduction covers counting, properties of numbers, prime numbers, much 
more. Bibliography. 380pp. 5% x 8%. 65620-9 Pa. $10.95 


THE VARIATIONAL PRINCIPLES OF MECHANICS, Cornelius Lanczos. 
Graduate level coverage of calculus of variations, equations of motion, relativistic 
mechanics, more. First inexpensive paperbound edition of classic treatise. Index. 
Bibliography. 418pp. 5% x 8. 65067-7 Pa. $12.95 


MATHEMATICAL TABLES AND FORMULAS, Robert D. Carmichael and 
Edwin R. Smith. Logarithms, sines, tangents, trig functions, powers, roots, recipro- 
cals, exponential and hyperbolic functions, formulas and theorems. 269pp. 5% x 84. 

60111-0 Pa. $6.95 


THEORETICAL PHYSICS, Georg Joos, with Ira M. Freeman. Classic overview 
covers essential math, mechanics, electromagnetic theory, thermodynamics, quan- 
tum mechanics, nuclear physics, other topics. First paperback edition. xxiii + 885pp.
5% x 8. 65227-0 Pa. $21.95 




HANDBOOK OF MATHEMATICAL FUNCTIONS WITH FORMULAS, 
GRAPHS, AND MATHEMATICAL TABLES, edited by Milton Abramowitz and 
Irene A. Stegun. Vast compendium: 29 sets of tables, some to as high as 20 places. 
1,046pp. 8 x 104. 61272-4 Pa. $26.95 


MATHEMATICAL METHODS IN PHYSICS AND ENGINEERING, John W. 
Dettman. Algebraically based approach to vectors, mapping, diffraction, other topics 
in applied math. Also generalized functions, analytic function theory, more. 
Exercises. 448pp. 5% x 84. 65649-7 Pa. $10.95 


A SURVEY OF NUMERICAL MATHEMATICS, David M. Young and Robert 
Todd Gregory. Broad self-contained coverage of computer-oriented numerical algo- 
rithms for solving various types of mathematical problems in linear algebra, ordinary 
and partial, differential equations, much more. Exercises. Total of 1,248pp. 5% x 8'. 
Two volumes. Vol. I: 65691-8 Pa. $16.95 

Vol. II: 65692-6 Pa. $16.95 


TENSOR ANALYSIS FOR PHYSICISTS, J.A. Schouten. Concise exposition of the 
mathematical basis of tensor analysis, integrated with well-chosen physical examples 
of the theory. Exercises. Index. Bibliography. 289pp. 5%x 84. 65582-2 Pa. $8.95 


INTRODUCTION TO NUMERICAL ANALYSIS (2nd Edition), F.B. Hilde- 
brand. Classic, fundamental treatment covers computation, approximation, inter- 
polation, numerical differentiation and integration, other topics. 150 new problems. 
669pp. 5% x 84. 65363-3 Pa. $16.95 


INVESTIGATIONS ON THE THEORY OF THE BROWNIAN MOVEMENT, 
Albert Einstein. Five papers (1905-8) investigating dynamics of Brownian motion 
and evolving elementary theory. Notes by R. Fürth. 122pp. 5% x 8%.

60304-0 Pa. $5.95 


CATASTROPHE THEORY FOR SCIENTISTS AND ENGINEERS, Robert 
Gilmore. Advanced-level treatment describes mathematics of theory grounded in the 
work of Poincaré, R. Thom, other mathematicians. Also important applications to 
problems in mathematics, physics, chemistry and engineering. 1981 edition. 
References. 28 tables. 397 black-and-white illustrations. xvii + 666pp. 6% x 9's. 
67539-4 Pa. $17.95 


AN INTRODUCTION TO STATISTICAL THERMODYNAMICS, Terrell L.
Hill. Excellent basic text offers wide-ranging coverage of quantum statistical mechan-
ics, systems of interacting molecules, quantum statistics, more. 523pp. 5% x 8’.
65242-4 Pa. $12.95


STATISTICAL PHYSICS, Gregory H. Wannier. Classic text combines thermody- 
namics, statistical mechanics and kinetic theory in one unified presentation of ther- 
mal physics. Problems with solutions. Bibliography. 532pp. 5% x 8%. 

65401-X Pa. $12.95 




ORDINARY DIFFERENTIAL EQUATIONS, Morris Tenenbaum and Harry 
Pollard. Exhaustive survey of ordinary differential equations for undergraduates in 
mathematics, engineering, science. Thorough analysis of theorems. Diagrams. 
Bibliography. Index. 818pp. 5% x 84. 64940-7 Pa. $18.95 


STATISTICAL MECHANICS: Principles and Applications, Terrell L. Hill. 
Standard text covers fundamentals of statistical mechanics, applications to fluctuation 
theory, imperfect gases, distribution functions, more. 448pp. 5% x 8's. 

65390-0 Pa. $11.95 


ORDINARY DIFFERENTIAL EQUATIONS AND STABILITY THEORY: An 
Introduction, David A. Sanchez. Brief, modern treatment. Linear equation, stability 
theory for autonomous and nonautonomous systems, etc. 164pp. 5% x 8. 

63828-6 Pa. $6.95 


THIRTY YEARS THAT SHOOK PHYSICS: The Story of Quantum Theory, 
George Gamow. Lucid, accessible introduction to influential theory of energy and 
matter. Careful explanations of Dirac’s anti-particles, Bohr’s model of the atom, 
much more. 12 plates. Numerous drawings. 240pp. 5% x 8. 24895-X Pa. $7.95 


THEORY OF MATRICES, Sam Perlis. Outstanding text covering rank, nonsingu- 
larity and inverses in connection with the development of canonical matrices under 
the relation of equivalence, and without the intervention of determinants. Includes 
exercises. 237pp. 5% x 844. 66810-X Pa. $8.95 


GREAT EXPERIMENTS IN PHYSICS: Firsthand Accounts from Galileo to 
Einstein, edited by Morris H. Shamos. 25 crucial discoveries: Newton’s laws of 
motion, Chadwick’s study of the neutron, Hertz on electromagnetic waves, more. 
Original accounts clearly annotated. 370pp. 5% x 8%. 25346-5 Pa. $10.95 


INTRODUCTION TO PARTIAL DIFFERENTIAL EQUATIONS WITH 
APPLICATIONS, E.C. Zachmanoglou and Dale W. Thoe. Essentials of partial dif- 
ferential equations applied to common problems in engineering and the physical sci- 
ences. Problems and answers. 416pp. 5% x 8'4. 65251-3 Pa. $11.95 


BURNHAM’S CELESTIAL HANDBOOK, Robert Burnham, Jr. Thorough guide 
to the stars beyond our solar system. Exhaustive treatment. Alphabetical by constel- 
lation: Andromeda to Cetus in Vol. 1; Chamaeleon to Orion in Vol. 2; and Pavo to 
Vulpecula in Vol. 3. Hundreds of illustrations. Index in Vol. 3. 2,000pp. 6% x 94%. 
23567-X, 23568-8, 23673-0 Pa., Three-vol. set $44.85 


CHEMICAL MAGIC, Leonard A. Ford. Second Edition, Revised by E. Winston 
Grundmeier. Over 100 unusual stunts demonstrating cold fire, dust explosions, 
much more. Text explains scientific principles and stresses safety precautions. 
128pp. 5% x 8'4. 67628-5 Pa. $5.95 


AMATEUR ASTRONOMER’S HANDBOOK, J.B. Sidgwick. Timeless, compre-
hensive coverage of telescopes, mirrors, lenses, mountings, telescope drives,
micrometers, spectroscopes, more. 189 illustrations. 576pp. 5% x 8%. (Available in
U.S. only) 24034-7 Pa. $11.95




SPECIAL FUNCTIONS, N.N. Lebedev. Translated by Richard Silverman. Famous 
Russian work treating more important special functions, with applications to specific 
problems of physics and engineering. 38 figures. 308pp. 5% x 8%. 60624-4 Pa. $9.95 


OBSERVATIONAL ASTRONOMY FOR AMATEURS, J.B. Sidgwick. Mine of 
useful data for observation of sun, moon, planets, asteroids, aurorae, meteors, 
comets, variables, binaries, etc. 39 illustrations. 384pp. 5% x 8%. (Available in U.S. 
only) 24033-9 Pa. $8.95 


INTEGRAL EQUATIONS, F.G. Tricomi. Authoritative, well-written treatment of
extremely useful mathematical tool with wide applications. Volterra Equations, 
Fredholm Equations, much more. Advanced undergraduate to graduate level. 
Exercises. Bibliography. 238pp. 5% x 8'4. 64828-1 Pa. $8.95 


POPULAR LECTURES ON MATHEMATICAL LOGIC, Hao Wang. Noted logi- 
cian’s lucid treatment of historical developments, set theory, model theory, recursion 
theory and constructivism, proof theory, more. 3 appendixes. Bibliography. 1981 edi- 
tion. ix + 283pp. 5% x 84. 67632-3 Pa. $8.95 


MODERN NONLINEAR EQUATIONS, Thomas L. Saaty. Emphasizes practical 
solution of problems; covers seven types of equations. “. . . a welcome contribution 
to the existing literature...."—Math Reviews. 490pp. 5% x 8'4. 64232-1 Pa. $13.95 


FUNDAMENTALS OF ASTRODYNAMICS, Roger Bate et al. Modern approach 
developed by U.S. Air Force Academy. Designed as a first course. Problems, exer- 
cises. Numerous illustrations. 455pp. 5% x 8/4. 60061-0 Pa. $10.95 


INTRODUCTION TO LINEAR ALGEBRA AND DIFFERENTIAL EQUA- 
TIONS, John W. Dettman. Excellent text covers complex numbers, determinants, 
orthonormal bases, Laplace transforms, much more. Exercises with solutions. 
Undergraduate level. 416pp. 5% x 8's. 65191-6 Pa. $11.95 


INCOMPRESSIBLE AERODYNAMICS, edited by Bryan Thwaites. Covers theo- 
retical and experimental treatment of the uniform flow of air and viscous fluids past 
two-dimensional aerofoils and three-dimensional wings; many other topics. 654pp. 
5% x 8h. 65465-6 Pa. $16.95 


INTRODUCTION TO DIFFERENCE EQUATIONS, Samuel Goldberg. Excep- 
tionally clear exposition of important discipline with applications to sociology, psy- 
chology, economics. Many illustrative examples; over 250 problems. 260pp. 5% x 8’. 

65084-7 Pa. $8.95 


LAMINAR BOUNDARY LAYERS, edited by L. Rosenhead. Engineering classic 
covers steady boundary layers in two- and three- dimensional flow, unsteady bound- 
ary layers, stability, observational techniques, much more. 708pp. 5% x 84. 

65646-2 Pa. $18.95


LECTURES ON CLASSICAL DIFFERENTIAL GEOMETRY, Second Edition, 
Dirk J. Struik. Excellent brief introduction covers curves, theory of surfaces, funda- 
mental equations, geometry on a surface, conformal mapping, other topics. 
Problems. 240pp. 5% x 8%. 65609-8 Pa. $8.95 




ROTARY-WING AERODYNAMICS, W.Z. Stepniewski. Clear, concise text covers
aerodynamic phenomena of the rotor and offers guidelines for helicopter perfor-
mance evaluation. Originally prepared for NASA. 537 figures. 640pp. 6% x 9%.
64647-5 Pa. $16.95


DIFFERENTIAL GEOMETRY, Heinrich W. Guggenheimer. Local differential 
geometry as an application of advanced calculus and linear algebra. Curvature, trans- 
formation groups, surfaces, more. Exercises. 62 figures. 378pp. 5% x 8’. 

63433-7 Pa. $9.95 


INTRODUCTION TO SPACE DYNAMICS, William Tyrrell Thomson. Com- 
prehensive, classic introduction to space-flight engineering for advanced undergrad- 
uate and graduate students. Includes vector algebra, kinematics, transformation of 
coordinates. Bibliography. Index. 352pp. 5% x 84. 65113-4 Pa. $9.95 


A SURVEY OF MINIMAL SURFACES, Robert Osserman. Up-to-date, in-depth 
discussion of the field for advanced students. Corrected and enlarged edition covers 
new developments. Includes numerous problems. 192pp. 5% x 84. 64998-9 Pa. $8.95 


ANALYTICAL MECHANICS OF GEARS, Earle Buckingham. Indispensable ref- 
erence for modern gear manufacture covers conjugate gear-tooth action, gear-tooth 
profiles of various gears, many other topics. 263 figures. 102 tables. 546pp. 5% x 84. 

65712-4 Pa. $14.95 


SET THEORY AND LOGIC, Robert R. Stoll. Lucid introduction to unified theory 
of mathematical concepts. Set theory and logic seen as tools for conceptual under- 
standing of real number system. 496pp. 5% x 8%. 63829-4 Pa. $12.95 


A HISTORY OF MECHANICS, René Dugas. Monumental study of mechanical 
principles from antiquity to quantum mechanics. Contributions of ancient Greeks, 
Galileo, Leonardo, Kepler, Lagrange, many others. 671pp. 5% x 84. 

65632-2 Pa. $14.95 


FAMOUS PROBLEMS OF GEOMETRY AND HOW TO SOLVE THEM, 
Benjamin Bold. Squaring the circle, trisecting the angle, duplicating the cube: learn 
their history, why they are impossible to solve, then solve them yourself. 128pp. 5% 
x 84h. 24297-8 Pa. $4.95 


MECHANICAL VIBRATIONS, J.P. Den Hartog. Classic textbook offers lucid 
explanations and illustrative models, applying theories of vibrations to a variety of 
practical industrial engineering problems. Numerous figures. 233 problems, solu- 
tions. Appendix. Index. Preface. 436pp. 5% x 8'. 64785-4 Pa. $11.95 


CURVATURE AND HOMOLOGY, Samuel I. Goldberg. Thorough treatment of 
specialized branch of differential geometry. Covers Riemannian manifolds, topology 
of differentiable manifolds, compact Lie groups, other topics. Exercises. 315pp. 5% x 
8k. 64314-X Pa. $9.95 


HISTORY OF STRENGTH OF MATERIALS, Stephen P. Timoshenko. Excellent 
historical survey of the strength of materials with many references to the theories of 
elasticity and structure. 245 figures. 452pp. 5% x 814. 61187-6 Pa. $12.95 


CATALOG OF DOVER BOOKS 


GEOMETRY OF COMPLEX NUMBERS, Hans Schwerdtfeger. Illuminating, 
widely praised book on analytic geometry of circles, the Moebius transformation, 
and two-dimensional non-Euclidean geometries. 200pp. 5°4 x 8%. 63830-8 Pa. $8.95 


MECHANICS, J.P. Den Hartog. A classic introductory text or refresher. Hundreds 
of applications and design problems illuminate fundamentals of trusses, loaded 
beams and cables, etc. 334 answered problems. 462pp. 5% x 8%. 60754-2 Pa. $11.95 


TOPOLOGY, John G. Hocking and Gail S. Young. Superb one-year course in clas- 
sical topology. Topological spaces and functions, point-set topology, much more. 
Examples and problems. Bibliography. Index. 384pp. 5% x 8%. 65676-4 Pa. $10.95 


STRENGTH OF MATERIALS, J.P. Den Hartog. Full, clear treatment of basic mate- 
rial (tension, torsion, bending, etc.) plus advanced material on engineering methods, 
applications. 350 answered problems. 323pp. 5% x 8'4. 60755-0 Pa. $9.95 


ELEMENTARY CONCEPTS OF TOPOLOGY, Paul Alexandroff. Elegant, intu- 
itive approach to topology from set-theoretic topology to Betti groups; how concepts 
of topology are useful in math and physics. 25 figures. 57pp. 5% x 8’. 

60747-X Pa. $3.95 


ADVANCED STRENGTH OF MATERIALS, J.P. Den Hartog. Superbly written 
advanced text covers torsion, rotating disks, membrane stresses in shells, much more. 
Many problems and answers. 388pp. 5% x 8's. 65407-9 Pa. $10.95 


COMPUTABILITY AND UNSOLVABILITY, Martin Davis. Classic graduate- 
level introduction to theory of computability, usually referred to as theory of recur- 
rent functions. New preface and appendix. 288pp. 5% x 8’. 61471-9 Pa. $8.95 


GENERAL CHEMISTRY, Linus Pauling. Revised 3rd edition of classic first-year 
text by Nobel laureate. Atomic and molecular structure, quantum mechanics, statis- 
tical mechanics, thermodynamics correlated with descriptive chemistry. Problems. 
992pp. 5% x 8's. 65622-5 Pa. $19.95 


AN INTRODUCTION TO MATRICES, SETS AND GROUPS FOR SCIENCE 
STUDENTS, G. Stephenson. Concise, readable text introduces sets, groups, and 
most importantly, matrices to undergraduate students of physics, chemistry, and 
engineering. Problems. 164pp. 5% x 84. 65077-4 Pa. $7.95 


THE HISTORICAL BACKGROUND OF CHEMISTRY, Henry M. Leicester. 
Evolution of ideas, not individual biography. Concentrates on formulation of a coher- 
ent set of chemical laws. 260pp. 5% x 84. 61053-5 Pa. $8.95 


THE PHILOSOPHY OF MATHEMATICS: An Introductory Essay, Stephan 
Körner. Surveys the views of Plato, Aristotle, Leibniz & Kant concerning proposi- 
tions and theories of applied and pure mathematics. Introduction. Two appendices. 
Index. 198pp. 5% x 814. 25048-2 Pa. $8.95 


THE DEVELOPMENT OF MODERN CHEMISTRY, Aaron J. Ihde. Authorita- 
tive history of chemistry from ancient Greek theory to 20th-century innovation. 
Covers major chemists and their discoveries. 209 illustrations. 14 tables. 
Bibliographies. Indices. Appendices. 851pp. 5% x 84. 64235-6 Pa. $18.95 


DE RE METALLICA, Georgius Agricola. The famous Hoover translation of great- 
est treatise on technological chemistry, engineering, geology, mining of early mod- 
ern times (1556). All 289 original woodcuts. 638pp. 6% x 11. 60006-8 Pa. $21.95 


SOME THEORY OF SAMPLING, William Edwards Deming. Analysis of the 
problems, theory and design of sampling techniques for social scientists, industrial 
managers and others who find statistics increasingly important in their work. 61 
tables. 90 figures. xvii + 602pp. 5% x 84. 64684-X Pa. $16.95 


THE VARIOUS AND INGENIOUS MACHINES OF AGOSTINO RAMELLI: 
A Classic Sixteenth-Century Illustrated Treatise on Technology, Agostino Ramelli. 
One of the most widely known and copied works on machinery in the 16th century. 
194 detailed plates of water pumps, grain mills, cranes, more. 608pp. 9 x 12. 
28180-9 Pa. $24.95 


LINEAR PROGRAMMING AND ECONOMIC ANALYSIS, Robert Dorfman, 
Paul A. Samuelson and Robert M. Solow. First comprehensive treatment of linear 
programming in standard economic analysis. Game theory, modern welfare eco- 
nomics, Leontief input-output, more. 525pp. 5% x 8'. 65491-5 Pa. $14.95 


ELEMENTARY DECISION THEORY, Herman Chernoff and Lincoln E. 
Moses. Clear introduction to statistics and statistical theory covers data process- 
ing, probability and random variables, testing hypotheses, much more. Exercises. 
364pp. 5% x 8's. 65218-1 Pa. $10.95 


THE COMPLEAT STRATEGYST: Being a Primer on the Theory of Games of 
Strategy, J.D. Williams. Highly entertaining classic describes, with many illustrated 
examples, how to select best strategies in conflict situations. Prefaces. Appendices. 
268pp. 5% x 814. 25101-2 Pa. $7.95 


CONSTRUCTIONS AND COMBINATORIAL PROBLEMS IN DESIGN OF 
EXPERIMENTS, Damaraju Raghavarao. In-depth reference work examines 
orthogonal Latin squares, incomplete block designs, tactical configuration, partial 
geometry, much more. Abundant explanations, examples. 416pp. 5% x 844. 

65685-3 Pa. $10.95 


THE ABSOLUTE DIFFERENTIAL CALCULUS (CALCULUS OF TENSORS), 
Tullio Levi-Civita. Great 20th-century mathematician’s classic work on material nec- 
essary for mathematical grasp of theory of relativity. 452pp. 5% x 84. 

63401-9 Pa. $11.95 


VECTOR AND TENSOR ANALYSIS WITH APPLICATIONS, A.I. Borisenko 
and I.E. Tarapov. Concise introduction. Worked-out problems, solutions, exercises. 
257pp. 5% x 84. 63833-2 Pa. $8.95 


THE FOUR-COLOR PROBLEM: Assaults and Conquest, Thomas L. Saaty and 
Paul G. Kainen. Engrossing, comprehensive account of the century-old combinator- 
ial topological problem, its history and solution. Bibliographies. Index. 110 figures. 
228pp. 5% x 8's. 65092-8 Pa. $7.95 


CATALYSIS IN CHEMISTRY AND ENZYMOLOGY, William P. Jencks. 
Exceptionally clear coverage of mechanisms for catalysis, forces in aqueous solution, 
carbonyl- and acyl-group reactions, practical kinetics, more. 864pp. 5% x 84. 
65460-5 Pa. $19.95 


PROBABILITY: An Introduction, Samuel Goldberg. Excellent basic text covers set 
theory, probability theory for finite sample spaces, binomial theorem, much more. 
360 problems. Bibliographies. 322pp. 5% x 84. 65252-1 Pa. $10.95 


LIGHTNING, Martin A. Uman. Revised, updated edition of classic work on the 
physics of lightning. Phenomena, terminology, measurement, photography, spec- 
troscopy, thunder, more. Reviews recent research. Bibliography. Indices. 320pp. 
5% x 8h. 64575-4 Pa. $8.95 


PROBABILITY THEORY: A Concise Course, Y.A. Rozanov. Highly readable, self- 
contained introduction covers combination of events, dependent events, Bernoulli 
trials, etc. Translation by Richard Silverman. 148pp. 5% x 8%. 63544-9 Pa. $7.95 


AN INTRODUCTION TO HAMILTONIAN OPTICS, H. A. Buchdahl. Detailed 
account of the Hamiltonian treatment of aberration theory in geometrical optics. 
Many classes of optical systems defined in terms of the symmetries they possess. 
Problems with detailed solutions. 1970 edition. xv + 360pp. 5% x 8's. 

67597-1 Pa. $10.95 


STATISTICS MANUAL, Edwin L. Crow, et al. Comprehensive, practical collection 
of classical and modern methods prepared by U.S. Naval Ordnance Test Station. 
Stress on use. Basics of statistics assumed. 288pp. 5% x 8. 60599-X Pa. $7.95 


DICTIONARY/OUTLINE OF BASIC STATISTICS, John E. Freund and Frank J. 
Williams. A clear concise dictionary of over 1,000 statistical terms and an outline of 
statistical formulas covering probability, nonparametric tests, much more. 208pp. 
5% x 8's. 66796-0 Pa. $7.95 


STATISTICAL METHOD FROM THE VIEWPOINT OF QUALITY CON- 
TROL, Walter A. Shewhart. Important text explains regulation of variables, uses of 
statistical control to achieve quality control in industry, agriculture, other areas. 
192pp. 5% x 84. 65232-7 Pa. $7.95 


METHODS OF THERMODYNAMICS, Howard Reiss. Outstanding text focuses 
on physical technique of thermodynamics, typical problem areas of understanding, 
and significance and use of thermodynamic potential. 1965 edition. 238pp. 5% x 8%. 

69445-3 Pa. $8.95 


STATISTICAL ADJUSTMENT OF DATA, W. Edwards Deming. Introduction to 
basic concepts of statistics, curve fitting, least squares solution, conditions without para- 
meter, conditions containing parameters. 26 exercises worked out. 271pp. 5% x 84. 

64685-8 Pa. $9.95 


TENSOR CALCULUS, J.L. Synge and A. Schild. Widely used introductory text 
covers spaces and tensors, basic operations in Riemannian space, non-Riemannian 
spaces, etc. 324pp. 5% x 814. 63612-7 Pa. $9.95 


A CONCISE HISTORY OF MATHEMATICS, Dirk J. Struik. The best brief his- 
tory of mathematics. Stresses origins and covers every major figure from ancient 
Near East to 19th century. 41 illustrations. 195pp. 5% x 84. 60255-9 Pa. $8.95 


A SHORT ACCOUNT OF THE HISTORY OF MATHEMATICS, W.W. Rouse 
Ball. One of clearest, most authoritative surveys from the Egyptians and Phoenicians 
through 19th-century figures such as Grassman, Galois, Riemann. Fourth edition. 
522pp. 5% x 8's. 20630-0 Pa. $11.95 


HISTORY OF MATHEMATICS, David E. Smith. Nontechnical survey from 
ancient Greece and Orient to late 19th century; evolution of arithmetic, geometry, 
trigonometry, calculating devices, algebra, the calculus. 362 illustrations. 1,355pp. 
5% x 84h. 20429-4, 20430-8 Pa., Two-vol. set $26.90 


THE GEOMETRY OF RENE DESCARTES, René Descartes. The great work 
founded analytical geometry. Original French text, Descartes’ own diagrams, togeth- 
er with definitive Smith-Latham translation. 244pp. 5% x 84. 60068-8 Pa. $8.95 


THE ORIGINS OF THE INFINITESIMAL CALCULUS, Margaret E. Baron. 
Only fully detailed and documented account of crucial discipline: origins; develop- 
ment by Galileo, Kepler, Cavalieri; contributions of Newton, Leibniz, more. 304pp. 
5% x 8'4. (Available in U.S. and Canada only) 65371-4 Pa. $9.95 


THE HISTORY OF THE CALCULUS AND ITS CONCEPTUAL DEVELOP- 
MENT, Carl B. Boyer. Origins in antiquity, medieval contributions, work of Newton, 
Leibniz, rigorous formulation. Treatment is verbal. 346pp. 5% x 8'. 60509-4 Pa. $9.95 


THE THIRTEEN BOOKS OF EUCLID’S ELEMENTS, translated with introduc- 
tion and commentary by Sir Thomas L. Heath. Definitive edition. Textual and lin- 
guistic notes, mathematical analysis. 2,500 years of critical commentary. Not 
abridged. 1,414pp. 5% x 8%. 60088-2, 60089-0, 60090-4 Pa., Three-vol. set $32.85 


GAMES AND DECISIONS: Introduction and Critical Survey, R. Duncan Luce 
and Howard Raiffa. Superb nontechnical introduction to game theory, primarily 
applied to social sciences. Utility theory, zero-sum games, n-person games, decision- 
making, much more. Bibliography. 509pp. 5% x 8'4. 65943-7 Pa. $13.95 


THE HISTORICAL ROOTS OF ELEMENTARY MATHEMATICS, Lucas N.H. 
Bunt, Phillip S. Jones, and Jack D. Bedient. Fundamental underpinnings of modern 
arithmetic, algebra, geometry and number systems derived from ancient civiliza- 
tions. 320pp. 5% x 8's. 25563-8 Pa. $8.95 


CALCULUS REFRESHER FOR TECHNICAL PEOPLE, A. Albert Klaf. Covers 
important aspects of integral and differential calculus via 756 questions. 566 prob- 
lems, most answered. 431pp. 5% x 8’. 20370-0 Pa. $8.95 


CHALLENGING MATHEMATICAL PROBLEMS WITH ELEMENTARY 
SOLUTIONS, A.M. Yaglom and I.M. Yaglom. Over 170 challenging problems on 
probability theory, combinatorial analysis, points and lines, topology, convex poly- 
gons, many other topics. Solutions. Total of 445pp. 5% x 8%. Two-vol. set. 
Vol. I: 65536-9 Pa. $7.95 
Vol. II: 65537-7 Pa. $7.95 


FIFTY CHALLENGING PROBLEMS IN PROBABILITY WITH SOLUTIONS, 
Frederick Mosteller. Remarkable puzzlers, graded in difficulty, illustrate elementary 
and advanced aspects of probability. Detailed solutions. 88pp. 5% x 8’. 

65355-2 Pa. $4.95 


EXPERIMENTS IN TOPOLOGY, Stephen Barr. Classic, lively explanation of one 
of the byways of mathematics. Klein bottles, Moebius strips, projective planes, map 
coloring, problem of the Koenigsberg bridges, much more, described with clarity 
and wit. 43 figures. 210pp. 5% x 8. 25933-1 Pa. $6.95 


RELATIVITY IN ILLUSTRATIONS, Jacob T. Schwartz. Clear nontechnical treat- 
ment makes relativity more accessible than ever before. Over 60 drawings illustrate 
concepts more clearly than text alone. Only high school geometry needed. 
Bibliography. 128pp. 6% x 94. 25965-X Pa. $7.95 


AN INTRODUCTION TO ORDINARY DIFFERENTIAL EQUATIONS, Earl 
A. Coddington. A thorough and systematic first course in elementary differential 
equations for undergraduates in mathematics and science, with many exercises and 
problems (with answers). Index. 304pp. 5% x 8'4. 65942-9 Pa. $8.95 


FOURIER SERIES AND ORTHOGONAL FUNCTIONS, Harry F. Davis. An 
incisive text combining theory and practical example to introduce Fourier series, 
orthogonal functions and applications of the Fourier method to boundary-value 
problems. 570 exercises. Answers and notes. 416pp. 5% x 8%.  65973-9 Pa. $11.95 


AN INTRODUCTION TO ALGEBRAIC STRUCTURES, Joseph Landin. Superb 
self-contained text covers “abstract algebra”: sets and numbers, theory of groups, the- 
ory of rings, much more. Numerous well-chosen examples, exercises. 247pp. 5% x 8'4. 

65940-2 Pa. $8.95 


STARS AND RELATIVITY, Ya. B. Zel’dovich and I. D. Novikov. Vol. 1 of 
Relativistic Astrophysics by famed Russian scientists. General relativity, properties of 
matter under astrophysical conditions, stars and stellar systems. Deep physical 
insights, clear presentation. 1971 edition. References. 544pp. 5% x 87. 

69424-0 Pa. $14.95 


Prices subject to change without notice. 
Available at your book dealer or write for free Mathematics and Science Catalog to Dept. GI, 
Dover Publications, Inc., 31 East 2nd St., Mineola, N.Y. 11501. Dover publishes more than 250 
books each year on science, elementary and advanced mathematics, biology, mus’ 
ture, history, social sciences and other areas. 


(continued from front flap) 


FUNDAMENTAL FORMULAS OF PHYSICS, VOLS. I & II, Donald H. Menzel (ed.). 
(60595-7, 60596-5) $23.90 


OPTICKS, Sir Isaac Newton. (60205-2) $12.95 

INTRODUCTION TO STATISTICAL OPTICS, Edward L. O’Neill. (67328-6) $6.95 

THEORY OF RELATIVITY, W. Pauli. (64152-X) 

INTRODUCTION TO QUANTUM MECHANICS WITH APPLICATIONS TO CHEMISTRY, 
Linus Pauling and E. Bright Wilson, Jr. (64871-0) $12.95 

THE PHILOSOPHY OF SPACE AND TIME, Hans Reichenbach. (60443-8) $8.95 


BOUNDARY AND EIGENVALUE PROBLEMS IN MATHEMATICAL PHYSICS, Hans Sagan. 
(66132-6) $9.95 


STATISTICAL THERMODYNAMICS, Erwin Schrödinger. (66101-6) $5.95 

PRINCIPLES OF ELECTRODYNAMICS, Melvin Schwartz. (65493-1) $10.95 

ELECTROMAGNETIC FIELD, Albert Shadowitz. (65660-8) $18.95 

PARTIAL DIFFERENTIAL EQUATIONS OF MATHEMATICAL PHYSICS, S.L. Sobolev. 
(65964-X) $12.95 

PRINCIPLES OF STATISTICAL MECHANICS, Richard C. Tolman. (63896-0) $15.95 

UNDERSTANDING THERMODYNAMICS, H.C. Van Ness. (63277-6) $5.95 

LIGHT SCATTERING BY SMALL PARTICLES, H.C. van de Hulst. (64228-3) $11.95 

MATHEMATICAL ANALYSIS OF PHYSICAL PROBLEMS, Philip R. Wallace. (64676-9) 


STATISTICAL PHYSICS, Gregory H. Wannier. (65401-X) $12.95 
X-RAY DIFFRACTION, B.E. Warren. (66317-5) $11.95 
SPACE—TIME—MATTER, Hermann Weyl. (60267-2) $9.95 


Paperbound unless otherwise indicated. Prices subject to change with- 
out notice. Available at your book dealer or write for free catalogues to 
Dept. 23, Dover Publications, Inc., 31 East 2nd Street, Mineola, N.Y. 
11501. Please indicate field of interest. Each year Dover publishes over 
200 books on fine art, music, crafts and needlework, antiques, lan- 
guages, literature, children’s books, chess, cookery, nature, anthropol- 
ogy, science, mathematics, and other areas. 

Manufactured in the U.S.A. 


MATHEMATICAL ANALYSIS 
OF PHYSICAL PROBLEMS 


Philip R. Wallace 


Intended for the advanced undergraduate or beginning graduate student, this 
lucid work links classical and modern physics through common techniques and 
concepts and acquaints the reader with a variety of mathematical tools physicists 
use to describe and comprehend the physical universe. 


For the physicist, mathematics is a language, or shorthand, for constructing 
workable models (necessarily approximate and incomplete) of aspects of physical 
reality. The present text, by a noted professor of physics at McGill University, 
Montreal, deals in an exceptionally well-organized way with some of the crucial 
mathematical tools used to construct such models. 


Contents include: 


I. The Vibrating String 

II. Linear Vector Spaces 

III. The Potential Equation 

IV. Fourier and Laplace Transforms and Their Applications 
V. Propagation and Scattering of Waves 

VI. Problems of Diffusion and Attenuation 

VII. Probability and Stochastic Processes 

VIII. Fundamental Principles of Quantum Mechanics 

IX. Some Soluble Problems of Quantum Mechanics 

X. Quantum Mechanics of Many-body Problems 


A special helpful feature of this volume is a Prelude to each chapter, which 
outlines the topics with which the chapter deals. In addition to providing a guide 
to the organization of its contents, it indicates the mathematical background 
assumed and calls attention to those methods and concepts which have an 
application in different physical problems. Relevant test problems are interspersed 
throughout the text to test the student’s grasp of the material, while brief 
bibliographies at the chapter ends suggest further reading. 


Ideal as a primary or supplementary text, Mathematical Analysis of Physical 
Problems will reward any reader seeking a firmer grasp of the mathematical 
procedures by which physicists unlock the secrets of the universe. 


Unabridged and corrected Dover (1984) republication of the work originally 
published by Holt, Rinehart and Winston, Inc., New York, 1972. Preface. 
Bibliographies. Index. 34 figures. Problems (no solutions). xii + 616pp. 5% x 8%. 
Paperbound. 


ISBN 0-486-64676-9 




Cover design by Paul E. Kennedy 


